QuickCppLib 0.10
Eliminate all the tedious hassle when making state-of-the-art C++ 14 - 23 libraries!
Loading...
Searching...
No Matches
quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize > Class Template Reference

Calculates the single error correcting double error detecting (SECDED) Hamming Error Correcting Code for a blocksize block of bytes. For example, a secdec_ecc<8> would be the very common 72,64 Hamming code used in ECC RAM, or secdec_ecc<4096> would be for a 32784,32768 Hamming code. More...

#include "secded_ecc.hpp"

Public Types

enum  verify_status { corrupt = 0 , okay = 1 , healed = 2 }
 The outcomes from verify() More...
 
typedef unsigned int result_type
 The largest ECC which can be calculated.
 

Public Member Functions

constexpr secded_ecc ()
 Constructs an instance, configuring the necessary lookup tables.
 
constexpr result_type result_bits_valid () const noexcept
 The number of bits valid in result_type.
 
result_type operator() (result_type ecc, const char *buffer) const noexcept
 Accumulate ECC from fixed size buffer.
 
result_type operator() (result_type ecc, const char *buffer, size_t length) const noexcept
 Accumulate ECC from partial buffer where length <= blocksize.
 
result_type operator() (const char *buffer) const noexcept
 
result_type operator() (const char *buffer, size_t length) const noexcept
 
result_type find_bad_bit (result_type good_ecc, result_type bad_ecc) const noexcept
 
verify_status verify (char *buffer, result_type good_ecc) const noexcept
 Verifies and heals when possible a buffer, returning non zero if the buffer is error free.
 

Detailed Description

template<size_t blocksize>
class quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >

Calculates the single error correcting double error detecting (SECDED) Hamming Error Correcting Code for a blocksize block of bytes. For example, a secdec_ecc<8> would be the very common 72,64 Hamming code used in ECC RAM, or secdec_ecc<4096> would be for a 32784,32768 Hamming code.

Warning
This class needs to be completely reimplemented using SSE and AVX. Its current implementation works fine, but performance is far below what it can be.

Did you know that some non-ECC RAM systems can see 1e-12 flips/bit/hour, which is 3.3 bits flipped in a 16Gb RAM system per 24 hours). See Schroeder, Pinheiro and Weber (2009) 'DRAM Errors in the Wild: A Large-Scale Field Study'.

After construction during which lookup tables are built, no state is modified and therefore this class is safe for static storage (indeed if C++ 14 is available, the constructor is constexpr). The maximum number of bits in a code is a good four billion, I did try limiting it to 65536 for performance but it wasn't worth it, and one might want > 8Kb blocks maybe. As with all SECDED ECC, undefined behaviour occurs when more than two bits of error are present or the ECC supplied is incorrect. You should combine this SECDED with a robust hash which can tell you definitively if a buffer is error free or not rather than relying on this to correctly do so.

The main intended use case for this routine is calculating the ECC on data being written to disc, and hence that is where performance has been maximised. It is not expected that this routine will be frequently called on data being read from disc i.e. only when its hash doesn't match its contents which should be very rare, and then a single bit heal using this routine is attempted before trying again with the hash. Care was taken that really enormous SECDEDs are fast, in fact tuning was mostly done for the 32784,32768 code which can heal one bad bit per 4Kb page as the main thing we have in mind is achieving reliable filing system code on computers without ECC RAM and in which sustained large quantities of random disc i/o produce a worrying number of flipped bits in a 24 hour period (anywhere between 0 and 3 on my hardware here, average is about 0.8).

Performance notes of this current implementation:

For a Skylake CPU:

  • MSVC Fixed buffer: 125.719 Mb/sec, or 23.48 cycles/byte
  • MSVC Variable buffer: 144.167 Mb/sec, or 20.48 cycles/byte
  • MSVC heal: 80.4498 Mb/sec
  • GCC7 Fixed buffer: 201.509 Mb/sec, or 14.65 cycles/byte
  • GCC7 Variable buffer: 317.97 Mb/sec, or 9.28 cycles/byte
  • GCC7 heal: 126.099 Mb/sec
  • cla6 Fixed buffer: 276.962 Mb/sec, or 10.66 cycles/byte
  • cla6 Variable buffer: 210.248 Mb/sec, or 14.04 cycles/byte
  • cla6 heal: 171.174 Mb/sec

Note that better than 1Gb/sec is easily possible if I rewrite the implementation.

Complexity\nO(N) where N is the blocksize
Errors returnable\nThrows constexpr exceptions in constructor only, otherwise entirely noexcept.

Member Typedef Documentation

◆ result_type

template<size_t blocksize>
typedef unsigned int quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::result_type

The largest ECC which can be calculated.

Member Enumeration Documentation

◆ verify_status

The outcomes from verify()

Enumerator
corrupt 

The buffer had more than a single bit corrupted or the ECC was invalid.

okay 

The buffer had no errors.

healed 

The buffer was healed.

387 {
388 corrupt = 0, //!< The buffer had more than a single bit corrupted or the ECC was invalid
389 okay = 1, //!< The buffer had no errors
390 healed = 2 //!< The buffer was healed
391 };
@ corrupt
The buffer had more than a single bit corrupted or the ECC was invalid.
Definition secded_ecc.hpp:388
@ healed
The buffer was healed.
Definition secded_ecc.hpp:390
@ okay
The buffer had no errors.
Definition secded_ecc.hpp:389

Constructor & Destructor Documentation

◆ secded_ecc()

template<size_t blocksize>
constexpr quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::secded_ecc ( )
inlineconstexpr

Constructs an instance, configuring the necessary lookup tables.

158 {
159 for(size_t n = 0; n < sizeof(result_type) * bits_per_byte; n++)
160 ecc_twospowers[n] = ((result_type) 1 << n);
161 result_type length = blocksize * bits_per_byte;
162 // This is (data bits + parity bits + 1) <= 2^(parity bits)
163 for(result_type p = 1; p < sizeof(result_type) * bits_per_byte; p++)
164 if((length + p + 1) <= ecc_twospowers[p])
165 {
166 bitsvalid = p;
167 break;
168 }
169 if((bits_per_byte - 1 + bitsvalid) / bits_per_byte > sizeof(result_type))
170#ifdef __cpp_exceptions
171 throw std::runtime_error("secdec_ecc: ECC would exceed the size of result_type!");
172#else
173 abort();
174#endif
175 for(result_type i = 0; i < blocksize * bits_per_byte; i++)
176 {
177 // Make a code bit
178 result_type b = i + 1;
179#if QUICKCPPLIB_SECDED_INTRINSICS && 0 // let constexpr do its thing
180#ifdef _MSC_VER
181 unsigned long _topbit;
182 _BitScanReverse(&_topbit, b);
183 result_type topbit = _topbit;
184#else
185 result_type topbit = bits_per_byte * sizeof(result_type) - __builtin_clz(b);
186#endif
187 b += topbit;
188 if(b >= ecc_twospowers[topbit])
189 b++;
190// while(b>ecc_twospowers(_topbit+1)) _topbit++;
191// b+=_topbit;
192// if(b>=ecc_twospowers(_topbit)) b++;
193#else
194 for(size_t p = 0; ecc_twospowers[p] < (b + 1); p++)
195 b++;
196#endif
197 ecc_table[i] = (unsigned short) b;
198 if(b > (unsigned short) -1)
199#ifdef __cpp_exceptions
200 throw std::runtime_error("secdec_ecc: Precalculated table has exceeded its bounds");
201#else
202 abort();
203#endif
204 }
205 }
unsigned int result_type
The largest ECC which can be calculated.
Definition secded_ecc.hpp:106

Member Function Documentation

◆ result_bits_valid()

template<size_t blocksize>
constexpr result_type quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::result_bits_valid ( ) const
inlineconstexprnoexcept

The number of bits valid in result_type.

207{ return bitsvalid; }

◆ operator()() [1/4]

template<size_t blocksize>
result_type quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::operator() ( result_type  ecc,
const char *  buffer 
) const
inlinenoexcept

Accumulate ECC from fixed size buffer.

211 {
212#ifdef _MSC_VER
213#pragma warning(push)
214#pragma warning(disable : 4127) // conditional expression is constant
215#endif
216 if(blocksize < sizeof(unit_type) * 8)
217 return (*this)(ecc, buffer, blocksize);
218#ifdef _MSC_VER
219#pragma warning(pop)
220#endif
221 // Process in lumps of eight
222 const unit_type *_buffer = (const unit_type *) buffer;
223 // #pragma omp parallel for reduction(^:ecc)
224 for(size_t i = 0; i < blocksize; i += sizeof(unit_type) * 8)
225 {
226 union
227 {
228 unsigned long long v;
229 unit_type c[8];
230 };
231 result_type prefetch[8];
232 v = *(unsigned long long *) (&_buffer[0 + i / sizeof(unit_type)]); // min 1 cycle
233#define QUICKCPPLIB_SECDED_ROUND(n) \
234 prefetch[0] = ecc_table[(i + 0) * 8 + n]; \
235 prefetch[1] = ecc_table[(i + 1) * 8 + n]; \
236 prefetch[2] = ecc_table[(i + 2) * 8 + n]; \
237 prefetch[3] = ecc_table[(i + 3) * 8 + n]; \
238 prefetch[4] = ecc_table[(i + 4) * 8 + n]; \
239 prefetch[5] = ecc_table[(i + 5) * 8 + n]; \
240 prefetch[6] = ecc_table[(i + 6) * 8 + n]; \
241 prefetch[7] = ecc_table[(i + 7) * 8 + n]; \
242 if(c[0] & ((unit_type) 1 << n)) \
243 ecc ^= prefetch[0]; \
244 if(c[1] & ((unit_type) 1 << n)) \
245 ecc ^= prefetch[1]; \
246 if(c[2] & ((unit_type) 1 << n)) \
247 ecc ^= prefetch[2]; \
248 if(c[3] & ((unit_type) 1 << n)) \
249 ecc ^= prefetch[3]; \
250 if(c[4] & ((unit_type) 1 << n)) \
251 ecc ^= prefetch[4]; \
252 if(c[5] & ((unit_type) 1 << n)) \
253 ecc ^= prefetch[5]; \
254 if(c[6] & ((unit_type) 1 << n)) \
255 ecc ^= prefetch[6]; \
256 if(c[7] & ((unit_type) 1 << n)) \
257 ecc ^= prefetch[7];
258 QUICKCPPLIB_SECDED_ROUND(0) // prefetch = min 8, bit test and xor = min 16, total = 24
266#undef QUICKCPPLIB_SECDED_ROUND // total should be 1+(8*24/3)=65
267 }
268 return ecc;
269 }
constexpr Combined c
Definition test_optional.cpp:1460
#define QUICKCPPLIB_SECDED_ROUND(n)

◆ operator()() [2/4]

template<size_t blocksize>
result_type quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::operator() ( result_type  ecc,
const char *  buffer,
size_t  length 
) const
inlinenoexcept

Accumulate ECC from partial buffer where length <= blocksize.

272 {
273 const unit_type *_buffer = (const unit_type *) buffer;
274 // #pragma omp parallel for reduction(^:ecc)
275 for(size_t i = 0; i < length; i += sizeof(unit_type))
276 {
277 unit_type c = _buffer[i / sizeof(unit_type)]; // min 1 cycle
278 if(!c) // min 1 cycle
279 continue;
280 char bitset[bits_per_byte * sizeof(unit_type)];
281 result_type prefetch[bits_per_byte * sizeof(unit_type)];
282 // Most compilers will roll this out
283 for(size_t n = 0; n < bits_per_byte * sizeof(unit_type); n++) // min 16 cycles
284 {
285 bitset[n] = !!(c & ((unit_type) 1 << n));
286 prefetch[n] = ecc_table[i * bits_per_byte + n]; // min 8 cycles
287 }
288 result_type localecc = 0;
289 for(size_t n = 0; n < bits_per_byte * sizeof(unit_type); n++)
290 {
291 if(bitset[n]) // min 8 cycles
292 localecc ^= prefetch[n]; // min 8 cycles
293 }
294 ecc ^= localecc; // min 1 cycle. Total cycles = min 43 cycles/byte
295 }
296 return ecc;
297 }

◆ operator()() [3/4]

template<size_t blocksize>
result_type quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::operator() ( const char *  buffer) const
inlinenoexcept
366{ return (*this)(0, buffer); }

◆ operator()() [4/4]

template<size_t blocksize>
result_type quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::operator() ( const char *  buffer,
size_t  length 
) const
inlinenoexcept
367{ return (*this)(0, buffer, length); }

◆ find_bad_bit()

template<size_t blocksize>
result_type quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::find_bad_bit ( result_type  good_ecc,
result_type  bad_ecc 
) const
inlinenoexcept

Given the original ECC and the new ECC for a buffer, find the bad bit. Return (result_type)-1 if not found (e.g. ECC corrupt)

371 {
372 result_type length = blocksize * bits_per_byte, eccdiff = good_ecc ^ bad_ecc;
373 if(_is_single_bit_set(eccdiff))
374 return (result_type) -1;
375 for(result_type i = 0, b = 1; i < length; i++, b++)
376 {
377 // Skip parity bits
378 while(_is_single_bit_set(b))
379 b++;
380 if(b == eccdiff)
381 return i;
382 }
383 return (result_type) -1;
384 }

◆ verify()

template<size_t blocksize>
verify_status quickcpplib::_xxx::algorithm::secded_ecc::secded_ecc< blocksize >::verify ( char *  buffer,
result_type  good_ecc 
) const
inlinenoexcept

Verifies and heals when possible a buffer, returning non zero if the buffer is error free.

394 {
395 result_type this_ecc = (*this)(0, buffer);
396 if(this_ecc == good_ecc)
397 return verify_status::okay; // no errors
398 result_type badbit = find_bad_bit(good_ecc, this_ecc);
399 if((result_type) -1 == badbit)
400 return verify_status::corrupt; // parity corrupt?
401 buffer[badbit / bits_per_byte] ^= (unsigned char) ecc_twospowers[badbit % bits_per_byte];
402 this_ecc = (*this)(0, buffer);
403 if(this_ecc == good_ecc)
404 return healed; // error healed
405 // Put the bit back
406 buffer[badbit / bits_per_byte] ^= (unsigned char) ecc_twospowers[badbit % bits_per_byte];
407 return verify_status::corrupt; // more than one bit was corrupt
408 }
result_type find_bad_bit(result_type good_ecc, result_type bad_ecc) const noexcept
Definition secded_ecc.hpp:370

The documentation for this class was generated from the following file: