The Data Authenticity Protocol

We need a data authenticity protocol, stat!

On my journey to become more fluent with statistics and data analysis, I have been reading Sir David Spiegelhalter’s book ‘The Art of Statistics.’ One of my takeaways from the book is that the hardest thing to establish about a dataset is sometimes its source: proving that the data is correct and authentic in origin.

It seems crazy to me that in a tech world where we have blockchain ledgers, PKI, Kerberos, and so on, we still do not have a way of proving the authenticity of a raw dataset.

I hope to tackle this by identifying the problem and providing a straw-man solution. This post will not lead to a fully fledged IETF RFC (yet), but it will pave the way for a more formal proposal at a later date (ETA 2021).