Skip to main content

TSXor: A Simple Time Series Compression Algorithm

  • Conference paper
  • First Online:
String Processing and Information Retrieval (SPIRE 2021)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12944))

Included in the following conference series:

Abstract

Time series are ubiquitous in computing as a key ingredient of many machine learning analytics, ranging from classification to forecasting. Typically, the training of such machine learning algorithms on time series requires to access the data in temporal order for several times. Therefore, a compression algorithm providing good compression ratios and fast decompression speed is desirable. In this paper, we present TSXor, a simple yet effective lossless compressor for time series. The main idea is to exploit the redundancy/similarity between close-in-time values through a window that acts as a cache, as to improve the compression ratio and decompression speed. We show that TSXor achieves up to \(3{\times }\) better compression and up to \(2{\times }\) faster decompression than the state of the art on real-world datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Vashi, S., Ram, J., Modi, J., Verma, S., Prakash, C.: Internet of Things (IoT): a vision, architectural elements, and security issues. pp. 492–496, February 2017. https://doi.org/10.1109/I-SMAC.2017.8058399

  2. Pelkonen, T., et al.: Gorilla: a fast, scalable, in-memory time series database. Proc. VLDB Endow. 8(12), 1816–1827 (2015). https://doi.org/10.14778/2824032.2824078. ISSN 2150-8097

    Article  Google Scholar 

  3. Burtscher, M., Ratanaworabhan, P.: FPC: a high-speed compressor for double-precision floating-point data. IEEE Trans. Comput. 58(1), 18–31 (2009). https://doi.org/10.1109/TC.2008.131. ISSN 0018-9340

    Article  MathSciNet  MATH  Google Scholar 

  4. Hamilton, J.D.: Time Series Analysis. Princeton University Press, Princeton (2020)

    Book  Google Scholar 

  5. Sazeides, Y., Smith, J.E.: The predictability of data values. In: Proceedings of 30th Annual International Symposium on Microarchitecture, pp. 248–258, December 1997. https://doi.org/10.1109/MICRO.1997.645815. ISSN 1072-4451

  6. Goeman, B., Vandierendonck, H., de Bosschere, K.: Differential FCM: increasing value prediction accuracy by improving table usage efficiency. In Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture, pp. 207–216, January 2001. https://doi.org/10.1109/HPCA.2001.903264

  7. Makonin, S., Ellert, B., Bajić, I.V., Popowich, F.: Electricity, water, and natural gas consumption of a residential house in Canada from 2012 to 2014. Sci. Data 3(1), 1–12 (2016)

    Article  Google Scholar 

  8. Fonollosa, J., Sheik, S., Huerta, R., Marco, S.: Reservoir computing compensates slow response of chemosensor arrays exposed to fast varying gas concentrations in continuous monitoring. Sens. Actuators B Chem. 215, 618–629 (2015)

    Article  Google Scholar 

  9. Max Planck Institute for Biogeochemistry. Max-Planck-Institut fuer Biogeochemie - Wetterdaten (2019). https://www.bgc-jena.mpg.de/wetter/

  10. Fothergill, S., Mentis, H., Kohli, P., Nowozin, S.: Instructing people for training gestural interactive systems. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI 2012, New York, NY, USA, pp. 1737–1746 (2012). Association for Computing Machinery. https://doi.org/10.1145/2207676.2208303. ISBN 9781450310154

  11. Heber, G., Lunde, A., Shephard, N., Sheppard, K.: Oxford-Man Institute’s Realized Library (2009)

    Google Scholar 

  12. Reiss, A., Stricker, D.: Introducing a new benchmarked dataset for activity monitoring. In: 2012 16th International Symposium on Wearable Computers, pp. 108–109. IEEE (2012)

    Google Scholar 

Download references

Acknowledgements

This work was partially supported by the projects: MobiDataLab (EU H2020 RIA, grant agreement N\(^{\underline{\mathrm{o}}}\)101006879), OK-INSAID (MIUR-PON 2018, grant agreement N\(^{\underline{\mathrm{o}}}\)ARS01_00917), and “Algorithms, Data Structures and Combinatorics for Machine Learning” (MIUR-PRIN 2017).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giulio Ermanno Pibiri .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bruno, A., Nardini, F.M., Pibiri, G.E., Trani, R., Venturini, R. (2021). TSXor: A Simple Time Series Compression Algorithm. In: Lecroq, T., Touzet, H. (eds) String Processing and Information Retrieval. SPIRE 2021. Lecture Notes in Computer Science(), vol 12944. Springer, Cham. https://doi.org/10.1007/978-3-030-86692-1_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-86692-1_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-86691-4

  • Online ISBN: 978-3-030-86692-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics