Big Data introduction - Café Numérique Bruxelles

Data & Analytics

eric-rodriguez
of 36
Description
General introduction to Big Data terms and technologies: Velocity, Volume, Variety (3V) and Veracity (4V), NoSQL, Data Science, main data stores (key-value, column, document, graph), Elasticsearch, ...
Presentation of data.be products leveraging Big Data & Elasticsearch
Text
  • 1.Big Data Introduction
  • 2. About Me Eric Rodriguez Founder of data.be ! • Web entrepreneur • Data addict • Multi-Language: PHP, Java/ Groovy/Grails, .Net, … be.linkedin.com/in/erodriguez ! github.com/wavyx ! @wavyx
  • 3. Big? Data!
  • 4. Big Data is like teenage sex Everyone talks about it Nobody really knows how to do it Everyone thinks everyone else is doing it So everyone claims they are doing it… Quote: Dan Ariely
  • 5. 3V -Volume,Variety,Velocity
  • 6. Source: http://pennystocks.la/internet-in-real-time/
  • 7. Variety of Data
  • 8. • Health & Body sensors • Smart Home • Smart City • Industry applications • Environment
  • 9. Health & Body Sensors Source: http://postscapes.com/internet-of-things-examples/
  • 10. Smart Home Source: http://postscapes.com/internet-of-things-examples/
  • 11. Smart Cities Source: http://postscapes.com/internet-of-things-examples/
  • 12. Industry Source: http://postscapes.com/internet-of-things-examples/
  • 13. Environment Source: http://postscapes.com/internet-of-things-examples/
  • 14. The rise of Data Science
  • 15. NoSQL
  • 16. Big Data Landscape 2.0
  • 17. Keep Calm and Big Data
  • 18. Big DataTools
  • 19. Technologies Source:
  • 20. Not Only SQL Key-Value Column Document Graph
  • 21. real time, search and analytics engine open-source Lucene JSON schema free document
 store RESTful API documentation scalability high availability distributed multi tenancy per-operation
 persistence
  • 22. Elasticsearch core • Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java • Elasticsearch added value: “Simple is best” • Simple API (with documentation) • JSON & RESTful • Sharding & Replication • Extensibility: plugins and scripts • Interoperability: clients and integrations
  • 23. Use Cases • Full-Text Search • Data Store • Analytics • Alerts • Ads • …
  • 24. 4V -Veracity !
  • 25. 4V -Veracity !
  • 26. From Big Data toValue Wisdom! Knowledge! Information! Data!
  • 27. HOW TO FIND RELEVANT COMPANY INFORMATION ?
  • 28. BEFORE...
  • 29. WHY IS IT SO HARD TO FIND COMPREHENSIVE INFORMATION ?
  • 30. AFTER !
  • 31. COMPANY PAGE
  • 32. • VATValidity • Company Information • Geographic Search • EuropeanVAT Check API.DATA.BE
  • 33. PUBLICATION SEARCH
  • 34. Thank you! eric@data.be be.linkedin.com/in/erodriguez @wavyx
  • Comments
    Top