Tuesday, 05/29/2012
Wednesday, 05/30/2012
Chris Harris
Luca Garulli
Living in Rome, Luca became the CEO of NuvolaBase Ltd, the company behind the OrientDB – NoSQL Open Source project . Despite his young age, the 36 year old author of the Roma Meta Framework project quickly became renowned within the European NoSQL scene and also became member of the Sun/Oracle Expert Group for JSR 12 and 243. After travelling the European map and speaking at the big NoSQL events, Luca gave one of the Keynote speeches at NoSQL matters 2012!
NoSQL Adoption – What’s the Next Step?
Today, many companies are already using a NoSQL solution to handle a piece of data. What is the next step? Probably the adoption of NoSQL as a total replacement of Relational DBMS. To reach this Goal, NoSQL solutions have to support multiple models and features, that are already available in Relational DBMS. The challenge has started.
Doug Judd
Doug has worked as a software professional for twenty years, primarily focused on big data problems such as Internet search and scalable database technology. A NoSQL-expert, he is the CEO at Hypertable Inc., a company that provides commercial support for Hypertable, a high performance, massively scalable NoSQL database modeled after Google’s Bigtable. Starting out in the San Francisco Bay Area, Doug has already gained worldwide recognition as conference-speaker and an expert for building valuable, next generation, big data applications. Doug gave a Keynote speech at NoSQL matters 2012.
Scalable NoSQL – Past, Present and Future
Over the past five years, there has been an explosion of new NoSQL database technologies whose primary innovation has been the ability to break capacity barriers by harnessing the collective power and resource of large clusters of server class PCs. In this keynote address, Doug Judd, the original creator of Hypertable, will present a past, present, and future look at scalable NoSQL databases, providing insights into the forces that led to their creation, a review of the current state of the field, and factors that will influence their future evolution.
Pavlo Baron
Pavlo Baron is lead architect with codecentric AG. His passion are distributed systems and large data sets – the infrastructure behind what they call Big Data. Pavlo is frequent conference speaker and has written three German books: “Erlang/OTP”, “Pragmatic IT Architecture” and “Fragile Agile”. At NoSQL matters, he demonstrated some of the theoretical aspects of distributed systems – in a playful way, and hopes for some interesting discussions.
Theoretical Aspects of Distributed Systems, Playfully Illustrated
If you want to know about the principles and pitfalls of building distributed systems, you can either make your way through a whole pile of pretty prosaic theoretical books or you can just come to this talk and understand some of them real fast by watching a little playful demonstration. It‘s your choice.
Dirk Bartels
Dirk Bartels is responsible for Versant’s strategic product management, product marketing and the overall strategic vision of the company. Prior to Versant, Mr. Bartels was co-founder and CEO of Poet Software, one of the pioneers of the object database industry, and served as a Director at the Object Database Management Group (ODMG). Before joining Versant in 2010, Mr. Bartels worked as a strategy consultant for the Private Cloud software company Surgient (acquired by Quest) and a number of technology start up companies. Mr. Bartels received his Master’s of Computer Science degree from the Technische Universität Berlin.
NoSQL. A Technology for Real Time Enterprise Applications?
The session focuses on the evolving value proposition NoSQL-database types such as key-value, document, and graph/ object stores have and the influence of enterprise application architectures and requirements. After a brief introduction about the principles and the evolution of NoSQL, the presentation discusses critical requirements to make NoSQL work for enterprise application development in contrast to web applications. – Evaluating the technological foundation that led to the sucess of NoSQL technologies, – Considering the boundaries and trade-offs of NoSQL compared to typical enterprise requirements, and – Discussing additional capabilities for NoSQL stores for successful data management in real time enterprises. The session addresses project managers, software architects and database developers.
Stefan Edlich
Prof. Dr. Stefan Edlich is a senior lecturer at Beuth HS of Technology Berlin with a focus on Object Databases, NoSQL, Software-Engineering, Mobile Development and E-Learning. He sold his first commercial software in 1986 and has a 27 year development experience. Beside a many scientific papers and journal articles he is a continuous speaker on conferences, it-events concerning enterprise, OO, ODBMS topics since 1993 (like OOP or JAX). Furthermore he is the author of twelve IT books he wrote for Apress, OReilly, Spektrum, Elsevier, Hanser and other publishers. He is a founding member of OODBMS.org e.V.
In 2008 he started the the worlds First International Conference on Object Databases (ICOODB.org) which was continued 2009 at ETH-Zürich and 2010 in Frankfurt. Finally he runs the NoSQL Archiv, organizes NoSQL Events and wrote the worlds first two NoSQL books.
Talk 1: Convince your boss: choose the “right” database
Several hundred databases, a huge amount of persistence requirements and DB buzzwords dizzying around us. Welcome in the age of polyglot persistence. This talk will show ‘real’ database trade offs and well grounded NoSQL arguments. Therefore we present research for an expert system to recommend the best suitable database. We will show categories and questions to be asked in search for the truth. Furthermore we present typical experiences from NoSQL-Consulting in Germany where companies try to reinvent their persistence layer.
view the slides
Talk 2: NewSQL! NoSQL under attack
One of the biggest credits NoSQL deserves is to unveil the weaknesses of classical database systems. But it looks like the classic relational world has listened to these hints: Now they promise to provide scalability with full transactional support while retaining the familiar relational model with SQL access. This new group of systems is now referred to as “NewSQL” and leads to a fruitful competition between the two worlds. This talk summarizes the newest of NoSQL and shows systems that lead the NewSQL space.
view the slides
Peter Idestam-Almquist
Peter is the Chief Technology Officer of Starcounter. Peter has more than 20 years of experience in software development and management of IT companies. In 1996, he founded web technology consultancy company, SITE, and two years later, Peter founded consultancy broker company, Konsultmarknaden. He served as a Chief Technology Officer of both these companies before he joined Starcounter. Peter has a Ph.D. in computer science from Stockholm University, where he also taught as a Professor of Computer Science for ten years.
NewSQL Database for New Real-Time Applications
New real-time applications require databases that can process larger volumes of transactions than what is possible with current market-leading relational databases. And while more recent NoSQL databases are able to scale out so that they can handle extremely large volumes of data, they cannot guarantee consistency. While this is acceptable for some types of data such as social network bulletin boards, it does not work for most business critical data such as stock quantities or money. But what if there was a happy medium? A database that could process millions of transactions on a single machine, while maintaining the consistency that these business critical applications require? There is! In this presentation, Peter will describe a different kind of a NewSQL database, which will process millions of ACID-compliant transactions while scaling in on a single machine to fully ensure consistency. He will also delve into the technical differentiators between traditional relational databases, NoSQL and NewSQL databases and what applications they are best-suited for.
view the slides
Tim Lossen
Tim really enjoys talking about himself in the third person. Not. He lives in Berlin with his girlfriend and two little daughters. Tim is deeply in love with Ruby and recently helped to organize Euruko 2011. Tim works as backend engineer at social gaming startup Wooga. At night he likes to go down to the basement and hack on cool hardware projects like Superglobe or Evercube.
Into The Void – From MySQL to NoSQL to “Nothing”
Wooga creates casual social games, and we approach each new game as a greenfield development project. This gives us the chance to experiment with different approaches and to try out promising new technologies.
In this talk, I will explain how our backend architecture has evolved over time, and how our search for the „perfect backend stack“ has (so far) progressed through three distinct phases: from sharded relational database, over various NoSQL databases, to the question: “Do we really need a database, after all?”
“I will share what worked well, what didn‘t, what lessons we have learned on the way – and which other directions we would like to explore in the future.”
Daniel McGrath
Dan McGrath (@itcmcgrath) has worked in roles from Systems Architect at a financial institute to his current position as a Product Manager at Rocket Software where he works on their Database technology and associated tooling. Dan is a member of the Australian Computer Society and has been involved in IT for over a decade and has a fondness for the problems posed by scalability, availability and security.
Rocket U2 Databases & The MultiValue Model RDBMS & Hadoop
Automatic encryption of data at rest, RESTful web services, JSON support: The Rocket U2 databases (UniData & UniVerse) provide an enterprise NoSQL solution already in use everywhere from emergency systems to major financial institutions. Come learn about this proven, high-performance database technology running in thousands of organizations around the globe.
Martin Scholl
Martin Scholl managed the billing software that was responsible for a TelCo’s 400M EUR business. He now is a founder of infinipool GmbH where he develops NoSQL-class distributed data storage software. Martin Scholl has learned the ups and downs of huge database systems and database-driven software and believes in the disruptive force of data storage systems when they reflect application-level problems and requirements.
NoSQL: Back to the Future or is it Simply yet Another Database Feature?
NoSQL is about to reach the tip of the hype cycle. Therefore, it is time to take a break and thoroughly analyze the NoSQL trend.
This talk will trace NoSQL back to the early days of database approaches and will then reflect on NoSQL‘s current state and future: Is NoSQL as a technological approach to data storage just a feature or is it a unique category of its own? Is NoSQL a fundamentally new thing that will stay as a trend? What are current NoSQL systems lacking? Will there be a technology or trend that could disrupt NoSQL?
Jonathan Ellis
Jonathan is CTO and co-founder at DataStax as well as Project Chair of Apache Cassandra. Prior to his work on Cassandra, Jonathan built a multi-petabyte, scalable storage system based on Reed-Solomon encoding for backup provider Mozy.
Apache Cassandra: Real-World Scalability Today
The Cassandra distributed database has added many new features over the last year based on real-world needs of developers at Twitter, Netflix, Openwave, and others building massively scalable systems. This talk will cover the motivation and use cases behind features such as secondary indexes, Hadoop integration, SQL support, bulk loading, and more.
Chris Harris
Chris Harris is an European Solution Architect at 10gen. Prior to 10gen, Chris was EMEA Architect at SpringSource. Chris specializes in addressing and simplifying complex middleware architectures within development and operational environments. With the acquisition of SpringSource by VMware, Chris focused on how virtualization and cloud computing can be used to address the complexity within the Enterprise. Before joining SpringSource, Chris spent his time at RedHat/JBoss providing consultancy to major clients across EMEA.
Sharding with MongoDB
Doug Judd
Doug has worked as a software professional for twenty years, primarily focused on big data problems such as Internet search and scalable database technology. A NoSQL-expert, he is the CEO at Hypertable Inc., a company that provides commercial support for Hypertable, a high performance, massively scalable NoSQL database modeled after Google’s Bigtable. Starting out in the San Francisco Bay Area, Doug has already gained worldwide recognition as conference-speaker and an expert for building valuable, next generation, big data applications.
Hypertable – The Storage Infrastructure behind One of the World‘s Largest Email Services
Rediff.com India (Nasdaq: REDF) is one of India‘s top Internet portals, providing email, search, news, entertainment and shopping services to India and the global Indian community. With over 100 million registered users, Rediff.com is one of the largest Indian Internet portals and one of the top email providers worldwide. Rediffmail is Rediff.com‘s popular email service and has experienced steady growth ever since it was launched in 1998. Architected for high availability, Rediffmail is a geo-distributed application served out of three separate data centers. By 2011, the request load generated by the application had begun to overwhelm the underlying storage system, causing frequent outages. Metadata updates in connection with inbox management turned out to be the primary culprit, generating over 75 percent of the storage system request volume. In late 2011, the Rediffmail engineering team re-architected the system on top of Hypertable, solving the inbox management problem and eliminating the associated outages.
Jan Lehnardt
Jan Lehnardt has been involved with CouchDB since 2006 as a developer, consultant and evangelist. In late 2009 he co-founded CouchOne (now Couchbase) together with CouchDB inventor Damien Katz and contributor J Chris Anderson. He’s the co-author of O’Reilly’s CouchDB: The Definitive Guide. Jan is a co-organizer of the JSConf-family of conferences, most notably JSConf EU as well as the Buzzwords series of conferences focused on Big Data, Analysis, Search and Storage.
The No-Marketing Bullshit Introduction to Couchbase Server 2.0
Couchbase Server 2.0 is a love child of memcached and Apache CouchDB. This talk introduces the Couchbase architecture, and features it also demonstrates integration challenges as well as our solutions. Couchbase is an Apache 2.0 licensed open-source project, based on the existing memcached, Membase and CouchDB technologies, that aims to solve your data storage problems in a flexible, fast and reliable manner. In this talk, you will learn how it all works.
Couchbase provides clustered storage management, in-memory, disk-based and hybrid working sets, a high-performance key-value store with dynamic query capabilities as well as cross-cluster replication this allows you to mix and match what aspects of CAP your application needs. All which a familiar memcached-compatible APIs and no need for you to adjust your applications.
Mathias Meyer
Mathias (@roidrage) is an infrastructure mad man. The former Basho Developer Advocate is author of a comprehensive, hands-on guide to Riak, the NoSQL Handbook and books about Redis, MongoDB, CouchDB, Riak and Cassandra. Knowing a lot about Ruby on Rails developing, scaling and performance, the Berlin-based NoSQL specialist gave a talk about Riak at NoSQL matters 2012!
Designing for Concurrency with Riak
Riak provides a solid storage foundation to build highly concurrent applications. But its simplistic approach to resolving and detecting concurrent modifications to data leave quite a few people scratching their heads. We’ll look at how you can build applications and data structures to fully utilize Riak‘s strengths while still allowing for eventually consistent data.
Alexandre Morgaut
Alexandre is NantesJS creator, Web Developer, Web Architect, and Community Manager at 4D, he gives presentations about JavaScript, the Server-Side JavaScript, NoSQL, and Wakanda. Alexandre participates in Web Standards mailing lists, makes technical recommendations to anything related to HTML, JavaScript or HTTP, and works to promote JavaScript as a professional language.
Wakanda: NoSQL for Model-Driven Web Applications
Developing a business web application is still a long process in 2012. Model-Driven Development is at the heart of:
- Requirements design for the contractor and the product manager
- Productivity for the developer
- Consistency and security for the end-user
- Evolution toward future applications
The Wakanda platform – with its NoSQL object datastore – intends to let you create such model-driven applications. The presentation will explain and show how to create the application model, with its business and security rules, coded once, then made available everywhere without being “bypassable”. You will see enhancements to the maintenance and debug process, as everything is centralized in the model on the server-side (while maintaining potential pre-validation on the client to decrease the load on the server).
To add even more consistency, the same language is used everywhere: JavaScript.
Konstantin Osipov
Konstantin Osipov had been a software engineer and engineering lead at MySQL since 2003. His main areas in the server were locking, parsing, execution of prepared statements and stored procedures. He was the implementer of support for Dynamic SQL in 5.0. In 2010 Konstantin quit Oracle to join development of a new NoSQL solution: Tarantool. In his other life Konstantin is a Moscow dweller with a not uncommon combination of such hobbies as yoga, running and computers.
Tarntool/Box: Transactional NoSQL Database for Most Volatile Web Data
Tarantool/Box is an open-source, transactional database server. Its key property is high efficiency (hundreds of thousands of requests per second on a commodity server) combined with high level of customization, achievable with Lua stored procedures.
Tarantool utilizes lock-free cooperative multi-tasking environment and asynchronous I/O to achieve minimal overhead per request. Each request, which can be either a simple GET/PUT, classical for NoSQL systems, or a custom data access script in Lua, is run in a multi-version transactional environment.
The key advantage of Tarantool/Box, compared to existing open-source NoSQL solutions, is the ability to execute parts of application business logic automatically on the server, with sufficiently high performance. Sufficiently means that you may never need to shard.
Tarantool is written in C and Objective C. The server code and Drivers are available under terms of simplified BSD license.
In the talk I’ll demonstrate key features of the server using examples in Lua, Ruby and Erlang.
Salvatore Sanfilippo
Salvatore Sanfilippo is the creator and lead developer of the Redis database. Salvatore is also the author of hping, a widely-used network security tool and the inventor of the Idle Scan, one of the Nmap port scanning methods. He wrote the Jim Interpreter, a BSD licensed small footprint Tcl interpreter for embedded systems currently used by different companies. Salvatore’s development efforts are currently supported by VMware. Salvatore is a native of Sicily, where he lives and works.
Welcome to Redis 2.6
Salvatore Sanfilippo will present the new Redis 2.6 released for the first time. The new Lua scripting feature and the other major changes in this release will be explained showing use cases and comparing the solutions with the ones available in Redis 2.4.
Martin Schönert
The Cologne developer Martin Schönert first heard about NoSQL at the age of… well, we can’t tell exactly. But it seems, that he soak up the whole thing with the mother’s milk. Being in business for over 30 years, Martin has worked as project director, developer, product manager, technical manager, in enterprises from Cologne to the US, from startups over IBM to Deutsche Post AG.
ArangoDB
This talk will give an introduction to ArangoDB. ArangoDB is a new open source NoSQL database, whose goal it is to be a powerful universal database. It allows flexible data modelling: as key-value pairs, documents or graph data. It allows complex queries through an elegant query language. An embedded JavaScript interpreter allows you to add functionality to the database for transactions, complex manipulations of the objects, treating AvocadoDB as a complete application server – database stack. ArangoDB guarantees durability through MVCC with append-only journals and zero-administration availability through synchronous replication. It utilizes main memory and SSDs to deliver maximal performance. Even though ArangoDB is still a young project, it has already attracted international attention, especially in Japan and the USA. The community is currently working on APIs for Python, D, Ruby, Java and node.js.
Brian Clark
Brian Clark, Vice President, Product Management, has nearly 40 years of software and technology experience, and was one of the early architects of Objectivity/DB. Before joining Objectivity, Brian worked at Automation Technology Products, providing leading tools in the MCAD market. Prior to that, he was with Project Management Services at International Computers Limited, one of Europe’s leading computer companies at the time. Brian holds a B.S. degree in Computer Science from Sheffield University, England.
Maximize your Data with Realtime Big Data Analytics Using NoSQL and Graph Technologies
Join Brian Clark, the VP of Product Management at Objectivity, in a discussion of the latest trends in Big Data Analytics, defining Big Data and understanding how to maximize your existing architectures by utilizing NoSQL technologies to improve functionality and provide real-time results. There will be a focus on relationship analytics as well as an introduction to NoSQL data stores, object and graph databases, such as the architecture behind Objectivity/DB and InfiniteGraph.
Luca Garulli
Living in Rome, Luca became the CEO of NuvolaBase Ltd, the company behind the OrientDB – NoSQL Open Source project . Despite his young age, the 36 year old author of the Roma Meta Framework project quickly became renowned within the European NoSQL scene and also became member of the Sun/Oracle Expert Group for JSR 12 and 243.
Design your Application Using Persistent Graphs and OrientDB
This talk presents the OrientDB NoSQL Open Source project and its document-graph capabilities. NoSQL products promise big performance and scalability at the cost of many compromises like transactions, an easy query language and constraints. OrientDB offers a flexible model where it can be used in different ways depending on the use case.
This presentation deals with the OrientDB features and some different use cases where it can be applied.
Peter Neubauer
Peter is co-founder of a number of popular Open Source projects such as Neo4j, Tinkerpop, OPS4J and Qi4j. Peter loves connecting things, writing novel prototypes and throwing together new ideas and projects around graphs and society-scale innovation. Right now, Peter is concentrating on turning Open Source projects into profitable enterprises at Neo Technology, the company sponsoring the development of Neo4j, the Graph Database. If you want brainstorming – feed him a latte and you are in business.
Neo4J, Gremlin, Cypher: Graph Processing for Everybody
With property graph databases and NoSQL coming into fashion over the last years, the interest in graph algorithms, graph processing and recommender systems has increased considerably.
In this talk, Peter Neubauer is outlining the differences in approach between imperative scripting of graph traversals with Gremlin and nested iterators over a data structure, and declarative techniques like Cypher, a pattern matching language over Neo4j.
Peter will show basic recommendation algos, spreading activation and min/max flow calculations on live demo graphs. Be aware that things might break, but fun and pun is guaranteed.
Pere Urbon-Bayes
Pere Urbon-Bayes is a Senior Software Engineer with more than 10 years of experience, specialized on delivering graph processing technologies in the area of smart energy data, knowledge management and text analysis at Belectric IT Solutions (ehemals Aperis Gmbh). With special interest on backend development and to provide special solutions to provide meaning to the huge amount of data we’ve right now. When he is not laying around a computer he loves to enjoy the live together with his lovely wife, family and friends.
From Tables to Graph. Recommendation Systems, a Graph Database Use Case Analysis
Recommendation engines have changed a lot during the last years and the last big change is NoSQL, especially Graph Data- bases. With this presentation we intend to show how to build a Graph Processing technology, based on our experience in doing that for environments like Digital Libraries and Movies and Digital Media. First, we will introduce the state of the art on context aware Recommendation Engines, with special interest on how peo- ple are using Graph Processing, NoSQL, systems to scale this kind of solutions. After an introduction to the ecosystem, the next step is to have something to work with. So we will show the audience how to build a Recommendation Engine with a few steps.
The demonstration part will be made using the next technology stack: Sinatra as a simple web framework. Ruby as a programming language. OrientDB, Neo4j, Redis, etc. as a NoSQL technology stack. The result of our demonstration will be a simple engine, accessible through a REST API, to play and extend, so that atten- dants can learn by doing.
In the end our audience will have a full in- troduction to the field of Recommendati- on Engines, with special interest on Graph Processing, NoSQL, systems. Based on our experience making this technology for large scale architectures, we think the best way to learn this is by doing it and having an example to play with.
view the slides
Sebastian Cohnen & Timo Derstappen
Sebastian Cohnen is a freelance ruby & Node.js developer based in Cologne with special interests in NoSQL databases (CouchDB, Riak & Redis in particular). Since late 2010 he is working for Adcloud helping them to further improve and scale the Node.js-based Adserver.
Timo Derstappen is responsible for the technical concept and development of the Node.js based Adserver at Adcloud. In the last years he spent most of the time ramping up a distributed system from a few hunderd to thousands of requests per second.
NoSQL – Not Only a Fairy Tale
Whether IT is CouchDB, Redis, Riak or MongoDB we have investigated, implemented, tested and run several NoSQL databases at Adcloud. Although we are happy with most of our Decisions, we have learned our lessons, which we would like to share. We try to choose the right database for the right job. But sometimes it is hard to define the job.
Chris Harris
Chris Harris is an European Solution Architect at 10gen. Prior to 10gen, Chris was EMEA Architect at SpringSource. Chris specializes in addressing and simplifying complex middleware architectures within development and operational environments. With the acquisition of SpringSource by VMware, Chris focused on how virtualization and cloud computing can be used to address the complexity within the Enterprise. Before joining SpringSource, Chris spent his time at RedHat/JBoss providing consultancy to major clients across EMEA.
Building Hybrid Applications with MongoDB, RDBMS & Hadoop
Business moves fast, very fast. It’s driven by making decision based on facts, but how do you deal with facts that are hours or days old? Periodic data warehouse loads and ETL process no longer scale to meet the increasing business needs in the web economy. Today, modern organizations are combining high performance operational data stores like MongoDB with traditional business intelligence RDBMS solutions to create a hybrid than solves the business need to have fast accurate data for decision making with the ability to interpret and analyze data in every way.
This talk will try to answer the questions
- What are the business benefits using this approach?
- Which problem can now be solved using this approach?
- How can this solution meet the Needs of Business?
- Customer use cases
Axel Morgner
Axel is founder of structr—a free, open-source CMS and Content Management Framework based on the graph database Neo4j. He previously founded inxire GmbH, and served as a consultant and project manager at Oracle. After leaving the enterprise SQL world behind in 2010, he enjoys now being back to cutting egde technology every day. He’s based in Frankfurt/Main, Germany.
Structr – A CMS Implementation based on a Graph Database
This session offers a close look at the implementation and benefits of utilizing a graph database, namely Neo4j, to serve as a CMS back-end. As we will see, the classic CMS ORM approach has several drawbacks, such as the inherent complexity of mapping CMS content to tables, or the inflexibility of relational databases for semi-structured CMS data. Graph databases address these challenges by directly storing CMS data as a graph structure in a natural and trivial way, and allowing to alter the CMS schema at runtime. Along with a live Demonstration the talk will provide a short introduction of general CMS use-cases and requirements, and will cover the specific strategy and data model used to build open-source CMS structr, which can be applied to other cases as well.
Olaf Bachmann
Olaf Bachmann manages Google’s European ads abuse engineering efforts. Besides fighting against ads spam his teams are not only one of the largest users of NoSQL data analysis tools at Google, but also make significant contributions to their further development. Olaf holds a PhD in Computer Science from Kent State University (U.S.A.), lives near Cologne and works out of Zuerich.
NoNoSQL@Google
Relational Database Management Systems were hardly ever used at Google to store logs data. But SQL as a data exploration language has been becoming increasingly popular over the last years. In fact, nowadays the fast majority of data analysis jobs at Google are expressed by SQL queries. In this talk I give a brief history of data storage and exploration techniques deployed at Google and describe the current state of the art. I furthermore explore why SQL became so popular and illustrate my main conclusion: SQL‘s simplicity and conciseness rules over all other (currently available) alternatives.
Ofer Bengal & Yiftach Shoolman
Ofer is a serial entrepreneur who has founded and led companies in the areas of data communications, telecommunications, Internet, homeland security and medical devices. Ofer was founder & CEO of RIT Technologies (NASDAQ: RITT), a provider of sophisticated telecommunications and data communications systems to major world carriers. He began his career as an aerospace engineer in the Israeli Air Force and then founded and ran a consulting firm in the aerospace engineering field and was involved, as a hobby, in inventing, developing and licensing toy concepts. Ofer holds a B.Sc. (Cum Laude) in Aerospace Engineering from the Technion, Israel Institute of Technology.
Yiftach is an experienced technologist, having played leadership engineering and product roles in diverse fields including application acceleration, cloud computing, Software-as-a-Service (SaaS), Broadband Networks and Metro Networks. Yiftach was founder, President & CTO of Crescendo Networks (acquired by F5, NASDAQ:FFIV), VP Software Development at Native Networks (acquired by Alcatel, NASDAQ: ALU) and part of the founding team of ECI Telecom broadband division, where he served as VP Software Engineering. Yiftach holds a B.Sc. in Mathematics and Computer Science and has completed studies for a Masters Degree in Computer Science at Tel-Aviv University.
Taking In-Memory NoSQL to the Next Level
Using Redis and Memcached in their simple form may not be enough to cope with today’s scaling and high-availability challenges. We will describe the state of Memcached and Redis and suggest new, highly-efficient ways of operating them in the cloud. We will present the new concept of an Infinite Memory Pool for Memcached and Redis and show, how simple and hassle-free scaling and failure recovery can be.
Matt Casters
Matt is the l chief architect and lead developer of the popular open source data integration project called Kettle or Pentaho Data Integration. For the last 10 years he has been very busy writing this ETL tool which was open sourced in December 2005 and acquired by Pentaho (Open Source Business Intelligence) in 2006.
Crazy NoSQL Data Integration with Pentaho
With the arrival of a new armada of NoSQL databases chances are increasing constantly that you will have to integrate data from one of them. So in this talk I‘ll be going over crazy data integration between various relational and NoSQL databases.
- Parallel extraction and load of MySQL data into Hadoop
- Real-time update of PostgreSQL tables based on changes in CouchDB
- Extracting JSON information from MongoDB to load it into MySQL
- Populating elasticsearch results with product information
Obviously, I‘ll be giving a short introduction Pentaho Data Integration (Kettle), the graphical programming and design involved. I‘ll also be giving a short overview of the NoSQL landscape so that you can get an idea of what‘s going on in that space.
Vincent Delfosse
Vincent Delfosse is an IT Engineer since 1994. He has held various positions in the industry, in international IT companies or as a freelance, mainly as a system design architect. Since 2005, he is a researcher at the University of Liege in Belgium. He works at the LUCID-ULg and at the Geomatic-ULg. He focuses on architecture and spatial data, with a special interest on the BIM (Building Information Model).
UML as a Scheme Candidate for Graph Databases
During the lifetime of a building, a huge quantity of information will be captured by multiple actors: owners, managers, users, architects, electricians, plumbers, etc. As most of this information is not shared between these persons, the same analysis, verifications or measurements have to be done multiple times. SpatioData is a research project aiming at the development of a collaborative platform to support the effective sharing of such diverse data.
In order to reach this ambitious goal, it is important to provide the users with devices suiting their needs (mobile or station) and interfaces adapted to their specific activities. These various client applications will communicate to a centralized database through WebServices exhibiting a common data model.
The data model of such an application has to address many problems. Amongst those are:
- Complexity of the data model to represent the different aspects of the buildings, their actors, and the activities of the actors.
- Communicability, to ensure that developers willing to build client applications will understand and use the provided model in the same way.
- Extensibility, to provide client-side developers with a mechanism to add their own data schemes in the database.
- Flexibility, required in a research environment, where multiple questions are being co-resolved in parallel.
UML has been chosen for the representation of our data model, as it is a formalized and well-known format in the developers community. But instead of fully developing the data model, then implementing it and providing dedicated services to it, an original approach has been adopted, in the form of UML-oriented WebServices independent of any specific model. This solution has been built on top of a graph database (Neo4J) in a simple but powerful way, providing answers to the given problems above.
This communication will detail the complete system architecture and the process designed on top of it, to make sure thirdparty developers can take full advantage of this platform.
view the slidesview the article
Oliver Gierke
Oliver Gierke is engineer at SpringSource, a division of VMware, project lead of the Spring Data JPA, MongoDB and core module. He has been into developing enterprise applications and open source projects for over 6 years now. His working focus is centered around software architecture, Spring and persistence technologies. He is regularly speaking at German and international conferences as well as author of technology articles.
Data Access 2.0? Please Welcome: Spring Data!
Spring always provided sophisticated support for various Java data access technologies. The lately invented Spring Data project now takes the next step and introduces a consistent programming model for non-relational data stores and helps implementing data access layers in a consistent and easy-to grasp fashion – for both the NoSQL stores as well as more traditional APIs like JPA. The talk introduces the umbrella project, foundational concepts and abstractions and dives down into specialties of particular modules using MongoDB and Neo4J as examples.
Sylvain Lebresne
Sylvain is a committer and PMC member on the open-source Apache Cassandra distributed database working at Datastax. He used to work at Yakaz, a real-time classified ads web service, where he developed the storage infrastructure built on Cassandra. He holds a Ph.D in Computer Science from the University of Paris 7.
The Apache Cassandra Storage Engine
Apache Cassandra is a distributed database built to handle massive amounts of data on large clusters of commodity servers. This talk will present the storage engine at the core of Cassandra, motivating the use of a structure similar to a Log-Structured Merge Tree rather than of a usual B-Tree and its implications for the data model. We will also introduce most of the current features of that engine (secondary indexes, integrated caching, TTL etc.) including recent developments introduced in Cassandra 1.0 like compression/checksumming and the new leveled compaction.
view the slides
Bernd Ocklin
Bernd Ocklin joined the MySQL organization, MySQL Cluster, in September 2005. He first worked as a solution architect for highly available distributed databases and cluster solutions. Now he runs the MySQL Cluster development at Oracle. Bernd has designed and programmed data mining software in addition to writing a database core prior to using MySQL databases. which has provent to be useful in big scale data processing.
MySQL Cluster: Scaling to Billion Database Queries per Minute
MySQL Cluster is the high availability, low latency storage engine with auto-sharding and real-time capabilities, used worldwide to ensure that hundreds of millions of mobile phone users are always reachable.
This mature and proven technology is gaining rapid adoption in the web due to its simple scaling model, performance/and integrated HA that are needed to ensure that web services can evolve rapidly. MySQL Cluster has many attributes that make it ideal for new generations of dynamic, highly scalable applications, including:
- Auto-sharding across commodity hardware for write-scalability
- Cross-data center geographic synchronous and asynchronous replication with eventual consistency
- Online scaling and schema evolution
- SQL and NoSQL interfaces
This session looks at the existing NoSQL access methods for MySQL as well as the latest developments for the MySQL Cluster storage engine. You can get the best of both worlds – persistence, consistency, rich queries, high availability, scalability and simple, flexible APIs and schemas for agile development.
Learn how cluster as a globally distributed database engine is used in mobile phone networks, how web services profit from its scaleable fully event driven architecture.