Book Image

Amazon SimpleDB Developer Guide

Book Image

Amazon SimpleDB Developer Guide

Overview of this book

SimpleDB is a highly scalable, simple-to-use, and inexpensive database in the cloud from Amazon Web Services. But in order to use SimpleDB, you really have to change your mindset. This isn't a traditional relational database; in fact it's not relational at all. For developers who have experience working with relational databases, this may lead to misconceptions as to how SimpleDB works.This practical book aims to address your preconceptions on how SimpleDB will work for you. You will be quickly led through the differences between relational databases and SimpleDB, and the implications of using SimpleDB. Throughout this book, there is an emphasis on demonstrating key concepts with practical examples for Java, PHP, and Python developers.You will be introduced to this massively scalable schema-less key-value data store: what it is, how it works, and why it is such a game-changer. You will then explore the basic functionality offered by SimpleDB including querying, code samples, and a lot more. This book will help you deploy services outside the Amazon cloud and access them from any web host.You will see how SimpleDB gives you the freedom to focus on application development. As you work through this book you will be able to optimize the performance of your applications using parallel operations, caching with memcache, asynchronous operations, and more.
Table of Contents (16 chapters)
Amazon SimpleDB Developer Guide
Credits
Foreword
About the Authors
About the Reviewers
Preface

Serialized SimpleDB requests


In the previous section, we used a batch operation for updating multiple items in a single call. This will work just fine as long as you only need to update up to 25 items at a time. What if you need to update 1000 items, or even 100,000 items? You can certainly use the simple batch operation, but making the requests one after the other serially will seriously degrade your performance and slow your application down. Here is a simple Python script that updates items by making three different calls to SimpleDB, but in a serial fashion, that is one call after another. You can run this script with the time command on Unix/Linux/Mac OS X to get the execution time. This will give us a baseline to look at when we convert this same script into using parallel operations.

Running this through time on my laptop shows the following output:

Updated items for domain 'Domain:songs' to : {'112222222': {'Year': '2010'}, '6767969119': {'Genre': ''}}
Updated items for domain 'Domain...