Secondary indexes for Dynamodb flexibility

Question

Coming from a SQL background, trying to undestand NoSQL particularly DynamoDB options. Given this schema:

{
    "publist": [{
            "Author": "John Scalzi",
            "Title": "Old Man's War",
            "Publisher": "Tor Books",
            "Tags": [
                "DeepSpace",
                "SciFi"
            ]
        },
        {
            "Author": "Ursula Le Guin",
            "Title": "Wizard of Earthsea",
            "Publisher": "Mifflin Harcourt",
            "Tags": [
                "MustRead",
                "Fantasy"
            ]
        },
        {
            "Author": "Cory Doctorow",
            "Title": "Little Brother",
            "Publisher": "Doherty"
        }
    ]
}

I could have the main table have Author/Title as hash/range keys. A global secondary index could be Publisher/Title. What are the best practices here. How can I get a list of all Authors for a publisher without a total table scan? Cant have a secondary index because Publisher/Author is not unique! Also what are my options if I want all the titles that have a tag of DeepSpace?

EDIT: See RPM & Vikdor answers below. GSI need not be unique, so Publisher/Author is possible. But question remains: is there any workaround for getting all authors by tag, without full table scan?

rpmartz rpmartz · Accepted Answer · 2018-04-02T20:00:03

Cant have a secondary index because Publisher/Author is not unique!

Sure you can, just make sure your Publisher/Title index has Author as a projection - you can then do a query by publisher and just iterate over the results and collect the authors.

When you set up your indexes, you can choose which attributes are projected into the index. Having a Publisher or Publisher/Title key doesn't mean you can only view the Publisher or Publisher and Title, it means you can only query by Publisher or Title, so if you have all attributes or the Author attribute projected into your index, you can get a list of authors by publisher using a query and not a full table scan.

Secondary indexes for Dynamodb flexibility

3 Answers