使用DynamoDBMapper扫描DynamoDB项目

最新推荐文章于 2024-05-06 21:22:54 发布

dnc8371

最新推荐文章于 2024-05-06 21:22:54 发布

阅读量274

点赞数

文章标签： java python mysql 数据库大数据

之前，我们介绍了如何使用DynamoDBMapper或底层Java api查询DynamoDB数据库。

除了发出查询之外，DynamoDB还提供扫描功能。
扫描的目的是获取您在DynamoDB表上可能拥有的所有项目。
因此，扫描不需要任何基于我们的分区键或您的全局/本地二级索引的规则。扫描提供的功能是基于已获取的项目进行过滤，并从已获取的项目中返回特定属性。

下面的代码段通过过滤具有较低日期的项目来对“登录名”表进行扫描。

public List<Login> scanLogins(Long date) {

        Map<String, String> attributeNames = new HashMap<String, String>();
        attributeNames.put("#timestamp", "timestamp");

        Map<String, AttributeValue> attributeValues = new HashMap<String, AttributeValue>();
        attributeValues.put(":from", new AttributeValue().withN(date.toString()));

        DynamoDBScanExpression dynamoDBScanExpression = new DynamoDBScanExpression()
                .withFilterExpression("#timestamp < :from")
                .withExpressionAttributeNames(attributeNames)
                .withExpressionAttributeValues(attributeValues);

        List<Login> logins = dynamoDBMapper.scan(Login.class, dynamoDBScanExpression);

        return logins;
    }

DynamoDBMapper的另一个重要功能是并行扫描。并行扫描将扫描任务划分为多个工作程序，每个逻辑段一个。工作人员并行处理数据并返回结果。
通常，扫描请求的性能在很大程度上取决于DynamoDB表中存储的项目数。因此，并行扫描可能会解除扫描请求的某些性能问题，因为您必须处理大量数据。

public List<Login> scanLogins(Long date,Integer workers) {

        Map<String, String> attributeNames = new HashMap<String, String>();
        attributeNames.put("#timestamp", "timestamp");

        Map<String, AttributeValue> attributeValues = new HashMap<String, AttributeValue>();
        attributeValues.put(":from", new AttributeValue().withN(date.toString()));

        DynamoDBScanExpression dynamoDBScanExpression = new DynamoDBScanExpression()
                .withFilterExpression("#timestamp < :from")
                .withExpressionAttributeNames(attributeNames)
                .withExpressionAttributeValues(attributeValues);

        List<Login> logins = dynamoDBMapper.parallelScan(Login.class, dynamoDBScanExpression,workers);

        return logins;
    }

在对我们的应用程序使用扫描之前，我们必须考虑到扫描会获取所有表项。因此，它在费用和性能上都有很高的成本。此外，它可能会消耗您的配置容量。
通常，最好坚持查询并避免扫描。

您可以在github上找到带有单元测试的完整源代码。

翻译自: https://www.javacodegeeks.com/2016/10/scan-dynamodb-items-dynamodbmapper.html

dnc8371

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
使用DynamoDBMapper扫描DynamoDB项目

之前，我们介绍了如何使用DynamoDBMapper或底层Java api查询DynamoDB数据库。除了发出查询之外，DynamoDB还提供扫描功能。扫描的目的是获取您在DynamoDB表上可能拥有的所有项目。因此，扫描不需要任何基于我们的分区键或您的全局/本地二级索引的规则。扫描提供的功能是基于已获取的项目进行过滤，并从已获取的项目中返回特定属性。下面的代码段通过...
复制链接

扫一扫