A company provides a service that helps users from around the world discover new restaurants. The
service has 50 million monthly active users. The company wants to implement a semantic search
solution across a database that contains 20 million restaurants and 200 million reviews. The company
currently stores the data in PostgreSQL.
The solution must support complex natural language queries and return results for at least 95% of
queries within 500 ms. The solution must maintain data freshness for restaurant details that update
hourly. The solution must also scale cost - effectively during peak usage periods.
Which solution will meet these requirements with the LEAST development effort?
Question No 2
A company is using Amazon Bedrock and Anthropic Claude 3 Haiku to develop an AI assistant. The AI
assistant normally processes 10,000 requests each hour but experiences surges of up to 30,000
requests each hour during peak usage periods. The AI assistant must respond within 2 seconds while
operating across multiple AWS Regions.
The company observes that during peak usage periods, the AI assistant experiences throughput
bottlenecks that cause increased latency and occasional request timeouts. The company must
resolve the performance issues.
Which solution will meet this requirement?
Question No 3
A company uses an AI assistant application to summarize the company’s website content and
provide information to customers. The company plans to use Amazon Bedrock to give the application
access to a foundation model (FM).
The company needs to deploy the AI assistant application to a development environment and a
production environment. The solution must integrate the environments with the FM. The company
wants to test the effectiveness of various FMs in each environment. The solution must provide
product owners with the ability to easily switch between FMs for testing purposes in each
environment.
Which solution will meet these requirements?
Question No 4
A company deploys multiple Amazon Bedrock – based generative AI (GenAI) applications across
multiple business units for customer service, content generation, and document analysis. Some
applications show unpredictable token consumption patterns. The company requires a
comprehensive observability solution that provides real - time visibility into token usage patterns
across multiple models. The observability solution must support custom dashboards for multiple
stakeholder groups and provide alerting capabilities for token consumption across all the foundation
models that the company’s applications use.
Which combination of solutions will meet these requirements with the LEAST operational overhead?
(Select TWO.)
Question No 5
An enterprise application uses an Amazon Bedrock foundation model (FM) to process and analyze 50
to 200 pages of technical documents. Users are experiencing inconsistent responses and receiving
truncated outputs when processing documents that exceed the FM's context window limits.
Which solution will resolve this problem?