AIF-C01 study cheatsheet

One-line cues for terms and in-scope AWS services. Unofficial; verify against the current exam guide.

Not from AWS. Use alongside aif-c01-in-scope-services.html and your domain decks. Lists and product names change over time.

Core terms

AI / ML / GenAI
Artificial intelligence (AI)Systems that perform tasks requiring human-like reasoning, perception, or decision support.
Machine learning (ML)Learns patterns from data to predict or decide without explicit step-by-step programming for every case.
Deep learningML using layered neural nets—strong for images, text, speech at scale when data/compute allow.
Supervised learningTraining on labeled input→output pairs (classification, regression).
Unsupervised learningFind structure without labels (clustering, anomaly patterns, dimensionality reduction).
Reinforcement learningAgent learns via rewards/penalties from environment interactions over time.
TrainingFitting model parameters on historical data; typically batch, compute-heavy, offline.
InferenceRunning a trained model on new inputs for predictions or generated output.
Fine-tuningAdapting a pretrained model to a narrower domain/task with extra labeled or instruction data.
Foundation model (FM)Large pretrained model (often transformers) generalizing across many downstream tasks via prompting or adaptation.
Generative AIModels that synthesize new content (text, code, images) vs only classifying or scoring inputs.
TokenChunk of text a model processes/bills; context length = max tokens the model can consider at once.
EmbeddingDense vector representing text/image meaning for similarity search and retrieval.
HallucinationConfident but false or unsupported model output; mitigated by grounding, guardrails, and evaluation.
PromptInstruction + context given to an FM; prompt engineering shapes quality without weight updates.
TemperatureSampling randomness—lower = more deterministic; higher = more creative/varied.
RAGRetrieve relevant documents/chunks, then condition the FM—reduces hallucination and adds freshness.
Vector store / k-NNStores embeddings and returns nearest neighbors for semantic search in RAG pipelines.
AgentFM + tools (API, search, code) that plans steps or calls actions, not single-shot completion.
MLOpsOperational practices for reproducible training, deployment, monitoring, and rollback of ML.
DriftData or concept change over time degrading model quality; needs monitoring and retraining.
Bias (ML)Systematic skew in predictions across groups—address with data, evaluation, and constraints.
ExplainabilityEvidence of why a model produced an output (SHAP, attention, counterfactuals, simplified models).
Metrics (recognition)
AccuracyFraction of correct predictions across all classes.
PrecisionOf positive predictions, how many were true positives—care about false alarms in the positive class.
RecallOf actual positives, how many were found—care about missing positives.
F1Harmonic balance of precision and recall when you need one scalar score.
Confusion matrixCounts of TP/FP/TN/FN—basis for precision/recall and class imbalance analysis.

Architecture patterns (exam stems)

“Which AWS …?” shortcuts
Managed FM APIBedrock (models, Knowledge Bases, Guardrails, Agents).
Custom train + deploy lifecycleSageMaker AI; quick starts: JumpStart.
Enterprise doc Q&A with connectorsKendra vs self-built RAG on OpenSearch/pgvector—match stem to “managed search product” vs DIY vectors.
Vector similarity at scaleOpenSearch k-NN; Aurora/RDS Postgres extensions; Neptune for graph+vector cases.
Speech in / text outTranscribe.
Text in / audio outPolly.
OCR / forms from PDFTextract.
Image labels / moderationRekognition.
Chatbot with intentsLex vs open-ended FM chat—Lex for structured dialog.
RecommendationsPersonalize.
Human in the loopA2I for review when confidence low or policy requires humans.
Datasets & artifactsS3; long-term retention: Glacier.

Responsible AI (pillars → tools)

Fairness, explainability, privacy, robustness
FairnessMeasure disparate impact; Clarify (bias metrics), balanced data, Guardrails content/topic filters.
ExplainabilityClarify explainability; SHAP-style insights; human-readable rationales where feasible.
PrivacyMinimize PII in prompts/training; Macie on S3; KMS encryption; avoid logging secrets.
Robustness / safetyGuardrails (Bedrock), content filters, red-teaming, evaluation harnesses.
TransparencyDocument data lineage, model cards, human review (A2I) for high-stakes decisions.

Security & governance (shared + AWS)

Controls that appear in scenario questions
Least privilegeIAM roles scoped to invoke model, read S3 prefix, use KMS key.
Audit who did whatCloudTrail API history.
Encryption keysKMS CMKs for data at rest; TLS in transit.
Secrets in appsSecrets Manager (rotation) vs hard-coding API keys.
Config driftAWS Config rules on encryption, public access, tags.
Compliance evidenceAudit Manager; Artifact for reports.

In-scope services (exam one-liners)

Analytics
AWS Data ExchangeSubscribe to third-party datasets inside AWS for ML/analytics.
Amazon EMRDistributed Spark/Hadoop for large-scale data prep before ML.
AWS GlueServerless ETL, crawlers, Data Catalog for lake→analytics/ML pipelines.
AWS Glue DataBrewVisual data cleaning/normalization for analysts.
AWS Lake FormationGovernance and fine-grained access on data lake assets.
Amazon OpenSearch ServiceSearch + k-NN/vectors; common RAG retrieval backend.
Amazon QuickSightBI dashboards and metrics—not model training.
Amazon RedshiftCloud data warehouse; SQL analytics and joins for ML features.
Cloud financial management
AWS BudgetsAlerts and limits when spend exceeds plans (incl. AI workloads).
AWS Cost ExplorerSlice and analyze historical AWS costs by dimension.
Compute & containers
Amazon EC2VMs/GPUs for custom training or BYO hosting when managed APIs are not enough.
AWS LambdaEvent-driven glue: invoke Bedrock/SageMaker, lightweight orchestration.
Amazon ECSRun containerized inference/training workers with AWS-native orchestration.
Amazon EKSManaged Kubernetes for portable, complex ML microservices.
Database
Amazon AuroraRelational; often Postgres-compatible patterns incl. vectors with extensions.
Amazon RDSManaged SQL; vector-on-Postgres option for RAG indexes.
Amazon DocumentDBMongo-compatible documents; document + embedding scenarios.
Amazon DynamoDBLow-latency key-value for app state, metadata—not primary semantic search engine.
Amazon ElastiCacheIn-memory cache to cut latency or repeated retrieval cost.
Amazon NeptuneGraph DB; relationship-heavy knowledge + graph/vector patterns.
Developer tools
KiroAWS builder IDE/agent positioning—recognition for “accelerate development with AI.”
Strands AgentsAgent builder tooling on AWS—recognition-level.
Amazon QGenAI assistant in IDE, console, and business workflows.
Machine learning — A–L
Amazon A2IHuman review workflows for low-confidence or regulated predictions.
Amazon BedrockPrimary managed FM layer: models, Knowledge Bases, Guardrails, Agents.
Amazon Bedrock AgentCoreBuilding blocks for production agents on Bedrock.
Amazon ComprehendManaged NLP: entities, sentiment, classification, PII.
Amazon KendraEnterprise intelligent search with connectors—contrast with DIY vector on OpenSearch.
Amazon LexConversational bots (intents/slots) for chat and voice.
Amazon NovaAWS FM family in Bedrock ecosystem—FM choice recognition.
Machine learning — M–Z
Amazon PersonalizeReal-time recommendations for retail/media use cases.
Amazon PollyText-to-speech synthesis.
Amazon RekognitionImage/video analysis: labels, moderation, faces.
Amazon SageMaker AIEnd-to-end custom ML build/train/deploy/monitor.
SageMaker JumpStartPrebuilt models and solution templates in SageMaker.
Amazon TextractExtract printed/handwritten text, forms, tables from documents.
Amazon TranscribeSpeech-to-text for audio/video.
Amazon TranslateNeural machine translation.
AWS TransformModernization toward cloud-ready analytics/AI estates (per guide scope—verify naming).
Management & governance
AWS CloudTrailAPI audit log for governance and investigations.
Amazon CloudWatchMetrics, alarms, logs for operational health.
AWS ConfigResource inventory and rule-based compliance checks.
AWS Trusted AdvisorAutomated best-practice checks (cost, security, limits).
AWS Well-Architected ToolStructured workload reviews vs the Well-Architected pillars.
Networking & CDN
Amazon VPCPrivate networks, subnets, SGs—foundation for private AI endpoints and data paths.
Amazon CloudFrontGlobal CDN for low-latency delivery of apps and static assets.
Security, identity, compliance
AWS IAMUsers, roles, policies controlling access to models and data.
AWS KMSEncryption keys for data and secrets at rest.
AWS Secrets ManagerStore and rotate application secrets (API keys, DB creds).
Amazon MacieDiscover sensitive data (e.g. PII) in S3 before training or indexing.
Amazon InspectorVulnerability scanning for EC2/ECR workloads (recognition).
AWS ArtifactDownload compliance reports and agreements.
AWS Audit ManagerContinuous control evidence mapped to frameworks.
Storage
Amazon S3Object store for datasets, features, model artifacts, logs.
Amazon S3 GlacierLow-cost archival for long-retention model or data history.

End of cheatsheet. Print this page from your browser if you want a PDF.