Package to Convert

Amazon Keywords and Categories: The Setup That Gets You Found

Seven keyword slots, up to ten categories. The highest-leverage discovery work — and the line where optimization becomes penalized stuffing.

By Vanessa R. Thomas July 4, 2026 10 MIN READ

An open laptop on a wooden desk showing a book-listing dashboard with keyword fields, surrounded by a spiral notebook with handwritten phrase research and a coffee cup, in warm editorial light — Illustration: The Author's Game

Amazon gives you two levers for discovery and you have to pull both of them right. The first is seven backend keyword boxes, each capped at 50 characters, totaling 350 characters of phrase real estate. The second is three browse categories per format — the shelves your book sits on, recalculated every two hours by sales velocity, each one potentially earning you an orange bestseller badge when you reach the top slot. Fill these fields correctly and Amazon's search and recommendation engine routes your book to buyers around the clock at no ad spend. Fill them poorly and a cover, a description, and months of writing sit in effective silence.

The discovery math makes metadata matter more than almost anything you do after the manuscript is done. In Kindlepreneur's analysis of click behavior, the book ranked first for a given search captures about 27% of all clicks; second gets about 12%; third about 9%; and the top three positions together pull more than 60% of clicks for that term. The sixth position captures about 6%. Ranking third instead of eighth for the right phrase is not a small improvement — it can be the difference between a book that earns out and a book that disappears.

The governing rule for both fields: every keyword phrase must honestly describe what the reader receives, and every category must accurately match the book's content. Amazon's Metadata Guidelines state directly: "We do not tolerate keywords or categories that mislead or manipulate our customers." Violations carry graded consequences — from silent search suppression through listing removal to permanent account termination with royalty forfeiture. Optimize for the reader most likely to love the book, not for the badge most likely to be earned.

What do the seven keyword slots actually do — and what breaks them?

Amazon indexes every word across the seven backend boxes, the title, subtitle, description, and author name into a single composite index. The practical consequence: a word needs to appear only once across all fields to be indexed. Every repeated word is a wasted character in a 350-character budget. This is the rule most new authors get wrong — repeating a keyword across boxes, or repeating a word already in your title, does nothing except burn a slot you could have spent reaching a different reader.

Each box is for one natural phrase a real reader would type into the Kindle Store search bar — five to seven words, in the order a human says them. "Cozy mystery with cats small town" is a keyword. "Mystery cozy cat small-town" is a card catalog entry Amazon buyers do not type. The rules that cause listing suppression or account warnings trace directly to Amazon's published keyword policy:

No commas. Amazon may treat a comma as a phrase separator; words after the comma can be silently ignored. Use spaces.
No quotation marks. Quotes lock a phrase to exact-match only, eliminating all phrase and partial-match coverage — the opposite of what you want.
No Amazon program names. "Kindle Unlimited," "KDP Select," "Prime," and "Kindle" in a keyword box trigger zero-tolerance automated suppression with no warning.
No other authors' names. Piggybacking a competitor's name is a trademark violation Amazon auto-detects for major names.
No subjective or promotional language. "Best," "free," "#1," "new," "on sale," "bestselling" — all prohibited, all flagged.
No words already in your title or subtitle. Your title and subtitle keywords rank roughly 37% better than the same keywords in backend boxes alone (based on Kindlepreneur's 120-book experiment crawling 100,000+ keyword possibilities). Repeating them in backend slots wastes coverage you already have at higher weight.

Phrase match is the mechanism that makes a full slot far more valuable than a sparse one. A phrase like "dystopian YA sci fi pirate romance" indexes simultaneously for YA romance, sci fi romance, dystopian YA, and pirate romance — four distinct searches from one 50-character slot. Amazon also rearranges word order automatically and handles spelling variants, so deliberately misspelling keywords is penalized, not rewarded. The practical discipline: fill every box as close to the 50-character limit as possible. The Kindlepreneur experiment confirmed near-exponential indexing growth as character count approaches the limit — a half-empty box is half the discovery surface.

Amazon's A10 algorithm, layered in 2024 with the COSMO semantic model and expanded through 2025–2026 with the Rufus conversational AI shopping assistant, rewards intent-coverage breadth over keyword-density repetition. COSMO can surface a book whose keywords read "amateur sleuth bakery owner with feline companion" for a search for "cozy mystery cat" — no exact-word overlap required. Rufus, which drove an estimated $12 billion in incremental annualized sales by Q4 2025 and grew 210% year-over-year in interactions, reads keywords, description, reviews, and category signals together to answer natural-language queries. Stuffing — incoherent word lists, identical phrases repeated across boxes — is both a policy violation and an algorithmic liability under the current system.

How do you research which phrases actually have buyers?

You cannot fill seven boxes well by guessing, and AI-generated keyword lists without Amazon-specific search data produce phrases with plausible sound and zero real volume. The research has to come from Amazon's own demand signal. Open Amazon in an incognito window, set the search dropdown to "Kindle Store," and work these three methods in sequence:

Alphabet Soup. Type your seed phrase into the search bar, then append each letter of the alphabet A through Z one at a time. Amazon's autocomplete returns confirmed, high-volume search completions for each letter. Autocomplete order signals relative search volume — phrases appearing first are searched more often. A single seed phrase run through the full alphabet typically yields around 50 genuine search variants. This is free and uses Amazon's own data, which is why it outperforms any list built from intuition.

Reverse ASIN. Publisher Rocket ($199 one-time as of 2025) lets you enter a competitor's ASIN and returns every keyword that book ranks for, along with estimated search volume and its competition score. Competition scores run 0–100, operating like a traffic light: green (0–40) means a new title can realistically reach the top results for that phrase; yellow (41–65) is viable with existing reviews or ad support; red (66–100) is dominated by entrenched titles and should be avoided for a debut. These scores incorporate review counts, book age, BSR of the top five results, and keyword presence in title or subtitle.

The Goldilocks zone. A keyword is worth targeting when page-one competitors carry roughly 50–200 reviews and sit at an overall Kindle BSR between about 20,000 and 100,000. Competitors with thousands of reviews are too entrenched; competitors with no reviews signal a market with no demand. For any sales estimate, always read the overall Kindle BSR, never a subcategory rank — a subcategory position can look strong at BSR 300,000 (a few copies per week) while the keyword itself drives almost no traffic. Run the top two or three results for any candidate phrase through a BSR-to-sales calculator before you commit. Easy-to-rank and worth-ranking are two different tests; a keyword has to pass both.

The slot diversity framework below assigns each box to a different discovery angle, so you maximize coverage without redundancy:

Slot	Job	Example (small-town romance)
1	Core genre + subgenre long-tail phrase	small town contemporary romance series
2	Primary trope or character descriptor	enemies to lovers second chance billionaire
3	Secondary trope or setting phrase	cowboy romance Texas rancher small town
4	Category anchor — reinforces chosen category	clean wholesome small town love story
5	Category anchor or format/audience modifier	sweet romance series HEA guaranteed
6	Comp-title bridge	books like Colleen Hoover small town
7	Broad coverage — reader demographics or gift angle	gifts for women book lovers romance readers

How do you pick three categories without landing on a ghost?

Amazon offers over 16,000 category options across its book catalog. The three you select at publication determine which browse shelves you appear on and which sales velocity earns you the orange bestseller badge — a signal that reportedly lifts click-through rates by 30–50%. The 2023 reform hard-coded the limit at three categories per format (ebook and paperback are separate assignments), and removed the earlier option to request additional categories via KDP Support. Several mechanics determine whether those three slots work for you or waste themselves:

Ghost categories. Approximately 27% of categories selectable in the KDP dashboard are ghost categories: they accept a selection and may display on the product page, but have no browseable storefront, generate no bestseller rank, and deliver zero organic discovery. One March 2026 analysis by Dale L. Roberts identified 958 new ghost or fake categories added to KDP in a single update. Selecting a ghost is not a penalty trigger, but the opportunity cost is total — no badge, no browse traffic, no rank from that slot.

Duplicate categories. Approximately 54% of KDP category strings lead to the same browse page as another string. Selecting two duplicate paths uses two of your three slots for one effective placement.

The depth rule. Always drill to the deepest relevant leaf node. Selecting "Romance" places you among millions of competing titles. Selecting "Romance > Contemporary > Small Town & Rural" gives you a far smaller field and automatically includes you in all parent categories above it — you get the broad and the narrow for the price of one slot. In Kindlepreneur's data, reaching #1 in "Language Experience Approach" requires approximately 15 daily sales (BSR ~10,567), while reaching #1 in "Test Preparation" requires approximately 723 daily sales (BSR ~155) — a 55× difference within the same general subject area. Category choice determines feasibility.

The BSR-to-sales check. Before committing to any category, find the #1 ranked book in it, record its overall ABSR, run it through a free BSR-to-sales calculator, and confirm the required daily sales are achievable at your realistic launch velocity. Most new authors generate 5–20 sales in their first week. Categories where the top slot requires 100+ daily sales are not viable at launch without a substantial existing platform.

The ghost-category test. Open Amazon in an incognito window, set the store dropdown to "Kindle Store," and navigate to your intended category via the sidebar — not the search bar. If you cannot reach a browseable page with real books and a #1 badge slot, the category is a ghost. Replace it with a deeper real one. Run this check before you publish, not after.

Category anchor boxing. Reserve one or two of your seven keyword boxes for phrases Amazon associates internally with your declared categories. If your metadata language does not align with the terminology the algorithm expects for a browse node, Amazon can silently reassign the book post-publication — often to a ghost or irrelevant category. Confirmed keyword-to-category mappings include: "small town clean romance" → Romance > Clean & Wholesome; "cozy mystery female detective" → Mystery > Cozy; "psychological thriller missing wife" → Thriller > Psychological. Publisher Rocket's category keyword list surfaces these mappings systematically. Including one anchor phrase per chosen category is low-cost insurance against drifting off your own shelf.

Where does legitimate optimization become penalized stuffing?

The line is stated in Amazon's Metadata Guidelines and applies identically to keywords and categories: every field must accurately describe what the reader receives, placed in the field where Amazon instructs it. Optimization that serves accurate discovery is permitted and encouraged. Metadata designed to game placement, attract the wrong readers, or misrepresent the book is prohibited — and increasingly detected automatically by systems that were not in place three years ago.

The graded consequence spectrum matters because the most common outcome is invisible. Silent search suppression — the algorithm simply stops serving the book — is the penalty most authors never identify because the book remains "live" in the dashboard. Non-compliant listings can receive up to 40% less traffic than optimized listings, and suppressed listings account for up to 10% of Amazon's inactive products. Terms that trigger the zero-tolerance tier — Amazon program names, trademarked competitor names, promotional claims — cause automated suppression with no warning and no human review. Persistent violations reach account suspension or termination; only about 10–20% of suspension appeals result in reinstatement.

Category stuffing was a common practice under the pre-2023 system, which allowed up to 10 categories per format and where a book selling 15–20 copies per day could reach #1 in obscure niches with no content relevance required. The 2023 reform capped selections at three per format and introduced BISAC-code-aligned AI categorization. Amazon's 2026 algorithm update (A10 combined with COSMO and Rufus) actively penalizes category-badge gaming: it detects cover-versus-category mismatches through image recognition, and it uses conversion rate as a long-term ranking signal. A book placed in the wrong category attracts readers who do not want it, who do not buy it, and whose lack of purchase trains the algorithm to demote the listing — a compounding penalty that no badge compensates for. The "Also Bought" associations that develop from mismatched category traffic also skew the recommendation engine in ways that can take months of corrections to undo.

Metadata that earns organic rank is the same metadata that earns satisfied readers: accurate genre, honest trope signals, real comp-title bridges, and categories the book's actual content belongs in. The seven boxes and three slots are not a marketing stunt to be gamed — they are the mechanism by which Amazon's search engine routes the right buyer to the right book. Built correctly, that mechanism works every hour of every day without a dollar of ad spend. Built to deceive, it works against you from the moment the algorithm detects the mismatch.

Frequently asked

How many keyword slots does Amazon KDP give you, and what are the character limits?

Amazon KDP allocates exactly seven backend keyword boxes per book per format, each capped at 50 characters, for a total budget of 350 characters. Ebook and paperback are treated as separate formats, so you fill seven boxes for each. Each box is meant to hold one natural phrase a real buyer would type into the Kindle Store search bar — not a list of single words, not a comma-separated tag cloud. Amazon indexes every individual word across all seven boxes, the title, subtitle, description, and author name into a single composite index, which means a word needs to appear only once across all fields to be indexed. Repeating a word wastes scarce character budget without providing any additional ranking benefit.

What is the alphabet soup method for finding keywords, and why should you run it in incognito?

Alphabet soup is the most reliable free method for harvesting real buyer search phrases from Amazon's own demand data. Open the Amazon Kindle Store in a private or incognito browser window (so your personal purchase history does not skew autocomplete results), type your seed phrase into the search bar, then append each letter of the alphabet — A through Z — one at a time. Amazon's autocomplete returns confirmed, high-volume search completions for each letter. Autocomplete order signals relative search volume: phrases appearing first are searched more often than those appearing later. A single seed phrase run through the full alphabet typically yields around 50 genuine search variants. These phrases are not estimates — they reflect actual buyer query patterns on Amazon, which is why they outperform any list an AI tool or an author can invent from intuition alone.

What is a ghost category in Amazon KDP, and how do you detect one before publishing?

A ghost category is a category string that appears in the KDP dashboard, accepts your selection, and may even display on your product page — but has no browseable storefront page, generates no bestseller rank, and delivers zero organic discovery traffic. Roughly 27% of KDP-selectable categories are ghosts, and the problem is growing: one March 2026 analysis identified 958 new ghost or fake categories added to KDP in a single update. Selecting a ghost wastes one of your three precious category slots on a shelf no reader will ever browse. The detection method is mechanical: before committing to any category, open Amazon in an incognito window, set the store dropdown to "Kindle Store," and navigate to the category via the sidebar. If you cannot reach a live page with real books and an orange #1 bestseller badge slot, the category is a ghost — do not use it.

Which keywords does Amazon KDP prohibit, and what are the enforcement consequences?

Amazon's keyword help page states a zero-tolerance policy for metadata meant to advertise, promote, or mislead. Prohibited keyword categories include: other authors' names, other books' titles, Amazon program names (Kindle Unlimited, KDP Select, Prime, Kindle), subjective quality claims (best, #1, amazing), time-sensitive language (new, on sale, available now), promotional terms (bestselling, free), intentional misspellings, quotation marks around phrases, HTML in keyword fields, and any term that does not accurately describe the book. Consequences follow a graded spectrum: the most common is silent search suppression, where the book loses visibility without any notification. Non-compliant listings can receive up to 40% less traffic than optimized listings. Terms flagged as zero-tolerance — Amazon program names, trademark violations — trigger automated suppression with no warning or review. Account suspension is reserved for repeated or serious violations; only about 10–20% of suspension appeals result in reinstatement.

What is category anchor boxing, and why does it prevent silent category reassignment?

Category anchor boxing means reserving one or two of your seven keyword boxes for phrases that Amazon internally associates with your declared browse categories. Amazon's algorithm can silently reassign a book post-publication if the backend keywords do not signal alignment with the chosen category — a mismatch between your metadata language and the terminology the algorithm expects for that browse node. Anchor phrases close that gap. Confirmed keyword-to-category mappings from the research corpus include: "small town clean romance" → Romance > Clean & Wholesome; "cozy mystery female detective" → Mystery > Cozy; "psychological thriller missing wife" → Thriller > Psychological. Publisher Rocket's category keyword list surfaces these mappings systematically. Including one category-anchor phrase per chosen category is a low-cost insurance policy against waking up to find your book shelved somewhere it does not belong.

What is the Goldilocks zone for keyword competition, and how does Publisher Rocket measure it?

The Goldilocks zone for a new author is a keyword where page-one competitors carry roughly 50–200 reviews and sit at an overall Kindle BSR between about 20,000 and 100,000. Competitors with thousands of reviews are too entrenched; competitors with no reviews signal a market with no demand. Always read the overall Kindle BSR, never a subcategory rank, because a subcategory position can look strong at BSR 300,000 — a few copies per week — while the keyword drives almost no traffic. Publisher Rocket ($199 one-time as of 2025) quantifies this as a competition score from 0 to 100: green (0–40) means the phrase is realistically rankable for a new title in the top results; yellow (41–65) is viable with existing reviews or ad support; red (66–100) is dominated by entrenched titles and should be avoided for a debut. Pair the competition score with a BSR-to-sales calculator check on the top two or three results — confirm those books are actually selling — before committing to any phrase.

How does Amazon's COSMO and Rufus AI layer change the way keyword phrases should be written?

Amazon layered COSMO (a semantic knowledge-graph model) in 2024 and expanded Rufus (a conversational AI shopping assistant) through 2025–2026. Both systems reward intent-coverage breadth over keyword-density repetition. COSMO can surface a book whose keywords read "amateur sleuth bakery owner with feline companion" in response to a search for "cozy mystery cat" — no exact-word overlap required — because the knowledge graph maps related concepts. Rufus reads keywords, description, reviews, and category signals together to answer natural-language queries like "what's a good fantasy series for fans of Brandon Sanderson." The practical consequence: stuffing identical terms across multiple keyword boxes, or repeating phrases already indexed in the title, is both a policy violation and an algorithmic liability. COSMO-enhanced search relevance produced a 60% improvement in model accuracy (frozen encoders) compared to pre-COSMO baselines. Rufus drove an estimated $12 billion in incremental annualized sales by Q4 2025. Writing for breadth of distinct reader intents — tropes, settings, character types, comp-title bridges — outperforms any density-first approach under the current system.