Decode NEET PG mock test analytics like a topper: subject accuracy, time per question, topic weakness, percentile vs marks, confidence calibration, and the last-month interpretation playbook.

Version 1.0 — Published May 2026
A typical NEET PG mock test yields approximately 80-120 wrong or skipped answers across 200 questions. Reading the analytics layer-by-layer surfaces 8-12 actionable weak spots that, if addressed in the next 7-10 days, can move your score by 15-30 marks. Decoding one mock fully is worth three mocks half-decoded. Use this 12-lens reading protocol:
The marginal value of an additional mock test drops sharply after 6-8 full-length mocks. Beyond that point, the test-taking experience itself adds little — you already know the timing, the format, the fatigue curve. What adds value is the diagnostic information from the mocks you have already taken. A typical 200-question mock yields ~80-120 wrong or skipped answers; reading the analytics layer-by-layer surfaces 8-12 actionable weak spots that, if addressed in 7-10 days, move your score by 15-30 marks.
Re-taking another mock without first decoding the previous one is the single most common mistake in the last 60 days — it consumes 3.5 hours of solving plus 2-3 hours of review without revealing new weak spots because you have not closed the loop on the previous mock.
Compounding rule: 1 mock fully decoded = 3 mocks half-decoded for marks-per-hour return.
Start practicing NEET PG MCQs with AI-powered explanations.
Start Free PracticeMaster GI secretions, digestion, absorption transporters, motility patterns, and gut hormones with high-yield NEET PG 2026 traps and India-context examples.
Master labor stages, Friedman vs Zhang curves, WHO partograph, AMTSL, episiotomy and India JSY/LaQshya policies for NEET PG 2026 OBG MCQs.
5 anterior segment ophthalmology image MCQs for NEET PG: hypopyon and Behcet, Kayser-Fleischer ring in Wilson, Brushfield spots in Down, corneal arcus, and pterygium vs pinguecula.
Daily MCQs, study tips, and topper strategies on Telegram.
Join on Telegram →The first chart most platforms show is a bar chart of accuracy by subject. Sort subjects from lowest to highest accuracy. Identify the 3-4 weakest subjects.
| Subject | Mock 1 | Mock 2 | Mock 3 | Trend |
|---|---|---|---|---|
| Anatomy | 58 percent | 62 percent | 66 percent | Rising |
| Physiology | 71 percent | 73 percent | 75 percent | Rising slow |
| Pathology | 68 percent | 64 percent | 60 percent | Falling — red flag |
| Pharmacology | 72 percent | 75 percent | 78 percent | Rising |
| Medicine | 55 percent | 56 percent | 55 percent | Flat — red flag |
| Surgery | 64 percent | 68 percent | 72 percent | Rising |
| OBG | 70 percent | 72 percent | 74 percent | Rising slow |
| Pediatrics | 76 percent | 78 percent | 80 percent | Rising |
| PSM | 82 percent | 84 percent | 85 percent | Strong |
| Forensic Medicine | 65 percent | 70 percent | 75 percent | Rising fast |
Interpretation:
Limitation: Subject-wise accuracy is too coarse to drive remediation. Telling yourself "study more Medicine" wastes hours. Drop to topic-level (Lens 3).
Mock tests give you ~54 seconds per question (200 questions in 180 minutes = 54 sec/Q average). Subjects with heavy diagrams and long stems (Surgery operative cases, Pediatrics neonatology, OBG management) consume more time; quick-recall subjects (PSM, Pharmacology, Forensic Medicine) consume less.
| Subject | Optimal sec/Q | Actual sec/Q | Status |
|---|---|---|---|
| Anatomy | 50 | 65 | Slow — concept gap |
| Physiology | 50 | 48 | Optimal |
| Pathology | 55 | 75 | Very slow — second-guessing |
| Pharmacology | 40 | 35 | Optimal |
| Medicine | 60 | 90 | Very slow — confidence gap |
| Surgery | 55 | 62 | Slightly slow |
| OBG | 55 | 50 | Optimal |
| Pediatrics | 55 | 55 | Optimal |
| PSM | 40 | 30 | Too fast — rushed |
| Forensic Medicine | 35 | 35 | Optimal |
Two red flags:
Action: target slow subjects with concept revision; target rushed subjects with accuracy review (often the rushed subjects have hidden accuracy drop of 5-10 percent that the headline number masks).
This is where decoding pays off. Drop from subject to topic. NEETPGAI displays a colour-coded heatmap.
| Subject | Strong topic cells (over 75 percent) | Borderline (50-75 percent) | Weak (under 50 percent) |
|---|---|---|---|
| Pathology | Cardiac, Endocrine | Haematology, GI | Renal, Liver, Neoplasia |
| Medicine | Cardiology, Endocrinology | GI, Respiratory | Nephrology, Haematology, Rheumatology |
| Surgery | General, Vascular | Urology, Plastics | Hepatobiliary, Paediatric surgery |
| OBG | Antenatal, Labour | Gynaec endocrine | Infertility, Gynaec oncology |
Interpretation:
NEETPGAI auto-clusters cells into red (under 40 percent), amber (40-60 percent), and green (over 60 percent) and ranks them by total mark contribution so you can prioritise the highest-yield weak spots first.
Modern NEET PG papers include 4-5 question types. Track accuracy by type.
| Question type | Accuracy | Comment |
|---|---|---|
| Single best answer (SBA) — straightforward recall | 72 percent | Mainstay; should be highest |
| Clinical vignette case-based | 68 percent | Tests application; revise common cases |
| Assertion-Reason (A-R) | 58 percent | Format weakness — needs targeted practice |
| Image-based MCQ | 60 percent | Format weakness — drill image PYQs |
| Match-the-following / sequence | 70 percent | Acceptable |
| NOT / EXCEPT (negative stem) | 55 percent | Comprehension trap — practice underlining NOT |
Action: the format with the lowest accuracy is often a learned bias not a knowledge gap. 30-50 targeted practice questions in that format restores 5-10 percent accuracy.
Some students answer a question, mark it for review, and change the answer in the second pass — sometimes for the better, sometimes for the worse.
| Pattern | First-attempt correct | After review | Net |
|---|---|---|---|
| Changed wrong to right | 22 | After review: 35 (gained 13) | +13 |
| Changed right to wrong | 18 (after review: 9) | Lost 9 | -9 |
| Net review benefit | +4 | Slight positive |
Interpretation:
Heuristic: "Never change your first answer unless you have positive new evidence (e.g., a recall trigger from another question on the paper). Doubt alone is not enough."
Raw marks vary by mock difficulty. Percentile normalises.
| Mock | Marks/720 | Percentile vs platform cohort | All-India rank projection |
|---|---|---|---|
| Mock 1 (early) | 480 | 62nd | ~25,000 |
| Mock 2 | 510 | 68th | ~18,000 |
| Mock 3 | 525 | 73rd | ~12,000 |
| Mock 4 | 540 | 78th | ~8,500 |
| Mock 5 | 555 | 82nd | ~6,000 |
| Mock 6 | 565 | 85th | ~4,500 |
| Mock 7 (target) | 580 | 88th | ~3,000 |
| Mock 8 (target) | 595 | 91st | ~2,000 |
Interpretation:
Indian context (2025 cycle):
Tag each answer as Sure / Doubtful / Guessed. The confidence matrix reveals 4 cells.
| Correct | Wrong | |
|---|---|---|
| Sure | Sure-and-correct (target over 90 percent) | Sure-and-wrong (target under 5 percent) |
| Doubtful | Doubtful-and-correct | Doubtful-and-wrong |
| Guessed | Guessed-and-correct (random hits) | Guessed-and-wrong |
Interpretation:
Calibration targets:
NEETPGAI auto-flags Sure-and-wrong questions as "priority concept rewrites" on the analytics dashboard.
Some subjects have natural timing characteristics. Deviations signal trouble.
| Subject | Typical sec/Q | Red flag |
|---|---|---|
| Anatomy | 50-55 | Over 70 means concept gap |
| Pathology | 55-60 | Over 75 means second-guessing |
| Pharmacology | 35-45 | Over 60 means recall gap; under 25 means rushed |
| Medicine | 55-65 | Over 80 means clinical judgement gap |
| Surgery | 55-65 | Over 80 means operative-case unfamiliar |
| OBG | 50-60 | Over 75 means management-decision uncertain |
| Pediatrics | 50-60 | Over 75 means dose / milestone uncertain |
| PSM | 35-45 | Under 25 means rushed (PSM is the classic over-confidence subject) |
| Forensic Medicine | 30-40 | Over 50 means section-number confusion |
Track the option position of correct answers in your wrong responses. Is there a bias? AI-generated MCQs in older banks were positionally biased toward option A (45 percent A, 34 percent B, 14 percent C, 7 percent D in pre-shuffle audits) — NEETPGAI's question bank now shuffles options uniformly to defeat this, but legacy banks may not.
| Pattern | Sign | Remediation |
|---|---|---|
| Skew toward option A | Likely working with an older / unfixed question bank | Switch to NEETPGAI shuffled bank |
| NOT / EXCEPT misses | You overlook the negative stem | Underline NOT/EXCEPT in every question stem; practice 30-50 EXCEPT questions |
| "All of the above" misses | You stop reading after finding one right option | Read all 4 options before answering |
| Look-alike distractors (one right answer surrounded by 3 plausible variations) | Concept confusion | Practice "compare-and-contrast" flashcards |
| Obsolete framework distractors (DSM-IV, ICD-10 in DSM-5-tested questions) | Out-of-date textbook | Use updated source for the topic |
The hardest part of mock analytics is the "now what?" step. NEETPGAI automates this.
/practice?mode=pyq&year=2025&subjectId=N pre-selects the PYQ mode with year and subject filter; useful for drilling year-specific PYQsThe principle is close-the-loop — every mock analysis must end in a concrete practice queue and a flashcard set, not a vague resolution to "study more".
There is a transition point in your prep timeline where the value of additional mocks drops below the value of deep review. Signs you have reached it:
At this point, the marginal value of an additional mock is small. Switch to deep review:
Re-introduce one mock every 7-10 days to recalibrate.
The same mock analytics dashboard reads differently depending on time-to-exam.
3 months out — wide-angle interpretation:
6 weeks out — narrow-angle interpretation:
Last month — surgical interpretation:
Last week — confidence consolidation only:
Last 24 hours — only confidence cards:
Neurocognitive rationale: New memories take 24-72 hours to consolidate; anything learned in the last week is unreliable under exam stress. Cohort data shows last-month strategic interpretation adds 15-30 marks vs students who keep doing more mocks.
Here is how a topper aspirant might decode a specific mock test.
Mock 5 result: 555/720; 88th percentile.
Step 1 (10 min) — Lens 1 + 2: Subject scan. Falling subjects — Pathology (60 percent), Medicine (55 percent), OBG (57 percent). Slow subjects — Pathology (75 sec/Q), Medicine (90 sec/Q). Action: target these 3 first.
Step 2 (20 min) — Lens 3 + 4: Topic drill-down. Top weak cells — Renal pathology (38 percent), Nephrology in Medicine (40 percent), Gynae oncology (42 percent), Liver pathology (44 percent). Question type weakness — Assertion-Reason (55 percent), NOT/EXCEPT (52 percent).
Step 3 (15 min) — Lens 5 + 6 + 7: Review pattern audit. Net review benefit -3 (destructive second-guessing). Sure-and-wrong rate 12 percent (too high). Percentile 88th — on track but plateauing.
Step 4 (10 min) — Lens 8 + 9: Timing red flags — Medicine and Pathology over 75 sec/Q; PSM rushed at 28 sec/Q (cross-check accuracy: 78 percent — slipping). NOT/EXCEPT pattern misses 8 of 12 — clear comprehension trap.
Step 5 (20 min) — Lens 10: Generate NEETPGAI adaptive practice queue. 25 questions each for Renal pathology, Nephrology, Gynae oncology, Liver pathology, Assertion-Reason format, NOT/EXCEPT format. Total 150 questions to drill over next 5 days.
Step 6 (10 min) — Lens 11 + 12: Decide review-vs-mock. With 6 weeks left, stay narrow-angle. Allocate next 5 days to focused practice queue + Sure-and-wrong flashcards; re-mock in 7 days.
Step 7 (5 min) — Documentation: Add 25 Sure-and-wrong concept rewrites to flashcard system. Set spaced repetition. Note "underline NOT/EXCEPT" as exam-day reminder.
Total decoding time: 90 minutes. Yields ~20-25 marks improvement in the next 2-3 mocks if executed.
The marginal value of an additional mock test drops sharply once you have completed 6-8 full-length mocks. Beyond that point, the test-taking experience itself adds little — you already know the timing, the format, and the fatigue curve. What adds value is the diagnostic information from the mocks you have already taken. A typical 200-question mock test yields approximately 80-120 wrong or skipped answers; reading the analytics layer-by-layer surfaces 8-12 actionable weak spots that, if addressed in the next 7-10 days, can move your score by 15-30 marks. Re-taking another mock without first decoding the previous one is the single most common mistake in the last 60 days — it consumes 3.5 hours of solving plus 2-3 hours of review without revealing new weak spots because you have not closed the loop on the previous mock. The compounding rule is simple: 1 mock fully decoded = 3 mocks half-decoded for marks-per-hour return.
Subject-level accuracy (e.g., 'Pathology 65 percent, Pharmacology 58 percent') is the surface-layer metric every mock report shows. It is useful for very broad triage but is too coarse to drive remediation — telling you 'study more Pathology' wastes hours. Topic-level accuracy (e.g., 'Cardiac pathology 78 percent, Renal pathology 42 percent, Haematology 71 percent') is the actionable metric. It pinpoints the 20-30 specific topic cells where you actually lose marks, allowing you to target 1-2 days of focused revision per cell. Most mock test platforms (including NEETPGAI) display both subject and topic accuracy; cohort data shows students who study at the topic level move accuracy 20-30 percent faster than those who stay at the subject level. NEETPGAI's topic-level heatmap auto-clusters topic cells into red (under 40 percent), amber (40-60 percent), and green (over 60 percent) and ranks them by total mark contribution so you can prioritise the highest-yield weak spots first. The principle is simple — the finer the granularity of the metric, the higher the actionable yield per hour spent on it.
Confidence calibration measures how well your self-rated confidence on each question matches your actual accuracy. Some platforms (NEETPGAI included) let you tag each answer as Sure / Doubtful / Guessed. The calibration matrix reveals 4 patterns. (1) Sure-and-correct (target 90+ percent in this cell) — your concept knowledge is solid; minimal revision needed. (2) Sure-and-wrong — the most dangerous pattern; you confidently held a wrong belief; needs concept rewriting from primary sources. (3) Doubtful-and-correct — you knew enough to guess right; build flashcards to convert these to Sure-and-correct. (4) Guessed-and-correct — chance hits; do not bank on these; treat as gaps for next mock. The cell to watch most closely is Sure-and-wrong. A well-calibrated student has under 5 percent Sure-and-wrong; an over-confident student has 15-25 percent in this cell and is at high risk of compounding errors on exam day. NEETPGAI auto-flags Sure-and-wrong questions as 'priority concept rewrites' on the analytics dashboard. Calibrating across 4-6 mocks moves over-confident students to 5-8 percent Sure-and-wrong over 30 days.
Marks in mock tests vary by mock difficulty — a 540/720 score on a hard mock may be equivalent to a 600/720 on an easy mock. Percentile (your rank divided by total candidates, expressed as percentage) normalises across mock difficulty and across the cohort you compete against. A consistent 90th-percentile finish across 4-6 mocks indicates a top-1000 to top-3000 All-India rank trajectory, irrespective of the raw marks. Most platforms display both — track percentile as the primary metric and use marks only to compare similar-difficulty mocks. NEETPGAI shows percentile against the platform cohort (the closest proxy to the actual NEET PG cohort once 50,000+ aspirants are on it) and a trajectory line of percentile-vs-time across mocks. The trajectory is the most predictive single metric — a rising trajectory (60 to 70 to 78 to 84) over 6 mocks indicates 30-60 rank improvement per week and a high-confidence top-rank trajectory; a flat or falling trajectory needs an honest conversation with yourself about study strategy. The 2025 NEET PG cut-off for AIIMS-Delhi-equivalent ranks was approximately the 99.7th percentile; PGIMER MBBS-aspirant cut-off approximately 99.5th; standard MD seat at top state colleges approximately 98th-99th.
Mock analytics interpretation changes fundamentally over the timeline to exam. Three months out — wide-angle interpretation. Look at subject-level patterns, total time management, broad topic gaps, conceptual frameworks; spend 60-70 percent of remaining study time on broad concept revision and 30-40 percent on weak topics. Tolerable to spend 1-2 hours on each weak topic with primary source revision. Six weeks out — narrow-angle interpretation. Drill down to specific topics; identify the top-10 weak topic cells contributing most to lost marks; spend 40-50 percent on these weak cells, 30 percent on consolidation, 20 percent on mocks. Last month — surgical interpretation. Stop broad revision; focus entirely on (1) Sure-and-wrong concept rewrites, (2) leech-card review, (3) top-5 weak topic cells, (4) mistake-bank flashcards. Do not start any new topic. Last week — confidence consolidation only. Review only the 100-150 highest-yield trap cards and known-confident cards; no new mocks; sleep, hydration, and mental rehearsal of the exam-day routine. Last 24 hours — only confidence cards; no new content. The neurocognitive reason is that new memories take 24-72 hours to consolidate; anything learned in the last week is unreliable under exam stress. Cohort data shows last-month strategic interpretation adds 15-30 marks vs students who keep doing more mocks.
This content is for educational purposes for NEET PG exam preparation. It is not a substitute for professional medical advice, diagnosis, or treatment. Clinical information has been reviewed by qualified medical professionals.
Written by: NEETPGAI Editorial Team Reviewed by: Pending SME Review Last reviewed: May 2026