Pediatrics Question Quality Review
Executive Summary
This review covers a candidate sample of 100 validated non-gold questions drawn from a pool of 7,754 Pediatrics items. The sample was evaluated against 8 benchmark questions and 12 recent PYQs as the quality bar.
The headline finding is stark: the candidate sample is dominated by low-cognitive-demand recall items that are structurally and conceptually far below the benchmark standard. The Blooms distribution tells the story directly — 49 of 100 candidate questions sit at Blooms Level 1, and 40 at Level 2. Only 11 reach Level 3, and none reach Level 4. The benchmark set, by contrast, operates entirely at Blooms Level 3 with rich clinical vignettes. The PYQ set, even at its simpler end, uses clinical context and requires application.
Beyond cognitive depth, the reviewed set contains multiple broken delivery items (image-dependent questions with no image), at least two factually unsafe or wrong-key items, a cluster of "All of the above" and "None of the above" formatted questions that are structurally weak, and a meaningful number of items that are so trivially simple they add no discriminatory value for PG-level examination.
The subject is large (7,754 items) and the problems observed in this sample appear to be systemic rather than isolated. Priority action should focus on disabling the most trivial and broken items immediately, fixing the structurally repairable mid-tier items, and using the benchmark set as the template for any new item generation.
Summary counts across the reviewed sample:
| Issue Category | Approximate Count in Sample |
|---|---|
| Wrong Key or Factually Unsafe | 4–5 |
| Broken Delivery (image missing, malformed options) | 5–6 |
| Low-Value But Correct (Blooms 1, trivial recall) | 35–40 |
| Repetitive or Duplicative Coverage | 8–10 |
| Worthwhile Concept, Weak Execution | 15–20 |
| Wrong Subject or Wrong Topic Placement | 2–3 |
What Good Looks Like
The benchmark and PYQ sets establish a clear quality bar. The following features define a high-quality Pediatrics item for Indian PG examinations:
Clinical vignette with decision-forcing context. Every benchmark question presents a patient scenario with age, presenting complaint, examination findings, and relevant investigations. The candidate must synthesize information, not retrieve a single fact. For example, question e8b7fc20 presents a 4-year-old with drooling, stridor, and SpO2 of 88% and asks for immediate management — the correct answer requires understanding why direct laryngoscopy is dangerous and why OR-controlled intubation is preferred over emergency room intubation.
Distractors that are clinically plausible and educationally meaningful. In 8162ad33, all four options describe real pathophysiological mechanisms of bone disease. A candidate who does not understand the difference between defective mineralization, osteoclastic resorption, and collagen synthesis failure cannot guess the answer. In f095ab6f, the distractors for SAM management (high-protein diet from day 1, RUTF immediately, standard formula) are all things a candidate might plausibly consider, making the question genuinely discriminatory.
Management and reasoning questions, not identification questions. The benchmark set consistently asks "what is the most appropriate next step" or "which factor is most predictive" rather than "what is the most common" or "what is the definition of." Even the simpler PYQs like 54912a50 (Kawasaki treatment) and 4ad89d5c (9-month developmental delay) require the candidate to apply knowledge to a scenario.
Correct Blooms calibration. Benchmark items are uniformly Blooms 3. PYQs range from Blooms 1 to 4, but even the Blooms 1 PYQs (bbe55b1f, 409a18ed) test clinically relevant recall that appears in actual examinations. The candidate sample, by contrast, contains Blooms 1 items testing facts like "what is the newborn period" and "what is the average birth length" — facts that have never appeared in INICET or NEET-PG in isolation.
No structural gimmicks. Benchmark items do not use "All of the above," "None of the above," or "All EXCEPT" as the correct answer. Options are parallel in structure, specific, and independently evaluable.
Main Issue Categories
1. Wrong Key or Factually Unsafe
Why this pattern is bad. A wrong key is the most serious quality failure in a question bank. It directly harms candidates who reason correctly and are penalized for it. In a high-stakes PG examination context, a wrong key also damages platform credibility. Factually unsafe items — where the stated correct answer is contested, outdated, or contradicted by standard references — carry the same risk even if the error is not absolute.
How it shows up. In this sample, the pattern appears in two forms: (a) an answer that is straightforwardly incorrect by standard pediatric references, and (b) an answer that is internally inconsistent with the question stem.
Example question IDs and explanations:
86ec7aee— "What is the primary treatment for a dehydrated child?" with the correct answer marked as "5% dextrose." This is factually unsafe. The standard of care for dehydration in children is oral rehydration solution (ORS) for mild-to-moderate dehydration, or isotonic crystalloid (normal saline or Ringer's lactate) for severe dehydration requiring IV therapy. 5% dextrose is not a rehydration fluid and is contraindicated as a primary rehydration agent because it provides free water without electrolytes and can cause hyponatremia. This item could cause direct patient harm if a candidate internalizes it. Disable immediately.4a31fd2b— "A newborn is prone to hypothermia due to which of the following reasons?" with the correct answer marked as "Presence of brown fat." Brown adipose tissue (BAT) is the primary thermogenic mechanism in neonates — it generates heat through non-shivering thermogenesis and is protective against hypothermia, not a cause of it. The actual reasons for hypothermia susceptibility are the other three options: high surface area to mass ratio, limited subcutaneous fat, and proportionally large head. The key is inverted. Disable immediately.b7c68166— The question asks for the WHO definition of low birth weight. Two of the four options are identical ("Less than 2.5 kg" appears as both option A and option C, with option C marked correct). This is a broken duplicate option that also makes the question trivially answerable by elimination. Disable immediately.2a0c37a6— "Which of the following is NOT a feature of TAR syndrome?" with the correct answer marked as "Ventricular septal defect." TAR syndrome (Thrombocytopenia-Absent Radius) is associated with cardiac defects including ASD and VSD in approximately 30% of cases. VSD is a recognized association. The question as written implies VSD is not a feature, which is factually contested. The safer "not a feature" for TAR would be absent thumbs (thumbs are present in TAR, distinguishing it from Fanconi anemia) or autosomal dominant inheritance (it is autosomal recessive). The current key is at minimum ambiguous and potentially wrong. Fix or disable.f49fda07— The stem states "Delayed puberty is defined as the absence of secondary sexual characteristics by age 13 in girls and 14 in boys, or the absence of menarche by age 15" and then asks "What is the age threshold for primary amenorrhea without the development of secondary sexual characters that defines delayed puberty?" The stem has already embedded the answer (age 13 for girls without secondary sexual characteristics) but the correct answer is marked as 16 years. The 16-year threshold applies to primary amenorrhea with secondary sexual characteristics present, not to absence of secondary sexual characteristics. The stem conflates two different clinical definitions. This is internally inconsistent and factually imprecise. Fix or disable.
Recommended disposition: Disable 86ec7aee, 4a31fd2b, b7c68166 immediately. Fix or disable 2a0c37a6 and f49fda07 after expert clinical review.
2. Wrong Subject or Wrong Topic Placement
Why this pattern is bad. Misplaced questions contaminate topic-level analytics, distort difficulty calibration within a topic, and confuse candidates who are studying by topic. A question tagged to Pediatric Nutrition that is actually about adult pathology, or a question tagged to Neonatology that belongs in Pediatric Surgery, creates noise in both the source and destination topic.
How it shows up. In this sample, the pattern is observed in a small number of items where the clinical content clearly belongs to a different specialty or the topic tag does not match the question content.
Example question IDs and explanations:
de006b02— Tagged to "Pediatric Nutrition" and asks "Which of the following diseases will lead to the following appearance?" with the correct answer being "Paget's disease." Paget's disease is an adult bone disorder. It is not a pediatric nutrition topic, not a pediatric condition, and the question depends on a missing image. This item is misplaced in Pediatric Nutrition and belongs, if anywhere, in Orthopedics or Medicine. Disable.9506cbfe— Tagged to "Child Abuse and Neglect" and asks "A 5-month-old infant is brought in for a routine visit. What is the probable cause for the following findings?" with the correct answer being "Benign skin lesion." The question is entirely image-dependent (the "following findings" are not described in text). Without the image, the question is unanswerable. Additionally, the topic placement in Child Abuse and Neglect for what is apparently a benign skin finding question is questionable. This is simultaneously a broken delivery item and a potential misplacement. Disable.8118cf77— Tagged to "Infectious Diseases" and asks about the mechanism of Streptococcus pneumoniae causing otitis media. While technically a pediatric infectious disease question, the mechanism asked (direct extension from nasopharynx) is basic microbiology/ENT content that is more appropriately placed in Microbiology or ENT. The Blooms tag of Level 1 is also inconsistent with the difficulty tag of "2." This is a borderline misplacement rather than a clear error, but it should be reviewed for topic accuracy.
Recommended disposition: Disable de006b02 and 9506cbfe. Review 8118cf77 for topic reassignment.
3. Broken Delivery (Missing Image, Malformed Options, Incomplete Stem)
Why this pattern is bad. A question that depends on an image that is not rendered, or that has a stem referring to "the following findings" without providing them, is completely unanswerable on its own merits. Candidates either skip it (wasting an item slot) or guess randomly (introducing noise into performance data). Malformed options — such as duplicate answer choices — make questions trivially answerable by elimination and invalidate the psychometric value of the item.
How it shows up. In this sample, the pattern appears in three forms: (a) stems that reference an image or "the following" without any image or description present in the text, (b) duplicate options, and (c) stems that are incomplete sentences.
Example question IDs and explanations:
77226350— "Which is true about the instrument shown?" — The entire question depends on an image of an orchidometer (Prader orchidometer, based on the options). No image is present in the text. The question is unanswerable without it. Disable.b1e603e5— "Which is true about the reflex shown in the image?" — Again, entirely image-dependent with no image present. The options list four reflexes (Gallant, Parachute, ATNR, STNR). Without the image, this is a random guess. Disable.9506cbfe— "What is the probable cause for the following findings?" — Image-dependent, no image present, as noted above. Disable.de006b02— "Which of the following diseases will lead to the following appearance?" — Image-dependent, no image present. Disable.b7c68166— Options A and C are identical ("Less than 2.5 kg"). This is a malformed option set that makes the question trivially answerable by elimination and is likely a data entry error. Disable.4a13ab31— "What are the differential diagnoses for a left-sided abdominal mass in a baby?" with all three substantive options (splenic flexure, spleen, left kidney) being correct and the answer being "All of the above." This is not technically broken delivery, but the stem is so vague ("a baby" with no clinical context) and the answer format so weak that it functions as a broken item from a quality standpoint. Addressed further under Category 4.93c8e60c— "Wilms' tumor: What is the most common presenting symptom?" — The stem is a sentence fragment. It reads as a heading rather than a question. This is a minor formatting issue but reflects the broader pattern of incomplete stem construction in this sample.
Recommended disposition: Disable 77226350, b1e603e5, 9506cbfe, de006b02, b7c68166 immediately due to broken delivery. Flag 93c8e60c for stem rewrite.
4. Low-Value But Correct (Too Simple, Low-Yield, Trivia-Heavy, Weak Exam Relevance)
Why this pattern is bad. A question can be factually correct and still be a poor examination item. For Indian PG examinations (INICET, NEET-PG), the relevant cognitive demand is application and reasoning, not isolated recall of definitions, normal values, or single-fact associations. Items that test only whether a candidate has memorized a number or a label — without any clinical context — do not discriminate between candidates who understand the subject and those who have merely rote-learned a list. They also inflate the apparent coverage of a topic without adding genuine assessment value. When strong gold-standard coverage already exists for the same concept at a higher cognitive level, low-value items should be disabled rather than rewritten, because a rewrite would essentially produce a new item that duplicates existing gold-standard coverage.
How it shows up. This is the single largest problem category in the reviewed sample. Approximately 35–40 of the 100 candidate questions fall into this bucket. The pattern includes: bare-fact "what is the definition of X," "what is the most common Y," "what is the dose of Z," and "what is the normal value of W" questions with no clinical context, no vignette, and no reasoning requirement.
Example question IDs and explanations:
8f3c55a7— "The newborn period is defined as the first ________ days after birth?" This is a definitional recall item at Blooms 1. It has no clinical relevance to examination performance and tests nothing beyond memorization of a textbook definition. Disable.6b0cb9cf— "What is the average length of a full-term child at birth?" Four numerical options, correct answer 50 cm. Pure recall of a normal value. No clinical context. Disable.4205d339— "At what age does a child's ICF and ECF composition become similar to that of an adult?" Pure recall of a physiological fact with no clinical application. Disable.750c0e95— "What device is used to prevent hypothermia in a preterm neonate?" with the correct answer being "Incubator." This is a Blooms 1 recall item. The distractor "Radiant warmer" is actually also used for preterm neonates (and is the preferred device for resuscitation), making the question additionally ambiguous. Disable.ac07da88— "What is the concentration of epinephrine used in neonatal resuscitation?" with the correct answer being "1:10,000." This is a pure dose-recall item. While the fact is clinically important, the question format (bare recall with no scenario) does not test whether a candidate can apply this knowledge. The benchmark item343e7bcfdemonstrates how neonatal resuscitation can be tested at a much higher level. Disable.b4b39147— "Which of the following scoring systems is used to assess respiratory distress in neonates?" Correct answer: Silverman-Anderson score. Pure recall, Blooms 1. Disable.1a4d36b8— "Salmon patch usually disappears by what age?" Pure recall of a dermatological fact about a benign neonatal skin finding. No clinical relevance to PG examination. Disable.5141a9d7— "In Kwashiorkor, the letter K is post-fixed to denote what?" This is etymological trivia. The answer (edema) is clinically important, but the question format tests knowledge of a naming convention rather than clinical understanding of kwashiorkor. Disable.b3404967— "Which enzyme is increased in rickets?" Correct answer: Alkaline phosphatase. This is a single-fact recall item. The benchmark item8162ad33covers rickets at Blooms 3 with a full clinical vignette and pathophysiology reasoning. This item adds nothing. Disable.0ada9469— "What is the most common tumor of the kidney in children?" Correct answer: Wilms' tumor. Blooms 1, no clinical context. The benchmark item7773c80ecovers Wilms' tumor/neuroblastoma at Blooms 3. Disable.09872e1e— "Which of the following age groups falls under the early adolescence age group?" Pure definitional recall. Disable.37e5de73— "At what age does the adolescent growth spurt typically begin?" Pure recall. Disable.ce7f1eda— "A baby is considered large for gestational age (LGA) if their birth weight is:" Pure definitional recall. Disable.f4336ead— "What is the typical lifespan of neonatal red blood cells?" Pure recall of a physiological value. Disable.83cd95aa— "Fanconi's anemia is a type of:" with the correct answer being "Constitutional anemia." This is a classification recall item with no clinical value at PG level. Disable.51964003— "What is the most common tumor of infancy?" Correct answer: Hemangioma. Blooms 1, no context. Disable.58d36a98— "Which of the following is used in the treatment of ALL?" with the correct answer being "All of the above" (Vincristine, L-asparaginase, Prednisolone). This tests recall of a drug list and uses the structurally weak "All of the above" format. Disable.8c766b37— "What is the recommended dose of zinc for a 9-month-old infant experiencing acute diarrhea?" Correct answer: 20 mg daily. Pure dose recall. Disable.83eb3bb0— "A 9-month-old child has been diagnosed with measles. What is the recommended dose of vitamin A administration?" Correct answer: 1,00,000 IU. Pure dose recall with a thin clinical wrapper. Disable.0504a122— "Hungry diarrhoea is seen in which of the following conditions?" Correct answer: Marasmus. Single-fact association, Blooms 1. Disable.29f99ef2— "Pseudohermaphroditism in a female child is most commonly due to which of the following?" Correct answer: 21-hydroxylase deficiency. While the concept is high-yield, the question format is bare recall. The benchmark item7f46d7dbcovers CAH at Blooms 3 with a clinical scenario. Disable in favor of the higher-quality item.cff3962c— "At what age does the tonic neck reflex typically disappear?" Pure recall of a developmental milestone. Disable.4ba18bfc— "Handedness develops by age of?" Pure recall. Disable.b49a62b8— "The mother of a 6-month-old infant is concerned that her baby may be teething. You explain to her that the first teeth to erupt in most children are which of the following?" While there is a thin clinical wrapper, this is pure recall of a dental eruption sequence. Disable.
Recommended disposition: Disable all items listed above. The concepts they cover (rickets, SAM, Wilms' tumor, CAH, neonatal resuscitation) are already covered at higher quality in the benchmark and PYQ sets. Rewriting these items would produce duplicates of existing gold-standard coverage.
5. Repetitive or Duplicative Coverage
Why this pattern is bad. When multiple items in the bank test the same narrow fact or the same clinical scenario at the same cognitive level, they dilute the discriminatory power of the bank, inflate apparent topic coverage, and waste item slots that could be used for genuinely different concepts. In a bank of 7,754 items, duplication is expected, but it becomes a quality problem when the duplicates are all low-quality items covering the same ground as existing gold-standard items.
How it shows up. In this sample, several concept clusters appear multiple times at low cognitive demand, and several items directly overlap with PYQ content without adding any new angle.
Example question IDs and explanations:
Kawasaki disease is covered by PYQs
54912a50(treatment) andf5a67c2b(diagnostic criteria), and by benchmark item4f17d182(IVIG-resistant Kawasaki). In the candidate sample,a7ccb11easks "Of the following vasculitides, coronary artery aneurysms are most often seen in which condition?" with the correct answer being Kawasaki disease. This is a Blooms 1 recall item that adds nothing to the existing high-quality Kawasaki coverage. Disable.Dehydration management appears in multiple candidate items:
69f60d39(fluid replacement in severe neonatal dehydration),d6e4b550(classification of dehydration),86ec7aee(primary treatment for dehydrated child — also wrong key), and95dfb937(fluid for burns). These items cluster around the same concept at low cognitive demand and overlap with PYQ0d222b0f(SAM with shock). Disable86ec7aee(wrong key); review69f60d39andd6e4b550for consolidation.Developmental milestones are covered by multiple low-quality items in the sample:
acc8e482(triangle vs diamond drawing),92bf7631(10-month milestones),2989f6ca(7-month milestones),cff3962c(tonic neck reflex disappearance),4ba18bfc(handedness),3535c381(full name knowledge),b49a62b8(first tooth eruption). All are Blooms 1 recall items. The PYQ4ad89d5ccovers developmental milestones at Blooms 4 with a clinical scenario. The candidate items add no value. Disable all.Neonatal definitions cluster:
8f3c55a7(newborn period definition),ce7f1eda(LGA definition),b7c68166(LBW definition — also broken),f4336ead(neonatal RBC lifespan). All are pure definitional recall. Disable all.Tetralogy of Fallot is covered by
f315ae37("Tetralogy of Fallot is characterized by following except") and2a863f6f("The most common cause of Blue baby syndrome"). Both are low-cognitive-demand items covering the same condition. Neither adds value beyond what a Blooms 3 vignette item would provide. Disable both or retain only the slightly higher-qualityf315ae37after fixing the stem.
Recommended disposition: Disable duplicative low-value items as listed. Where a concept has existing gold-standard coverage, prefer disabling the candidate item over rewriting it.
6. Worthwhile Concept, Weak Execution (Keep the Concept, Fix the Stem/Options/Vignette)
Why this pattern is bad. Some items in the sample test genuinely high-yield concepts that appear in INICET and NEET-PG, but the execution is weak: the stem lacks clinical context, the distractors are implausible or structurally poor, the question uses "All of the above" as the correct answer, or the cognitive demand is artificially low for the concept being tested. These items are worth fixing because the concept is correct and the investment in a rewrite is justified.
How it shows up. The pattern appears in approximately 15–20 items in the reviewed sample. The most common execution failures are: (a) bare-fact stems on high-yield topics, (b) "All of the above" as the correct answer, (c) "NOT/EXCEPT" format with a trivially obvious exception, (d) distractors that are implausible or non-parallel, and (e) clinical scenarios that are too thin to require reasoning.
Example question IDs and explanations:
6bdb7e62— "A newborn baby presented with profuse bleeding from the umbilical stump. The rest of the examination and PT, APTT are within normal limits. What is the most probable diagnosis?" Correct answer: Glanzmann thrombasthenia. This is a worthwhile concept (Factor XIII deficiency is the classic answer for umbilical stump bleeding with normal PT/APTT; Glanzmann thrombasthenia causes mucocutaneous bleeding but not specifically umbilical stump bleeding with normal coagulation studies). The key may actually be wrong — Factor XIII deficiency is the textbook answer for this presentation. This item needs expert clinical review before any fix. Fix after key verification, or disable if key is confirmed wrong.aa28f294— "A 2-year-old male child presents with cough, high-grade fever, and rapid breathing for the past 7 days. On examination, the respiratory rate is 50/min, and visible chest indrawing is present. What is the next appropriate line of management?" Correct answer: "Home management with Amoxicillin." This is a worthwhile IMNCI-based clinical scenario, but the correct answer is problematic. Per IMNCI guidelines, a child with chest indrawing has "severe pneumonia" and requires hospital admission with injectable antibiotics, not home management with oral amoxicillin. Chest indrawing is a danger sign that upgrades the classification. The key appears to be wrong. Disable pending expert review.7f46d7db— "A 4-week-old female child with normal genitalia presents to the emergency department with severe dehydration, hyperkalemia, and hyponatremia. The measurement of which of the following blood levels will be helpful?" Correct answer: Aldosterone. This is a good concept (salt-wasting CAH) but the correct answer is debatable. The standard diagnostic test for salt-wasting CAH (21-hydroxylase deficiency) is 17-hydroxyprogesterone, not aldosterone. Aldosterone would be low in this condition, but measuring it is not the primary diagnostic step. The question would be better served by asking for 17-OHP or by restructuring to ask about the diagnosis rather than the investigation. Fix: change the key to 17-hydroxyprogesterone or restructure the question.6b23490d— "A seven-year-old child presents with recurrent chest infections and exocrine pancreatic insufficiency. Sweat chloride levels have been observed between 40-60 mmol/L on two separate occasions. Which of the following tests should be performed next to support the diagnosis of cystic fibrosis?" Correct answer: "Demonstrate an abnormal nasal potential difference." This is a genuinely good clinical reasoning question at Blooms 3. The sweat chloride is in the intermediate range (40–60 mmol/L is borderline), and the next step is either CFTR mutation analysis or nasal potential difference. The correct answer is defensible. However, the distractor "Demonstrate the F508 mutation by DNA analysis" is also a valid next step per CF diagnostic guidelines (CFTR mutation analysis is recommended for borderline sweat chloride). The question needs clearer stem language specifying why NPD is preferred over CFTR mutation analysis in this context, or the key needs review. Fix: clarify stem or review key against current CF diagnostic guidelines.efc5ff89— "You are examining an infant and the findings are as follows: Adductor angle - 100°, Popliteal angle - 90°, Dorsiflexion angle of foot - 70°, Scarf sign - Elbow crosses the middle but doesn't reach the anterior axillary line. What is the appropriate age of the infant?" Correct answer: 4–6 months. This is a worthwhile concept (neuromuscular maturity assessment) and the question format is reasonable. However, the question would benefit from a brief clinical context (e.g., "You are assessing tone in an infant referred for developmental evaluation") to make it feel less like a table-lookup exercise. Fix: add minimal clinical context to the stem.f02585b0— "An infant can regard his parent's face, follow to midline, lift his head from the examining table, smile spontaneously, and respond to a bell. He does not yet regard his own hand, follow past midline, nor lift his head to a 45-degree angle off the examining table. Which of the following is the most likely age of the infant?" Correct answer: 1 month. This is a well-constructed developmental milestone question that requires synthesis of multiple milestone data points. It is one of the better items in the candidate sample. Keep with minor formatting cleanup.5e3bfe99— "A child presented with frothy urine, massive proteinuria, and edema. Urine examination revealed no RBCs, no WBCs, no casts, and no crystals. There is no prior episode of similar presentation. What is your diagnosis?" Correct answer: Minimal change disease. This is a worthwhile concept and the clinical details are appropriate. However, the question would be stronger if it included age (nephrotic syndrome in a young child strongly favors MCD), serum albumin, and asked for the most appropriate initial treatment rather than just the diagnosis. As written, it is a reasonable Blooms 2 item but could be elevated to Blooms 3. Fix: add age and ask for management or pathophysiology.0b736c92— "A neonate presents with vitamin K dependent bleeding. Which of the following drugs, if taken by the mother during the antenatal period, is LEAST likely to cause this condition?" Correct answer: Quinine. This is a worthwhile pharmacology-neonatology interface concept. The question is reasonably constructed, though the "LEAST likely" format is slightly negative. The distractors (phenytoin, phenobarbitone, INH) are all known to interfere with vitamin K metabolism. Quinine does not. Keep with minor stem polish.a2032c5d— "According to PALS 2010 guidelines, which of the following is NOT a component of the initial impression of a child?" Correct answer: Airway. The PALS Pediatric Assessment Triangle (PAT) consists of Appearance, Work of Breathing, and Circulation to skin — not Airway, Breathing, and Circulation (which is the primary assessment). This is a worthwhile concept but the question is testing a specific guideline version (PALS 2010) that may be outdated. The 2020 PALS guidelines should be the reference. Fix: update to current PALS guidelines and verify the key.44fd6887— "A 7-year-old child with a known history of rheumatic heart disease presents with a 3-week history of fever with palpitations. Most likely cause is:" Correct answer: Staphylococcal endocarditis. This is a worthwhile concept (infective endocarditis on rheumatic valve disease) but the correct answer is debatable. In children with rheumatic heart disease, Streptococcus viridans (alpha-hemolytic streptococci) is the most common cause of infective endocarditis, not Staphylococcus aureus. Staphylococcal endocarditis is more common in IV drug users and prosthetic valves. The key needs expert review. Fix: review key against standard references; likely should be Streptococcus viridans.ec0c1da8— "Which of the following are neonatal complications of maternal diabetes during pregnancy?" with the correct answer being "I, II and IV" (Hyperbilirubinemia, Hypocalcemia, Hypoglycemia — excluding Cardiomyopathy). Cardiomyopathy (hypertrophic cardiomyopathy) is a well-recognized complication of IDM. The exclusion of cardiomyopathy from the correct answer set appears to be an error. Fix: verify key; cardiomyopathy should likely be included, making "All of the above" the correct answer, or the question should be restructured.
Recommended disposition: Fix f02585b0, 5e3bfe99, 0b736c92, efc5ff89 (minor fixes). Fix or disable 6bdb7e62, 7f46d7db, 44fd6887, ec0c1da8, aa28f294, a2032c5d after expert key verification. Fix 6b23490d after CF guideline review.
Prioritization
The following prioritization is based on urgency and impact:
Tier 1 — Immediate Action (Disable without further review): These items are either factually unsafe, broken, or so trivially low-value that no rewrite is justified given existing gold-standard coverage.
- Wrong key / factually unsafe:
86ec7aee,4a31fd2b,b7c68166 - Broken delivery (image missing):
77226350,b1e603e5,9506cbfe,de006b02 - Wrong subject:
de006b02(also broken) - Trivial recall with gold-standard coverage already existing:
8f3c55a7,6b0cb9cf,4205d339,750c0e95,ac07da88,b4b39147,1a4d36b8,5141a9d7,b3404967,0ada9469,09872e1e,37e5de73,ce7f1eda,f4336ead,83cd95aa,51964003,58d36a98,8c766b37,83eb3bb0,0504a122,29f99ef2,cff3962c,4ba18bfc,b49a62b8,a7ccb11e,acc8e482,92bf7631,2989f6ca,3535c381
Tier 2 — Expert Review Required Before Disposition (Fix or Disable): These items have potentially wrong keys or ambiguous correct answers that require clinical expert sign-off before any action.
aa28f294(IMNCI classification — likely wrong key)6bdb7e62(umbilical stump bleeding — likely wrong key, Factor XIII vs Glanzmann)7f46d7db(CAH investigation — key likely should be 17-OHP)44fd6887(IE in RHD — key likely should be Streptococcus viridans)ec0c1da8(IDM complications — cardiomyopathy exclusion appears wrong)2a0c37a6(TAR syndrome — VSD as "not a feature" is contested)f49fda07(delayed puberty definition — internally inconsistent stem)a2032c5d(PALS 2010 — outdated guideline version)
Tier 3 — Fix (Concept is sound, execution needs improvement): These items are worth rewriting because the concept is high-yield and the fix is tractable.
6b23490d(CF diagnosis — clarify stem or review key)5e3bfe99(nephrotic syndrome — add age, ask for management)efc5ff89(neuromuscular maturity — add clinical context)93c8e60c(Wilms' tumor — fix stem fragment)0b736c92(vitamin K bleeding — minor polish)
Tier 4 — Keep (Acceptable quality, minor issues only):
f02585b0(developmental milestones — well-constructed, minor formatting)a45b4a61(Prader-Willi syndrome — reasonable Blooms 3 item)258b805b(acute malnutrition assessment — clinically relevant)6bacec98(Sydenham's chorea — acceptable recall item)b41e715f(spike and dome EEG — acceptable recall item)
Example Keep / Fix / Disable Calls
KEEP — f02585b0 (Growth and Development)
This item presents multiple developmental milestone data points and asks the candidate to synthesize them into an age estimate. It requires genuine knowledge integration rather than single-fact recall. The distractors (1, 3, 6, 9 months) are plausible and require the candidate to know which milestones belong to which age window. Minor formatting cleanup recommended but no substantive change needed.
KEEP — a45b4a61 (Developmental and Behavioral Pediatrics)
Prader-Willi syndrome with paternal 15q deletion is a high-yield genetics concept that appears in INICET and FMGE. The question provides a clinical vignette (short stature, muscle weakness, learning disability) and a specific chromosomal finding, requiring the candidate to apply imprinting genetics knowledge. The distractors (Angelman, Cri du chat, Rett) are all plausible chromosomal/genetic conditions. This is one of the stronger items in the candidate sample.
FIX — 5e3bfe99 (Nephrology)
The concept (nephrotic syndrome, minimal change disease) is high-yield and the clinical details are appropriate. The fix required is: (1) add the child's age (e.g., "A 4-year-old boy"), which is the single most important discriminating feature for MCD vs other causes of nephrotic syndrome; (2) change the question from "What is your diagnosis?" to "What is the most appropriate initial treatment?" or "What is the most likely pathological finding on renal biopsy?" to elevate to Blooms 3. This transforms a Blooms 2 diagnosis question into a Blooms 3 management or pathophysiology question without requiring a full rewrite.
FIX — 6b23490d (Gastroenterology)
The CF diagnostic workup question is conceptually strong and clinically relevant. The fix required is: clarify in the stem why CFTR mutation analysis is not the preferred next step (e.g., specify that the most common mutations have already been tested and are negative, or that the clinical picture warrants a functional test). Alternatively, if current CF guidelines support CFTR mutation analysis as equally valid, the key should be changed to reflect that. This requires a single expert clinical review before the fix is implemented.
DISABLE — 86ec7aee (Pediatric Critical Care)
"What is the primary treatment for a dehydrated child?" with the correct answer marked as "5% dextrose" is factually unsafe. 5% dextrose is not a rehydration fluid and its use as primary treatment for dehydration can cause hyponatremia. This item should be removed from all active question sets immediately. The concept of dehydration management is already covered at higher quality by PYQ 0d222b0f.
DISABLE — 77226350 (Growth and Development)
"Which is true about the instrument shown?" is entirely image-dependent with no image present in the text. The question is unanswerable in its current state. The concept (Prader orchidometer, testicular volume assessment, puberty staging) is worth covering, but this item cannot be salvaged without the image. Given that the image is not available, disable and replace with a text-based question on the same concept if coverage is needed.
DISABLE — 4a31fd2b (Neonatology)
"A newborn is prone to hypothermia due to which of the following reasons?" with the correct answer marked as "Presence of brown fat" is factually inverted. Brown fat is the primary heat-generating mechanism in neonates and protects against hypothermia. The correct answers should be the other three options. This item will actively mislead candidates who reason correctly about neonatal thermoregulation. Disable immediately.
DISABLE — 58d36a98 (Oncology)
"Which of the following is used in the treatment of ALL?" with the correct answer being "All of the above" (Vincristine, L-asparaginase, Prednisolone). This item uses the structurally weakest possible answer format, tests only recall of a drug list, and is Blooms 1. The concept of ALL treatment is high-yield but is better served by a question asking about the mechanism of action of L-asparaginase, the rationale for steroid use, or the management of a specific complication of ALL therapy. Disable and replace with a higher-quality item on the same concept.