Proteins and Enzymes
Analytical deep dive — question counts, mark distribution, mastery curves, command-word breakdowns, and examiner narrative analysis.
3.1.4 is one of the most consistently tested sub-sections in Paper 1, appearing every year 2017–2024 with between 4 and 16 marks per sitting and a dedicated multi-part question in most years. With compound question types assigned to dominant category, APPLICATION now accounts for 55.2% of all marks — a reflection of AQA's consistent preference for novel enzyme contexts over direct recall. The accessibility-mastery gap sits at ~29 percentage points (63.8% versus 34.5%), one of the widest in 3.1: most students reach partial credit, but full-mark answers remain rare. Mastery was lowest in 2023 (22.8%), driven by the logarithmic-scale calculation at Q08.4 and the graph-trend analysis at Q08.3; 2017 and 2022 show the highest mastery at 44.3% and 42.0% respectively. Calculation questions are a small fraction of marks (6.9%) but sit at the bottom of the mastery distribution — the arithmetic is rarely the barrier; graph-reading upstream of the calculation is.
Questions in 3.1.4 cluster into two recurring formats. The most common is the multi-part enzyme question: a novel organism or experimental scenario introduces an enzyme, and sub-questions step through mechanism, rate factors, inhibitor identity, and graph interpretation. These questions build mark by mark along a logical chain — inhibitor binds non-active site → active site shape changes → substrate no longer complementary — and students who cannot identify the mechanism at step one cannot access later parts. The second format is protein structure description: typically 2–3 marks for one or two levels of structure, or a 5-mark question demanding a complete hierarchical account from primary through quaternary. These are largely KNOWLEDGE questions but with narrow marking criteria; precision on bond types and structure names is non-negotiable.
At low tariffs (1 mark), questions test single-concept recall: which bond maintains secondary structure, define quaternary structure, identify an amino acid from its R group. At 2 marks, the expectation is typically mechanism plus consequence, or two independent pieces of factual knowledge. At 3 marks, a full sequential chain is almost always required — stating one stage and not progressing earns 1 mark regardless of depth. The 5-mark questions seen in 2023 and 2024 require comprehensive hierarchical coverage; omitting any level costs marks even if the others are fully described.
Novel contexts are a reliable feature of APPLICATION questions. Questions have introduced phosphorylation cascades, anticancer drug mechanisms, polyphenol oxidase in fruit tissue, and allosteric activators — none of which are described in the specification, but all of which test the same core enzyme principles. Students who have memorised a schema ("inhibitor binds active site") rather than understood the principle ("tertiary structure determines active site shape") consistently fail when the context inverts or extends that schema. Practical questions have appeared with increasing frequency from 2022 onwards: table design, control experiment construction, graph interpretation of enzyme activity, and measurement technique selection have each appeared in the 2022–2024 window.
| Year | Questions | Total marks | Mean accessibility | Mean mastery |
|---|---|---|---|---|
| 2017 | 7 | 14 | 65.4% | 44.3% |
| 2018 | 7 | 15 | 61.3% | 28.7% |
| 2019 | 3 | 7 | 59.3% | 31.0% |
| 2020 | 2 | 4 | — COVID | — COVID |
| 2021 | 4 | 10 | — COVID | — COVID |
| 2022 | 6 | 12 | 67.0% | 42.0% |
| 2023 | 5 | 16 | 63.0% | 22.8% |
| 2024 | 4 | 9 | 65.0% | 33.8% |
| Term | Times credited | Years | Notes |
|---|---|---|---|
| active site | 11 | 2017, 2018, 2019, 2021, 2022, 2023 | Rejected when applied to non-enzyme proteins (2018); must specify enzyme's active site in mechanism contexts |
| enzyme-substrate complex | 10 | 2017, 2018, 2019, 2021, 2022, 2023, 2024 | Abbreviation "E-S" alone not accepted unless full term written first; abbreviated form penalised at 2022 Q08.2 |
| tertiary structure | 9 | 2017, 2018, 2019, 2020, 2022, 2023 | Rejected when "3-D structure", "3-D shape", or "3°" substituted without full term; requires exact phrasing |
| complementary | 7 | 2017, 2019, 2021, 2022, 2023 | Rote use without correct mechanism context not credited; must be applied to active site–substrate relationship |
| amino acids | 5 | 2017, 2022, 2023 | Credited as product of peptide bond hydrolysis and as component of polypeptide chains |
| activation energy | 4 | 2017, 2018, 2021 | Must state enzyme lowers activation energy; "provides energy" rejected; "reduces Ea" accepted |
| peptide bonds | 4 | 2017, 2020, 2023 | Credited for primary structure and hydrolysis; explicitly rejected as the bond disrupted during denaturation |
| hydrogen bonds | 3 | 2017, 2018, 2023 | Credited for secondary structure (alpha helix / beta pleated sheet) and as bond disrupted during denaturation |
| Term | Times credited | Years | Notes |
|---|---|---|---|
| ionic bonds | 3 | 2017, 2018 | Credited specifically in pH-denaturation and R-group charge contexts; not accepted as a secondary structure bond |
| induced fit | 2 | 2018, 2021 | Credited as a descriptor of active site conformational adjustment; appears in Credit_Terms as a key concept though not always stated verbatim in mark scheme text |
| allosteric site | 2 | 2019, 2023 | Not a specification term; credited when used correctly to describe non-active site binding; AQA explicitly notes students used it correctly in 2019 |
| condensation | 2 | 2020, 2023 | Credited for peptide bond formation only; not credited in general structural description contexts |
| kinetic energy | 2 | 2021, 2024 | Credited only when explicitly linked to collision frequency between enzyme and substrate molecules; vague "more kinetic energy" without consequence is not credited |
| Term | Times rejected | Years | Why rejected |
|---|---|---|---|
| provides energy (unqualified) | 3 | 2017, 2018 | Enzyme lowers activation energy — it does not supply energy to the reaction; unqualified "provides energy" implies the wrong mechanism |
| peptide bonds broken (denaturation) | 2 | 2017, 2018 | Denaturation disrupts hydrogen bonds and ionic bonds maintaining tertiary structure; the peptide backbone is not broken |
| disulphide bonds broken (denaturation) | 2 | 2017, 2018 | Disulphide bridges are covalent and relevant to tertiary structure in specific contexts, but are not the primary bond disrupted by heat or pH change in general denaturation questions |
| 3-D structure / 3-D shape | 2 | 2018, 2022 | Insufficient precision; "tertiary structure" is the required term; "shape" alone also penalised in 2022 |
| breaking bonds (enzyme mechanism) | 1 | 2018 | The substrate's bonds are bent/stressed to lower the activation energy; the enzyme does not break substrate bonds — the substrate breaks its own bonds more easily due to bond strain |
| same shape (competitive inhibitor) | 1 | 2019 | Must say "similar shape" or "fits the active site"; "same shape" implies chemical identity, which is rejected |
| active site (non-enzyme proteins) | 1 | 2018 | Non-enzyme proteins such as osteocalcin have binding sites; "active site" is enzyme-specific and is rejected when applied to structural or transport proteins |
- Denaturation is described as breaking peptide bonds or disulphide bonds rather than disrupting hydrogen and ionic bonds that maintain tertiary structure — one of the most repeatedly penalised errors across 2017–2024
- All proteins are treated as enzymes: when presented with a hormone, structural protein, or transport protein, students write about "active sites" and "substrates" — osteocalcin, antibodies, and similar proteins are not enzymes and have binding sites, not active sites
- A modifier that increases reaction rate is identified as an inhibitor because the question introduces a molecule binding to the enzyme — students apply the inhibitor schema without checking graph direction (2018 Q04.4)
- Quaternary structure is assumed to require exactly four polypeptides because of the prefix; the mark scheme requires only "more than one polypeptide" (2022 Q05.1, 2023 Q10.1)
- Two enzymes with the same catalytic activity are explained via degenerate code or mutations in the DNA rather than via identical or similar tertiary structures at the active site (2022 Q05.2)
- "3-D structure" or "3-D shape" written instead of "tertiary structure" — close enough to suggest understanding but consistently rejected (2018, 2022)
- "Breaking bonds" used for the enzyme mechanism instead of "bending bonds" — the distinction matters because the substrate is stressed, not cleaved by the enzyme (2018 Q04.1)
- "Provides energy" written unqualified in enzyme mechanism contexts — must specify that the enzyme lowers the activation energy, not supplies it (2017 P3, 2018 Q04.1)
- "Loss of specificity" rather than "substrate no longer complementary to the active site" — insufficiently precise for the final mark point on non-competitive inhibitor questions
- "E-S" as shorthand for enzyme-substrate complex without writing out the full term — not credited in isolation; the abbreviation must be defined before use
- Rate-of-reaction graphs read as showing change over time rather than as a function of substrate or inhibitor concentration — recurs on non-competitive inhibitor graph questions where the x-axis is lipid/substrate concentration (2019 Q01.2)
- Logarithmic concentration scales misread, producing values one order of magnitude off — affects both trend description and calculation (2023 Q08.3, Q08.4)
- Command word ignored: "explain" questions answered with descriptions of what a graph shows, earning zero; the graph description is not the answer, the mechanism is
- Multi-part chains stopped one step early — competitive inhibition identified but not linked to nucleotide shortage and consequent DNA replication failure (2019 P3 Q06.3)
- Experimental time-to-result misread: longer time to browning equated to a faster reaction rather than a slower one (2024 Q07.2), producing inverted conclusions about which enzyme variety is more active
The accessibility-mastery gap of ~29 percentage points is the defining difficulty signature of 3.1.4. Most questions are reachable — 63.8% of students can score at least one mark — but full marks are achieved by approximately one in three students. This is not a topic where students know nothing; it is a topic where they know approximately enough and lose marks on precision. The gap is largest on APPLICATION questions requiring novel context interpretation and smallest on single-mark KNOWLEDGE recall, which confirms that precision under novel conditions is the driver rather than content coverage.
Within the dataset, questions with mean mastery below 15% cluster around three conditions: they require reading from a non-linear scale (logarithmic graphs, 2023 Q08.4); they involve a compound application where two or three separate pieces of knowledge must be synthesised with data from a table or figure before any mark can be earned (2018 Q05.5, 2022 Q05.3, 2022 Q05.4); or they target a well-known misconception head-on in a context where the misconception produces a confidently wrong answer rather than a blank (inhibitor versus activator in 2018 Q04.4, active site versus binding site for osteocalcin in 2018 P3 Q01.2). Questions exceeding 50% mastery are almost entirely single-mark recall or highly scaffolded identification tasks.
Calculation questions sit at the bottom of the mastery distribution: 2023 Q08.4 (7%), 2018 Q04.3 (15%), 2018 Q04.1 (18%). The 2018 Q04.2 calculation is the outlier at 72% mastery — it was a straightforward standard-form division without a graph-reading component. The pattern is consistent: in 3.1.4, calculations are difficult primarily because of the upstream graph-reading step, not the arithmetic itself. Correctly identifying which values to extract, and from which part of the graph, is the harder task.