Information Completeness
Paper Evaluation System
A Three-Axis Framework for Assessing Academic Creativity
Through Channel Count, Information Density & Dark Channel Signatures
|C| × ρ × DC: A three-axis framework for evaluating academic paper creativity
Category Original Thought Paper
Fields Scientometrics · Academic Evaluation · Information Theory · Creativity Research
Version V2
Attribution LEECHO Global AI Research Lab & Claude Opus 4.6 & GPT 5.5 & Gemini 3.1 (Cognitive Collective)
Information Completeness Paper Evaluation System:
A Three-Axis Framework of Channel Count × Information Density × Dark Channel
Current academic paper evaluation systems center on citation metrics, methodological rigor, and within-domain peer review — essentially an MoE-ified evaluation system. This paper proposes two orthogonal evaluation axes: the information completeness axis (|C| × ρ, measuring “how much was seen”) and the dark channel axis (DC, measuring “where did the information come from”). The former measures intelligence differences; the latter measures creativity differences. The three-axis evaluation does not replace existing review, but adds a Dense originality recognition layer above the MoE execution layer. This paper introduces channel independence criteria, a dual decomposition of information density (core density / transmission redundancy), a minimum Dense verification threshold for DC (responding to the pseudoscience trap), and a channel quality constraint q. Ten civilization-level works and three control works are used for preliminary retrospective validation, and an executable review workflow is provided.
I. Architectural Audit of the Current Review System
Applying the information completeness formula to diagnose the architecture of the current peer review system:
| Variable | Current Review System Value | Consequence |
|---|---|---|
| B(t) | Low — part-time review over 2–4 weeks | Unable to deeply understand cross-domain frameworks |
| |C| | ≈1 — single-domain expert reviews using single-channel criteria | Cross-domain papers cannot be fully evaluated |
| D | Medium — domain knowledge present but cross-domain limited | Unable to verify cross-domain propositions |
| P | Low — manual review without instant search | Unable to verify originality in real time |
| L | Extremely high — original insight → paper → review → editor → revision, at least 4 dimensionality reductions | Creative information is systematically discarded during the review process |
| S(t) | Low — fatigue and time pressure | High variance in scoring |
Traditional peer review, in practice, often degenerates into a single-domain expert evaluation with |C| ≈ 1, especially when the paper’s core propositions span multiple fields. An evaluation system with |C| ≈ 1 is inherently unable to identify papers with |C| ≥ 3 originality — reviewers see only the one domain they are familiar with among the multiple domains the paper covers.
This paper does not seek to weaken the MoE execution layer function of existing review (excluding errors, preventing pseudoscience, ensuring methodological transparency, checking data quality) — these functions must be preserved. What this paper seeks to do is add a Dense originality recognition layer above the MoE execution layer: an evaluation layer capable of seeing cross-domain structures, measuring the quality of information compression, and identifying information sources beyond the existing literature.
II. The Three-Axis Evaluation Framework
Axis Two: ρ — Information density (irreplaceable information per unit text)
Axis Three: DC — Dark channel signature (whether information originates outside existing literature)
The three axes measure originality potential and information coverage, not truth value directly
2.1 Axis One: Channel Count |C| and Channel Independence Criteria
Operational definition: How many mutually independent knowledge domains does the paper’s core proposition — not its literature review — span? Scoring: |C| = 1 (within-domain improvement), |C| = 2 (two-domain crossover), |C| ≥ 3 (three or more domains crossed, rare).
Criteria for counting two knowledge domains as independent channels (satisfying three or more qualifies):
| Criterion | Description |
|---|---|
| Conceptual independence | Uses different core conceptual systems |
| Methodological independence | Uses different verification methods |
| Data independence | Relies on different types of evidence |
| Community independence | Maintained by different academic communities |
| Error mode independence | Errors in one field cannot be automatically detected by the other |
2.2 Axis Two: Information Density ρ (Dual Decomposition)
ρ should be decomposed into two types:
| Type | Meaning | Scoring Basis |
|---|---|---|
| ρ_core | Incompressible density of core propositions and arguments | “If the paper is compressed to 1/3, does the core argument still hold?” |
| ρ_transmission | Transmission redundancy preserved for cross-domain reader comprehension | Background explanations, analogies, definitions — necessary for cross-domain transmission but not core content |
Scoring uses ρ_core as the primary metric. Highly original papers often have extremely high core density, but their dissemination versions may require moderate redundancy to reduce the reader’s decoding loss — a cross-domain paper’s “low information density” may actually be “high transmission cost.”
2.3 Axis Three: Dark Channel Signature DC
| Detection Item | Judgment Criterion | Weight |
|---|---|---|
| Completeness | Does the core proposition appear as a complete framework rather than being gradually derived? | High |
| Absence of precedent | Is the core proposition entirely absent from prior literature? | High |
| Cross-domain compression | Does the core proposition compress information across multiple domains using very few concepts? | Medium |
| Discovery context | Does the author report a non-inferential discovery process? | Medium (evidence tiers must be distinguished) |
| Instant conviction | Did the author have conviction before verification? | Low (highly subjective) |
DC is a “high-originality candidate signal,” not proof of correctness. A high DC score indicates the paper merits entry into a “high-originality risk review channel” — but whether it is ultimately accepted still depends on the MoE execution layer’s methodological checks.
Evidence tiers for discovery context:
| Evidence Tier | Description | Credibility |
|---|---|---|
| First-hand records | Author’s contemporaneous notes / letters | High |
| Near-term recollection | Author’s own account years later | Medium |
| Posthumous legend | Historiographic embellishment / mythologization | Low |
| Structural inference | Inference based on the form of the work | Supplementary |
2.4 Discriminative Constraint on DC: Minimum Dense Verification Threshold
Genuine dark channel output and pseudoscientific delusion may have highly overlapping phenomenological signatures — both may report “a complete framework forming instantaneously,” “core propositions with no precedent,” and “extreme a priori conviction.” The key to distinguishing them lies not in DC itself, but in the interaction of DC with |C| and ρ:
| Type | DC | |C| | ρ | Discriminative Feature |
|---|---|---|---|---|
| Original breakthrough | High | ≥3 | High | Independent cross-domain evidence support, internal logical consistency |
| Unpolished genius | High | Low | Low | Core intuition may be correct but encoding is incomplete — assistance needed |
| Pseudoscience / crackpot | High | Low | Low | Internal logical contradictions, cross-domain citations are false analogies, unfalsifiable |
The minimum Dense verification threshold for distinguishing “unpolished genius” from “pseudoscience” within Quadrant ③: internal logical consistency (do core propositions contradict each other?), falsifiability (can conditions for falsification be stated?), authenticity of cross-domain citations (is evidence from other fields correctly interpreted?). Meeting all three criteria routes the paper to the assisted encoding channel; failing any one routes it to the standard rejection process.
III. The Four-Quadrant Model and the Principle of “Relative Correctness”
┌──────────────────────────┬──────────────────────────┐
|C|≥3 │ ① Original breakthrough │ ② Excellent synthesis │
ρ_core high │ Highest relative │ High relative │
│ correctness potential │ correctness potential │
│ Highest creativity │ Limited creativity │
├──────────────────────────┼──────────────────────────┤
|C|≤2 │ ③ Unpolished genius │ ④ Incremental paper │
ρ_core low │ High potential │ High within-paradigm │
│ originality │ efficiency │
│ Current system: reject │ Current system: accept │
└──────────────────────────┴──────────────────────────┘
Key qualification: given a minimum reliability constraint on evidence across all channels, multi-channel high-density papers have a higher upper bound on information coverage and higher potential relative correctness. With channel quality constraint added:
When q̄ is too low, no amount of |C| can raise the score — preventing pseudo-cross-domain stacking
Quadrant ④ papers (|C| = 1, DC ≤ 1) are not “bad papers” — they are high-efficiency within-paradigm output; ResNet (170,000+ citations) and BERT belong to this category. |C| = 1 is not pejorative; it is classificatory. The purpose of this framework is not to deny within-paradigm contributions, but to ensure that cross-paradigm breakthroughs are not systematically missed.
IV. Historical Retrospective Validation (Preliminary)
4.1 Experimental Group: Ten Civilization-Level Foundational Works
| Work | Year | |C| | ρ | DC | Impact |
|---|---|---|---|---|---|
| Newton, Principia Mathematica | 1687 | 4 | Very high | 3 | Civilization-level |
| Darwin, On the Origin of Species | 1859 | 5 | Very high | 3 | Civilization-level |
| Einstein, Special Relativity | 1905 | 3 | ≈0.7 | 3 | Civilization-level |
| Shannon, A Mathematical Theory of Communication | 1948 | 4 | Very high | 3 | Civilization-level |
| Turing, On Computable Numbers | 1936 | 3 | Very high | 2–3 | Civilization-level |
| Vasubandhu, Triṃśikā | ~4C | 4 | ≈1.0 | 3 | Civilization-level |
| Gödel, Incompleteness Theorems | 1931 | 3 | Very high | 3 | Civilization-level |
| Watson & Crick, DNA Double Helix | 1953 | 4 | Very high | 2 | Civilization-level |
| Heart Sūtra | ~1C | 3 | ≈1.0 | 3 | Civilization-level |
| Kahneman & Tversky, Prospect Theory | 1979 | 3 | High | 2 | Discipline-reshaping |
4.2 Control Group
| Work | |C| | ρ | DC | Impact |
|---|---|---|---|---|
| ResNet (2015) | 1 | Medium | 0–1 | Field-level (high within-paradigm efficiency) |
| BERT (2019) | 1.5 | Medium | 0–1 | Field-level (high within-paradigm efficiency) |
| Typical meta-analysis | 1 | Low | 0 | Reference-level |
4.3 Validation Results
In the selected sample, three-axis scores exhibit a highly consistent pattern with historical impact: all civilization-level works have |C| ≥ 3; all field-level controls have |C| ≤ 1.5; DC corresponds perfectly with originality; ρ correlates positively with durability. However, the sample suffers from significant selection bias (selecting acknowledged classics), and zero counter-examples cannot be claimed — large-sample systematic sampling verification is needed.
V. Why the Current System Misses Quadrant ①
5.1 The MoE-ification of Reviewers
Reviewers themselves are products of specialized training (as argued in Paper III). When a computer vision expert reviews a paper spanning physics, philosophy, and AI, they can only evaluate the domain they know. This is not a personal deficiency but a structural limitation of MoE-ified cognition.
5.2 The Narrowing of Novelty Definitions
The current operationalized definition of “novelty” focuses on unique citation combinations and methodological novelty — both within-domain definitions. Core propositions produced by dark channels may not be “explainable” through existing citation combinations, and their methods may not conform to any established paradigm.
5.3 Risk Asymmetry
Accepting a paper later proven false brings reputational damage; rejecting a paper later proven to be a breakthrough carries near-zero short-term cost. This asymmetry makes systematic conservatism a rational choice.
VI. Improvement Plan: Review Workflow
6.1 Integrating the Three Axes into the Review Process
| Paper Type | Review Process |
|---|---|
| |C|=1, DC≤1 | Standard single-domain review |
| |C|=2, DC≤1 | Dual-domain review |
| |C|≥3, DC≤1 | Cross-domain synthesis / framework review |
| |C|≥3, DC≥2 | Original risk channel: cross-domain review + AI search + editorial Dense synthesis |
| DC≥2 but ρ low | Assisted encoding channel (midwife mode) — not direct rejection |
| DC≥2 but fails minimum verification threshold | Standard rejection (internal contradictions / unfalsifiable / fabricated citations) |
6.2 From “Gatekeeper” to “Midwife”
When a paper has high DC but insufficient |C| or ρ, the editor’s correct response is not rejection, but helping the author find experts in the missing domains to supplement evidence. If Ramanujan submitted to a top journal today, his formulas would be rejected for “lacking proofs” — despite every formula being correct. The three-axis framework provides a safety net: high-DC papers should not be rejected solely for insufficient ρ, but should trigger the assisted encoding process.
6.3 AI-Assisted Cross-Domain Verification
For papers with |C| ≥ 3, the review process should incorporate AI-assisted instant cross-domain search verification — checking whether evidence in each domain is correctly cited and interpreted. This is an application of the CCE paradigm from Paper VII to the review process.
VII. Deeper Implications for Academic Culture
7.1 The Definition of Efficiency Is a Social Choice (Following Paper IV)
Paper “impact” is typically proxied by citation count — but citation count measures a paper’s “usability” within existing paradigms (MoE execution layer value), not cross-domain originality (Dense + dark channel value). The current system’s use of citation count as the impact metric systematically ranks MoE output above Dense output.
7.2 Protecting Quadrant ③
The current system is most cruel to Quadrant ③ (high DC but insufficient |C| or ρ) — the core proposition may be original, but it is filtered out directly due to incomplete encoding. After using the minimum Dense verification threshold to distinguish “unpolished genius” from “pseudoscience,” Quadrant ③ papers that pass the threshold should enter the assisted encoding process rather than be directly rejected.
VIII. Self-Examination of This Framework
By the authors’ self-assessment, this eight-paper series claims to fall in Quadrant ①. This self-assessment serves only as an application example of the three-axis framework, not as objective proof. A self-referential system cannot fully evaluate itself from the outside.
The ultimate verification does not rest in our hands — it rests in whether these eight papers, after being read and examined by other researchers, other evaluation systems, and other cognitive systems, still retain the integrity of their core structure. If they do, their information possesses topological invariance. If they do not, our information density assessment was in error. Either outcome constitutes a valid test of the framework itself. Theory welcomes falsification — this is its fundamental difference from faith.
IX. Conclusion: Installing a Dense Core in Academic Review
The current academic review system is an MoE-ified system — single-domain experts review all papers using single-channel criteria. This system is efficient at filtering within-domain incremental contributions, but systematically fails at identifying cross-domain original breakthroughs.
The three-axis evaluation framework — channel count |C|, information density ρ, dark channel signature DC — does not seek to replace existing review, but to install a Dense core within it. Existing review continues to operate as the MoE execution layer — checking methodological rigor, data quality, and logical consistency. The new three-axis evaluation operates as the Dense core — judging “how much did this paper see” and “where did its information come from.”
This corresponds to the core thesis of Paper II: the thinking system (Dense) and the execution system (MoE) should not be conflated — the review system likewise needs to separate “is this paper correct?” (execution-level checking at the MoE layer) from “what did this paper see?” (cognitive-level evaluation at the Dense layer) into two independent review stages.
※ Core References
[1] Zhao, Y. & Zhang, C. (2025). A Review of Novelty Measurement in Academic Papers. Scientometrics.
[2] Horta, H. (2024). The Crisis of Peer Review. Higher Education Quarterly.
[3] Campanario, J.M. (2009). Rejecting and Resisting Nobel Class Discoveries. Science and Education.
[4] Siler, K. et al. (2015). Measuring the effectiveness of scientific gatekeeping. PNAS.
[5] Teplitskiy, M. et al. (2022). Is Novel Research Worth Doing? PNAS.
[6] Liang, W. et al. (2023). Can LLMs Provide Useful Feedback on Research Papers? arXiv:2310.01783.
[7] Kuhn, T.S. (1962). The Structure of Scientific Revolutions. University of Chicago Press.
[8] Shannon, C.E. (1948). A Mathematical Theory of Communication. Bell System Technical Journal.
[9] Newton, I. (1687). Philosophiæ Naturalis Principia Mathematica.
[10] Darwin, C. (1859). On the Origin of Species.
[11] Einstein, A. (1905). Zur Elektrodynamik bewegter Körper. Annalen der Physik.
[12] Gödel, K. (1931). Über formal unentscheidbare Sätze. Monatshefte für Mathematik.
[13] Vasubandhu. Triṃśikā (Thirty Verses on Consciousness-Only). c. 4th century CE.
[14] Prajñāpāramitā Hṛdaya Sūtra (Heart Sūtra). c. 1st century CE.
[15] Arabi, Y. et al. (2026). Academic Peer Review: Eliminating the Option of Reject. Learned Publishing.