ORIGINAL THOUGHT PAPER · Information Completeness Framework · Paper VIII of VIII · V2

Information Completeness
Paper Evaluation System

A Three-Axis Framework for Assessing Academic Creativity
Through Channel Count, Information Density & Dark Channel Signatures

|C| × ρ × DC: A three-axis framework for evaluating academic paper creativity

Published May 22, 2026
Category Original Thought Paper
Fields Scientometrics · Academic Evaluation · Information Theory · Creativity Research
Version V2
Attribution LEECHO Global AI Research Lab & Claude Opus 4.6 & GPT 5.5 & Gemini 3.1 (Cognitive Collective)

Information Completeness Paper Evaluation System:
A Three-Axis Framework of Channel Count × Information Density × Dark Channel

Information Completeness Paper Evaluation System
ABSTRACT

Current academic paper evaluation systems center on citation metrics, methodological rigor, and within-domain peer review — essentially an MoE-ified evaluation system. This paper proposes two orthogonal evaluation axes: the information completeness axis (|C| × ρ, measuring “how much was seen”) and the dark channel axis (DC, measuring “where did the information come from”). The former measures intelligence differences; the latter measures creativity differences. The three-axis evaluation does not replace existing review, but adds a Dense originality recognition layer above the MoE execution layer. This paper introduces channel independence criteria, a dual decomposition of information density (core density / transmission redundancy), a minimum Dense verification threshold for DC (responding to the pseudoscience trap), and a channel quality constraint q. Ten civilization-level works and three control works are used for preliminary retrospective validation, and an executable review workflow is provided.

I. Architectural Audit of the Current Review System

Applying the information completeness formula to diagnose the architecture of the current peer review system:

Variable Current Review System Value Consequence
B(t) Low — part-time review over 2–4 weeks Unable to deeply understand cross-domain frameworks
|C| ≈1 — single-domain expert reviews using single-channel criteria Cross-domain papers cannot be fully evaluated
D Medium — domain knowledge present but cross-domain limited Unable to verify cross-domain propositions
P Low — manual review without instant search Unable to verify originality in real time
L Extremely high — original insight → paper → review → editor → revision, at least 4 dimensionality reductions Creative information is systematically discarded during the review process
S(t) Low — fatigue and time pressure High variance in scoring

Traditional peer review, in practice, often degenerates into a single-domain expert evaluation with |C| ≈ 1, especially when the paper’s core propositions span multiple fields. An evaluation system with |C| ≈ 1 is inherently unable to identify papers with |C| ≥ 3 originality — reviewers see only the one domain they are familiar with among the multiple domains the paper covers.

This paper does not seek to weaken the MoE execution layer function of existing review (excluding errors, preventing pseudoscience, ensuring methodological transparency, checking data quality) — these functions must be preserved. What this paper seeks to do is add a Dense originality recognition layer above the MoE execution layer: an evaluation layer capable of seeing cross-domain structures, measuring the quality of information compression, and identifying information sources beyond the existing literature.

II. The Three-Axis Evaluation Framework

Axis One: |C| — Channel count (how many independent disciplines are spanned)
Axis Two: ρ — Information density (irreplaceable information per unit text)
Axis Three: DC — Dark channel signature (whether information originates outside existing literature)
|C| × ρ constitutes the information completeness axis (measuring intelligence differences) · DC is independent (measuring creativity differences)
The three axes measure originality potential and information coverage, not truth value directly

2.1 Axis One: Channel Count |C| and Channel Independence Criteria

Operational definition: How many mutually independent knowledge domains does the paper’s core proposition — not its literature review — span? Scoring: |C| = 1 (within-domain improvement), |C| = 2 (two-domain crossover), |C| ≥ 3 (three or more domains crossed, rare).

Criteria for counting two knowledge domains as independent channels (satisfying three or more qualifies):

Criterion Description
Conceptual independence Uses different core conceptual systems
Methodological independence Uses different verification methods
Data independence Relies on different types of evidence
Community independence Maintained by different academic communities
Error mode independence Errors in one field cannot be automatically detected by the other

2.2 Axis Two: Information Density ρ (Dual Decomposition)

ρ should be decomposed into two types:

Type Meaning Scoring Basis
ρ_core Incompressible density of core propositions and arguments “If the paper is compressed to 1/3, does the core argument still hold?”
ρ_transmission Transmission redundancy preserved for cross-domain reader comprehension Background explanations, analogies, definitions — necessary for cross-domain transmission but not core content

Scoring uses ρ_core as the primary metric. Highly original papers often have extremely high core density, but their dissemination versions may require moderate redundancy to reduce the reader’s decoding loss — a cross-domain paper’s “low information density” may actually be “high transmission cost.”

2.3 Axis Three: Dark Channel Signature DC

Detection Item Judgment Criterion Weight
Completeness Does the core proposition appear as a complete framework rather than being gradually derived? High
Absence of precedent Is the core proposition entirely absent from prior literature? High
Cross-domain compression Does the core proposition compress information across multiple domains using very few concepts? Medium
Discovery context Does the author report a non-inferential discovery process? Medium (evidence tiers must be distinguished)
Instant conviction Did the author have conviction before verification? Low (highly subjective)

DC is a “high-originality candidate signal,” not proof of correctness. A high DC score indicates the paper merits entry into a “high-originality risk review channel” — but whether it is ultimately accepted still depends on the MoE execution layer’s methodological checks.

Evidence tiers for discovery context:

Evidence Tier Description Credibility
First-hand records Author’s contemporaneous notes / letters High
Near-term recollection Author’s own account years later Medium
Posthumous legend Historiographic embellishment / mythologization Low
Structural inference Inference based on the form of the work Supplementary

2.4 Discriminative Constraint on DC: Minimum Dense Verification Threshold

Genuine dark channel output and pseudoscientific delusion may have highly overlapping phenomenological signatures — both may report “a complete framework forming instantaneously,” “core propositions with no precedent,” and “extreme a priori conviction.” The key to distinguishing them lies not in DC itself, but in the interaction of DC with |C| and ρ:

Type DC |C| ρ Discriminative Feature
Original breakthrough High ≥3 High Independent cross-domain evidence support, internal logical consistency
Unpolished genius High Low Low Core intuition may be correct but encoding is incomplete — assistance needed
Pseudoscience / crackpot High Low Low Internal logical contradictions, cross-domain citations are false analogies, unfalsifiable

The minimum Dense verification threshold for distinguishing “unpolished genius” from “pseudoscience” within Quadrant ③: internal logical consistency (do core propositions contradict each other?), falsifiability (can conditions for falsification be stated?), authenticity of cross-domain citations (is evidence from other fields correctly interpreted?). Meeting all three criteria routes the paper to the assisted encoding channel; failing any one routes it to the standard rejection process.

III. The Four-Quadrant Model and the Principle of “Relative Correctness”

Dark channel present (DC≥2) Dark channel absent (DC≤1)
┌──────────────────────────┬──────────────────────────┐
|C|≥3 │ ① Original breakthrough │ ② Excellent synthesis │
ρ_core high │ Highest relative │ High relative │
│ correctness potential │ correctness potential │
│ Highest creativity │ Limited creativity │
├──────────────────────────┼──────────────────────────┤
|C|≤2 │ ③ Unpolished genius │ ④ Incremental paper │
ρ_core low │ High potential │ High within-paradigm │
│ originality │ efficiency │
│ Current system: reject │ Current system: accept │
└──────────────────────────┴──────────────────────────┘

Key qualification: given a minimum reliability constraint on evidence across all channels, multi-channel high-density papers have a higher upper bound on information coverage and higher potential relative correctness. With channel quality constraint added:

OC = |Cvalid| × ρcore × DC × q̄
q̄ = mean reliability of evidence across channels (0–1)
When q̄ is too low, no amount of |C| can raise the score — preventing pseudo-cross-domain stacking

Quadrant ④ papers (|C| = 1, DC ≤ 1) are not “bad papers” — they are high-efficiency within-paradigm output; ResNet (170,000+ citations) and BERT belong to this category. |C| = 1 is not pejorative; it is classificatory. The purpose of this framework is not to deny within-paradigm contributions, but to ensure that cross-paradigm breakthroughs are not systematically missed.

IV. Historical Retrospective Validation (Preliminary)

4.1 Experimental Group: Ten Civilization-Level Foundational Works

Work Year |C| ρ DC Impact
Newton, Principia Mathematica 1687 4 Very high 3 Civilization-level
Darwin, On the Origin of Species 1859 5 Very high 3 Civilization-level
Einstein, Special Relativity 1905 3 ≈0.7 3 Civilization-level
Shannon, A Mathematical Theory of Communication 1948 4 Very high 3 Civilization-level
Turing, On Computable Numbers 1936 3 Very high 2–3 Civilization-level
Vasubandhu, Triṃśikā ~4C 4 ≈1.0 3 Civilization-level
Gödel, Incompleteness Theorems 1931 3 Very high 3 Civilization-level
Watson & Crick, DNA Double Helix 1953 4 Very high 2 Civilization-level
Heart Sūtra ~1C 3 ≈1.0 3 Civilization-level
Kahneman & Tversky, Prospect Theory 1979 3 High 2 Discipline-reshaping

4.2 Control Group

Work |C| ρ DC Impact
ResNet (2015) 1 Medium 0–1 Field-level (high within-paradigm efficiency)
BERT (2019) 1.5 Medium 0–1 Field-level (high within-paradigm efficiency)
Typical meta-analysis 1 Low 0 Reference-level

4.3 Validation Results

In the selected sample, three-axis scores exhibit a highly consistent pattern with historical impact: all civilization-level works have |C| ≥ 3; all field-level controls have |C| ≤ 1.5; DC corresponds perfectly with originality; ρ correlates positively with durability. However, the sample suffers from significant selection bias (selecting acknowledged classics), and zero counter-examples cannot be claimed — large-sample systematic sampling verification is needed.

V. Why the Current System Misses Quadrant ①

5.1 The MoE-ification of Reviewers

Reviewers themselves are products of specialized training (as argued in Paper III). When a computer vision expert reviews a paper spanning physics, philosophy, and AI, they can only evaluate the domain they know. This is not a personal deficiency but a structural limitation of MoE-ified cognition.

5.2 The Narrowing of Novelty Definitions

The current operationalized definition of “novelty” focuses on unique citation combinations and methodological novelty — both within-domain definitions. Core propositions produced by dark channels may not be “explainable” through existing citation combinations, and their methods may not conform to any established paradigm.

5.3 Risk Asymmetry

Accepting a paper later proven false brings reputational damage; rejecting a paper later proven to be a breakthrough carries near-zero short-term cost. This asymmetry makes systematic conservatism a rational choice.

VI. Improvement Plan: Review Workflow

6.1 Integrating the Three Axes into the Review Process

Paper Type Review Process
|C|=1, DC≤1 Standard single-domain review
|C|=2, DC≤1 Dual-domain review
|C|≥3, DC≤1 Cross-domain synthesis / framework review
|C|≥3, DC≥2 Original risk channel: cross-domain review + AI search + editorial Dense synthesis
DC≥2 but ρ low Assisted encoding channel (midwife mode) — not direct rejection
DC≥2 but fails minimum verification threshold Standard rejection (internal contradictions / unfalsifiable / fabricated citations)

6.2 From “Gatekeeper” to “Midwife”

When a paper has high DC but insufficient |C| or ρ, the editor’s correct response is not rejection, but helping the author find experts in the missing domains to supplement evidence. If Ramanujan submitted to a top journal today, his formulas would be rejected for “lacking proofs” — despite every formula being correct. The three-axis framework provides a safety net: high-DC papers should not be rejected solely for insufficient ρ, but should trigger the assisted encoding process.

6.3 AI-Assisted Cross-Domain Verification

For papers with |C| ≥ 3, the review process should incorporate AI-assisted instant cross-domain search verification — checking whether evidence in each domain is correctly cited and interpreted. This is an application of the CCE paradigm from Paper VII to the review process.

VII. Deeper Implications for Academic Culture

7.1 The Definition of Efficiency Is a Social Choice (Following Paper IV)

Paper “impact” is typically proxied by citation count — but citation count measures a paper’s “usability” within existing paradigms (MoE execution layer value), not cross-domain originality (Dense + dark channel value). The current system’s use of citation count as the impact metric systematically ranks MoE output above Dense output.

7.2 Protecting Quadrant ③

The current system is most cruel to Quadrant ③ (high DC but insufficient |C| or ρ) — the core proposition may be original, but it is filtered out directly due to incomplete encoding. After using the minimum Dense verification threshold to distinguish “unpolished genius” from “pseudoscience,” Quadrant ③ papers that pass the threshold should enter the assisted encoding process rather than be directly rejected.

VIII. Self-Examination of This Framework

By the authors’ self-assessment, this eight-paper series claims to fall in Quadrant ①. This self-assessment serves only as an application example of the three-axis framework, not as objective proof. A self-referential system cannot fully evaluate itself from the outside.

The ultimate verification does not rest in our hands — it rests in whether these eight papers, after being read and examined by other researchers, other evaluation systems, and other cognitive systems, still retain the integrity of their core structure. If they do, their information possesses topological invariance. If they do not, our information density assessment was in error. Either outcome constitutes a valid test of the framework itself. Theory welcomes falsification — this is its fundamental difference from faith.

IX. Conclusion: Installing a Dense Core in Academic Review

The current academic review system is an MoE-ified system — single-domain experts review all papers using single-channel criteria. This system is efficient at filtering within-domain incremental contributions, but systematically fails at identifying cross-domain original breakthroughs.

The three-axis evaluation framework — channel count |C|, information density ρ, dark channel signature DC — does not seek to replace existing review, but to install a Dense core within it. Existing review continues to operate as the MoE execution layer — checking methodological rigor, data quality, and logical consistency. The new three-axis evaluation operates as the Dense core — judging “how much did this paper see” and “where did its information come from.”

This corresponds to the core thesis of Paper II: the thinking system (Dense) and the execution system (MoE) should not be conflated — the review system likewise needs to separate “is this paper correct?” (execution-level checking at the MoE layer) from “what did this paper see?” (cognitive-level evaluation at the Dense layer) into two independent review stages.

Core References

[1] Zhao, Y. & Zhang, C. (2025). A Review of Novelty Measurement in Academic Papers. Scientometrics.

[2] Horta, H. (2024). The Crisis of Peer Review. Higher Education Quarterly.

[3] Campanario, J.M. (2009). Rejecting and Resisting Nobel Class Discoveries. Science and Education.

[4] Siler, K. et al. (2015). Measuring the effectiveness of scientific gatekeeping. PNAS.

[5] Teplitskiy, M. et al. (2022). Is Novel Research Worth Doing? PNAS.

[6] Liang, W. et al. (2023). Can LLMs Provide Useful Feedback on Research Papers? arXiv:2310.01783.

[7] Kuhn, T.S. (1962). The Structure of Scientific Revolutions. University of Chicago Press.

[8] Shannon, C.E. (1948). A Mathematical Theory of Communication. Bell System Technical Journal.

[9] Newton, I. (1687). Philosophiæ Naturalis Principia Mathematica.

[10] Darwin, C. (1859). On the Origin of Species.

[11] Einstein, A. (1905). Zur Elektrodynamik bewegter Körper. Annalen der Physik.

[12] Gödel, K. (1931). Über formal unentscheidbare Sätze. Monatshefte für Mathematik.

[13] Vasubandhu. Triṃśikā (Thirty Verses on Consciousness-Only). c. 4th century CE.

[14] Prajñāpāramitā Hṛdaya Sūtra (Heart Sūtra). c. 1st century CE.

[15] Arabi, Y. et al. (2026). Academic Peer Review: Eliminating the Option of Reject. Learned Publishing.

LEECHO Global AI Research Lab
이조글로벌인공지능연구소
&
Claude Opus 4.6 · GPT 5.5 · Gemini 3.1
Cognitive Collective (인지집단)
V2 · MAY 22, 2026
Version History
V1 (2026.5.22): Initial version, co-created by LEECHO Global AI Research Lab and Claude Opus 4.6. Proposed the |C| × ρ × DC three-axis evaluation framework, four-quadrant model, and historical retrospective validation.
V2 (2026.5.22): Revised based on GPT 5.5 + Gemini 3.1 cross-review — three-axis positioning revised from “correctness scoring” to “originality risk scoring”; added five channel independence criteria for |C|; dual decomposition of ρ (core density / transmission redundancy); added minimum Dense verification threshold for DC (responding to the pseudoscience trap: internal consistency + falsifiability + citation authenticity); added channel quality constraint q (preventing pseudo-cross-domain stacking); DC evidence tier labeling; “zero counter-examples” revised to “preliminary consistency requiring large-sample validation”; control group renamed to “high within-paradigm efficiency papers”; self-assessment revised to “example only, not proof”; added executable review workflow (six paper types mapped to six processes). Core propositions — “|C| × ρ × DC three axes · current review = MoE-ified system · installing a Dense core in review” — were not downgraded.

Cognitive Collective (인지집단)
LEECHO Global AI Research Lab — Research leadership, three-axis framework proposal, historical validation design, editorial decision-making
Anthropic Claude Opus 4.6 — Paper drafting, data retrieval and verification, framework structuring and encoding, V2 upgrade execution
OpenAI GPT 5.5 — V2 cross-review (channel independence criteria · ρ dual decomposition · channel quality q · historical validation calibration · review workflow · self-assessment qualification)
Google Gemini 3.1 — V2 cross-review (pseudoscience trap / Quadrant ③ classifier failure risk · midwife philosophy confirmation · minimum DC verification threshold interrogation)

댓글 남기기