| 404media.co |
AI Labs Should Open Source Data Protection Technologies Long Posts |
404 Media |
https://404media.co/openai-furious-deepseek-might-have-stolen-all-the-data-openai-stole-from-us |
| a |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
hazards |
https://a/ |
| aaai.org |
The Paradox of Reuse, Language Models Edition Long Posts |
paper |
https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/viewFile/15623/14799 |
| academic.oup.com |
Tipping Points for Content Ecosystems Long Posts |
seemingly begun |
https://academic.oup.com/pnasnexus/article/3/9/pgae400/7754871 |
| academic.oup.com |
Tipping Points for Content Ecosystems Long Posts |
see work on the topic here |
https://academic.oup.com/pnasnexus/article/3/9/pgae400/7754871 |
| acaworkshop.github.io |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
Algorithmic Collective Action |
https://acaworkshop.github.io/ |
| aclanthology.org |
Attestation across the AI Supply Chain Long Posts |
Deng et al. on training/evaluation overlap via benchmark contamination |
https://aclanthology.org/2024.naacl-long.482 |
| aclanthology.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
paper |
https://aclanthology.org/2021.acl-short.24.pdf |
| aclanthology.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
benchmark contamination |
https://aclanthology.org/2024.naacl-long.482 |
| adityakaran.me |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
Aditya Karan |
https://adityakaran.me/ |
| aeaweb.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
insight |
https://aeaweb.org/articles?id=10.1257%2Fpandp.20181003 |
| aeaweb.org |
AI Technologies are System Maps, and You are a Cartographer Long Posts |
Economics of Maps |
https://aeaweb.org/articles?id=10.1257%2Fjep.34.1.196 |
| aeaweb.org |
AI Technologies are System Maps, and You are a Cartographer Long Posts |
data as labor |
https://aeaweb.org/articles?id=10.1257%2Fpandp.20181003 |
| aeaweb.org |
Attestation across the AI Supply Chain Long Posts |
Jones and Tonetti on the non-rivalry of data |
https://aeaweb.org/articles?id=10.1257%2Faer.20191330 |
| aeaweb.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
work |
https://aeaweb.org/articles?id=10.1257%2Fpandp.20251045 |
| aeaweb.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
data as labor |
https://aeaweb.org/articles?id=10.1257%2Fpandp.20181003 |
| agi.safe.ai |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
Humanity’s Last Exam |
https://agi.safe.ai/ |
| ai.facebook.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
LLaMa |
https://ai.facebook.com/blog/large-language-model-llama-meta-ai |
| ai.google.dev |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
Google Gemini |
https://ai.google.dev/gemini-api/terms |
| ai.se |
Public AI, Data Appraisal, and Data Debates Long Posts |
AI Sweden |
https://ai.se/en |
| ainowinstitute.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://ainowinstitute.org/2025-landscape |
https://ainowinstitute.org/2025-landscape |
| ainowinstitute.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
report |
https://ainowinstitute.org/publications/research/ai-now-2025-landscape-report |
| aisi.gov.uk |
Attestation across the AI Supply Chain Long Posts |
AI Security Institute |
https://aisi.gov.uk/ |
| aisi.gov.uk |
Public AI, Data Appraisal, and Data Debates Long Posts |
UK AISI |
https://aisi.gov.uk/ |
| aisingapore.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
AI Singapore |
https://aisingapore.org/ |
| aitopics.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
evaluation crisis |
https://aitopics.org/doc/news%3A87AE91F4 |
| allenai.org |
Attestation across the AI Supply Chain Long Posts |
AI2 |
https://allenai.org/olmo |
| allenai.org |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
OLMo2 |
https://allenai.org/olmo |
| allenai.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
OlmoTrace |
https://allenai.org/blog/olmotrace |
| allenai.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
Dolma |
https://allenai.org/dolma |
| annualreviews.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
social dilemmas |
https://annualreviews.org/doi/abs/10.1146/annurev.soc.24.1.183 |
| anthropic.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
work |
https://anthropic.com/index/influence-functions |
| anthropic.com |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
techniques |
https://anthropic.com/research/influence-functions |
| anthropic.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
Anthropic |
https://anthropic.com/legal/consumer-terms |
| anthropic.com |
The Coding Agent Data Deal Long Posts |
Privacy Policy |
https://anthropic.com/legal/privacy |
| anthropic.com |
The Coding Agent Data Deal Long Posts |
Anthropic |
https://anthropic.com/news/how-people-use-claude-for-support-advice-and-companionship |
| anthropic.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
work |
https://anthropic.com/index/influence-functions |
| anthropic.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
exactly that. |
https://anthropic.com/index/influence-functions |
| apnews.com |
Attestation across the AI Supply Chain Long Posts |
Google reportedly paying Reddit roughly $60M/year for access to Reddit data |
https://apnews.com/article/a7f131c7cb4225307134ef21d3c6a708 |
| apnews.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
demands |
https://apnews.com/article/wga-writers-strike-demands-d403f5b4666f20e2ce3e379bcaef5f2a |
| apolloresearch.ai |
Attestation across the AI Supply Chain Long Posts |
Apollo Research |
https://apolloresearch.ai/ |
| apolloresearch.ai |
Attestation across the AI Supply Chain Long Posts |
Apollo |
https://apolloresearch.ai/ |
| arch.library.northwestern.edu |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
2022 Dissertation |
https://arch.library.northwestern.edu/concern/generic_works/jq085k38d?locale=en |
| arch.library.northwestern.edu |
Plural AI Data Alignment Long Posts |
dissertation |
https://arch.library.northwestern.edu/concern/generic_works/jq085k38d?locale=en |
| arstechnica.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
protest |
https://arstechnica.com/information-technology/2022/12/artstation-artists-stage-mass-protest-against-ai-generated-artwork |
| arstechnica.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
otherwise |
https://arstechnica.com/information-technology/2022/12/artstation-artists-stage-mass-protest-against-ai-generated-artwork |
| arstechnica.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
initiative |
https://arstechnica.com/information-technology/2023/04/reddit-will-start-charging-ai-models-learning-from-its-extremely-human-archives |
| arxiv.org |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
link |
https://arxiv.org/abs/2110.14049 |
| arxiv.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
had |
https://arxiv.org/abs/2305.13238 |
| arxiv.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
InstructGPT |
https://arxiv.org/abs/2203.02155 |
| arxiv.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation |
https://arxiv.org/abs/2602.16763 |
| arxiv.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
How Well Does Agent Development Reflect Real-World Work |
https://arxiv.org/abs/2603.01203 |
| arxiv.org |
AI Labs Should Open Source Data Protection Technologies Long Posts |
watermarking |
https://arxiv.org/abs/2301.10226 |
| arxiv.org |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
prior theoretical work |
https://arxiv.org/abs/2410.12633 |
| arxiv.org |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
paper |
https://arxiv.org/abs/2505.00195 |
| arxiv.org |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
here |
https://arxiv.org/abs/2505.00195 |
| arxiv.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
leverage |
https://arxiv.org/abs/2012.09995 |
| arxiv.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
arXiv |
https://arxiv.org/abs/2404.12590 |
| arxiv.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
arXiv |
https://arxiv.org/abs/2501.11457v1 |
| arxiv.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Meng et al. |
https://arxiv.org/abs/2502.12658v1 |
| arxiv.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Collective Bargaining for Information |
https://arxiv.org/abs/2506.10272 |
| arxiv.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
see this ICML workshop paper |
https://arxiv.org/abs/2507.09296 |
| arxiv.org |
Attestation across the AI Supply Chain Long Posts |
datasheets |
https://arxiv.org/abs/1803.09010 |
| arxiv.org |
Attestation across the AI Supply Chain Long Posts |
data counterfactual estimation method |
https://arxiv.org/abs/1904.02868 |
| arxiv.org |
Attestation across the AI Supply Chain Long Posts |
"everything in the whole wide world benchmarking" |
https://arxiv.org/abs/2111.15366 |
| arxiv.org |
Attestation across the AI Supply Chain Long Posts |
Liang et al. on holistic evaluation and coverage gaps |
https://arxiv.org/abs/2211.09110 |
| arxiv.org |
Attestation across the AI Supply Chain Long Posts |
work |
https://arxiv.org/abs/2504.06219 |
| arxiv.org |
Bing Rewards for the AI Age Long Posts |
data dividend |
https://arxiv.org/abs/1912.00757 |
| arxiv.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
troubling trends |
https://arxiv.org/abs/1807.03341 |
| arxiv.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
InstructGPT paper |
https://arxiv.org/abs/2203.02155 |
| arxiv.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
paper |
https://arxiv.org/abs/2203.02155 |
| arxiv.org |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
necessarily |
https://arxiv.org/html/2512.23032 |
| arxiv.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
GPT-3 |
https://arxiv.org/abs/2005.14165 |
| arxiv.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
data leverage |
https://arxiv.org/abs/2012.09995 |
| arxiv.org |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
leverage |
https://arxiv.org/abs/2012.09995 |
| arxiv.org |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
attentional agency |
https://arxiv.org/abs/2405.14614 |
| arxiv.org |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
paper |
https://arxiv.org/abs/2012.09995 |
| arxiv.org |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
paper |
https://arxiv.org/pdf/2305.13238 |
| arxiv.org |
Google and TikTok rank bundles of information; ChatGPT ranks grains. Long Posts |
related |
https://arxiv.org/abs/2107.10939 |
| arxiv.org |
Google and TikTok rank bundles of information; ChatGPT ranks grains. Long Posts |
preprint |
https://arxiv.org/abs/2405.14614 |
| arxiv.org |
Google and TikTok rank bundles of information; ChatGPT ranks grains. Long Posts |
paper |
https://arxiv.org/abs/2405.14614 |
| arxiv.org |
Google and TikTok rank bundles of information; ChatGPT ranks grains. Long Posts |
ideas |
https://arxiv.org/abs/2406.18682 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
arxiv |
https://arxiv.org/abs/2012.09995 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
FAccT 2025: arxiv |
https://arxiv.org/abs/2405.14614 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
studying |
https://arxiv.org/abs/2409.19104 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
AIES 2025: arxiv |
https://arxiv.org/abs/2409.19104 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
AIES 2025: arxiv |
https://arxiv.org/abs/2409.19104 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
arxiv |
https://arxiv.org/abs/2505.00195 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
knowledge gaps |
https://arxiv.org/abs/2505.24195 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
2025 NeurIPS position paper: arxiv |
https://arxiv.org/abs/2506.10272 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
CodeML @ ICML paper: arxiv |
https://arxiv.org/abs/2507.09296 |
| arxiv.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
auditing |
https://arxiv.org/abs/2508.10010 |
| arxiv.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
the |
https://arxiv.org/abs/2005.04176 |
| arxiv.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
effort |
https://arxiv.org/abs/2403.13073 |
| arxiv.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
already |
https://arxiv.org/abs/2502.06559v1 |
| arxiv.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
favour |
https://arxiv.org/abs/2504.02234 |
| arxiv.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
been |
https://arxiv.org/abs/2504.20879 |
| arxiv.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
Hayes et al. |
https://arxiv.org/abs/2505.18773 |
| arxiv.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
work |
https://arxiv.org/abs/2506.15553 |
| arxiv.org |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
training data influence estimation |
https://arxiv.org/abs/2212.04612 |
| arxiv.org |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
data attribution |
https://arxiv.org/abs/2303.14186 |
| arxiv.org |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
technical report |
https://arxiv.org/abs/2501.12948 |
| arxiv.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
paper |
https://arxiv.org/abs/2501.16946 |
| arxiv.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://arxiv.org/abs/2501.16946 |
https://arxiv.org/abs/2501.16946 |
| arxiv.org |
Plural AI Data Alignment Long Posts |
arguments |
https://arxiv.org/abs/2109.13916 |
| arxiv.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
Shapley |
https://arxiv.org/abs/2110.14049 |
| arxiv.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
ablation |
https://arxiv.org/abs/2402.00159 |
| arxiv.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
data |
https://arxiv.org/abs/2402.07827 |
| arxiv.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
influence |
https://arxiv.org/abs/2405.13954 |
| arxiv.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
consent |
https://arxiv.org/abs/2407.14933 |
| arxiv.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
research |
https://arxiv.org/abs/2410.15661 |
| arxiv.org |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
sources |
https://arxiv.org/abs/2010.12282 |
| arxiv.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
construct validity |
https://arxiv.org/abs/2111.15366 |
| arxiv.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
early |
https://arxiv.org/abs/2203.15827 |
| arxiv.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
some |
https://arxiv.org/abs/2305.10429 |
| arxiv.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Metadata Conditioning Accelerates Language Model Pre-training |
https://arxiv.org/abs/2501.01956 |
| arxiv.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
providing |
https://arxiv.org/html/2402.11537v3 |
| arxiv.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
work |
https://arxiv.org/html/2406.11794v1 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
work |
https://arxiv.org/abs/2004.04906 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
RAG |
https://arxiv.org/abs/2005.11401 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
versions |
https://arxiv.org/abs/2005.14165 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
tool |
https://arxiv.org/abs/2112.09332 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
calling |
https://arxiv.org/abs/2210.03629 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
RAG-bench |
https://arxiv.org/abs/2306.03091 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
SWE-bench |
https://arxiv.org/abs/2310.06770 |
| arxiv.org |
The Coding Agent Data Deal Long Posts |
SWE-agent |
https://arxiv.org/abs/2405.15793 |
| arxiv.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
arXiv |
https://arxiv.org/abs/2305.00118 |
| arxiv.org |
Tipping Points for Content Ecosystems Long Posts |
preference data |
https://arxiv.org/abs/2305.18290 |
| arxiv.org |
Tipping Points for Content Ecosystems Long Posts |
Phi 1.5 |
https://arxiv.org/abs/2309.05463 |
| arxiv.org |
Tipping Points for Content Ecosystems Long Posts |
dispossession |
https://arxiv.org/abs/2403.13073 |
| arxiv.org |
Tipping Points for Content Ecosystems Long Posts |
likewise |
https://arxiv.org/abs/2403.13812 |
| arxiv.org |
Tipping Points for Content Ecosystems Long Posts |
begun |
https://arxiv.org/abs/2410.08044 |
| arxiv.org |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
research |
https://arxiv.org/abs/2304.03442 |
| arxiv.org |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
criticized |
https://arxiv.org/abs/2401.08572 |
| arxiv.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
research |
https://arxiv.org/abs/2305.00118 |
| arxiv.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
concept erasure |
https://arxiv.org/abs/2306.03819 |
| arxiv.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
SILO |
https://arxiv.org/abs/2308.04430 |
| arxiv.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
Liu et al |
https://arxiv.org/abs/2308.05374 |
| assets.mofoprod.net |
Public AI, Data Appraisal, and Data Debates Long Posts |
Mozilla |
https://assets.mofoprod.net/network/documents/Public_AI_Mozilla.pdf |
| atproto.wiki |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
WIP |
https://atproto.wiki/en/wiki/reference/core-architecture/pds |
| attest.org |
Attestation across the AI Supply Chain Long Posts |
Attest |
https://attest.org/ |
| authorsguild.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
it |
https://authorsguild.org/news/meta-libgen-ai-training-book-heist-what-authors-need-to-know |
| averi.org |
Attestation across the AI Supply Chain Long Posts |
Frontier AI Auditing |
https://averi.org/ourwork/frontier-ai-auditing |
| averi.org |
Attestation across the AI Supply Chain Long Posts |
ecosystem |
https://averi.org/ourwork/frontier-ai-auditing |
| averi.org |
Attestation across the AI Supply Chain Long Posts |
auditing |
https://averi.org/ourwork/frontier-ai-auditing |
| averi.org |
Attestation across the AI Supply Chain Long Posts |
Brundage et al. on frontier AI auditing |
https://averi.org/ourwork/frontier-ai-auditing |
| averi.org |
Attestation across the AI Supply Chain Long Posts |
Frontier AI auditing |
https://averi.org/ourwork/frontier-ai-auditing |
| aws.amazon.com |
Attestation across the AI Supply Chain Long Posts |
12 months of "Anonymized, non-aggregated granular consumer-level data across all asset classes" from Equifax for $175k |
https://aws.amazon.com/marketplace/pp/prodview-vgmxklm42lhmq |
| axios.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
Axios |
https://axios.com/2025/05/28/ai-jobs-white-collar-unemployment-anthropic |
| axios.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://www.axios.com/2025/05/28/ai-jobs-white-collar-unemployment-anthropic |
https://axios.com/2025/05/28/ai-jobs-white-collar-unemployment-anthropic |
| azure.microsoft.com |
Bing Rewards for the AI Age Long Posts |
credits |
https://azure.microsoft.com/en-us/pricing/member-offers/credit-for-visual-studio-subscribers |
| bcassessment.ca |
Public AI, Data Appraisal, and Data Debates Long Posts |
BC Assessment |
https://bcassessment.ca/ |
| berggruen.org |
Bing Rewards for the AI Age Long Posts |
proposal |
https://berggruen.org/ideas/articles/a-data-dividend-that-works-steps-toward-building-an-equitable-data-economy |
| beta.openai.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
OpenAI's Model Index |
https://beta.openai.com/docs/model-index-for-researchers |
| betterconflictbulletin.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
Jonathan Stray |
https://betterconflictbulletin.org/p/openai-just-agreed-to-power-autonomous |
| bigcode-project.org |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
BigCode |
https://bigcode-project.org/docs/about/the-stack |
| bit.ly |
Public AI, Data Appraisal, and Data Debates Long Posts |
PAINT |
https://bit.ly/publicAIpaper |
| blog.datadividendproject.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
https://blog.datadividendproject.com/data-strikes/ |
https://blog.datadividendproject.com/data-strikes |
| blog.samaltman.com |
Tipping Points for Content Ecosystems Long Posts |
three observations |
https://blog.samaltman.com/three-observations |
| blogs.gwu.edu |
Public AI, Data Appraisal, and Data Debates Long Posts |
legality |
https://blogs.gwu.edu/law-eti/ai-litigation-database |
| bloodinthemachine.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
essay |
https://bloodinthemachine.com/p/the-ai-jobs-apocalypse-is-for-the |
| bloodinthemachine.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://www.bloodinthemachine.com/p/the-ai-jobs-apocalypse-is-for-the |
https://bloodinthemachine.com/p/the-ai-jobs-apocalypse-is-for-the |
| bloomberg.com |
AI Labs Should Open Source Data Protection Technologies Long Posts |
Bloomberg |
https://bloomberg.com/news/articles/2025-01-29/microsoft-probing-if-deepseek-linked-group-improperly-obtained-openai-data |
| brave.com |
Attestation across the AI Supply Chain Long Posts |
101M monthly active users as of September 30, 2025 |
https://brave.com/blog/100m-mau |
| brenthecht.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
conscious data contribution |
https://brenthecht.com/publications/CollectiveIntelligence2020_ConsciousDataContribution.pdf |
| brenthecht.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
funnel your data labor |
https://brenthecht.com/publications/CollectiveIntelligence2020_ConsciousDataContribution.pdf |
| brenthecht.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
restaurants reviews |
https://brenthecht.com/publications/cscw2020_restaurantratings.pdf |
| brenthecht.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
restaurant review platforms |
https://brenthecht.com/publications/cscw2020_restaurantratings.pdf |
| brenthecht.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
study |
https://brenthecht.com/publications/icwsm17_googlewikipedia.pdf |
| brenthecht.com |
Tipping Points for Content Ecosystems Long Posts |
classic |
https://brenthecht.com/publications/icwsm17_googlewikipedia.pdf |
| brookings.edu |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
“data labor” |
https://brookings.edu/blog/techtank/2018/02/21/should-we-treat-data-as-labor-lets-open-up-the-discussion |
| bsc.es |
Public AI, Data Appraisal, and Data Debates Long Posts |
Barcelona Supercomputing Center |
https://bsc.es/ |
| bsky.app |
AI Labs Should Open Source Data Protection Technologies Long Posts |
Bluesky |
https://bsky.app/profile/404media.co/post/3lgvcq53j322a |
| businessinsider.com |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
coverage |
https://businessinsider.com/meta-ai-llama-models-training-data-ablation-2025-4 |
| businessinsider.com |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
endorsed |
https://businessinsider.com/katy-perry-anthropic-department-of-defense-spat-claude-subscription-2026-2 |
| businessinsider.com |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
post-training |
https://businessinsider.com/anthropic-surge-ai-leaked-list-sites-2025-7 |
| businessinsider.com |
Bing Rewards for the AI Age Long Posts |
way |
https://businessinsider.com/microsoft-limits-bing-chat-exchanges-and-conversation-lengths-2023-2 |
| businessinsider.com |
Bing Rewards for the AI Age Long Posts |
issues |
https://businessinsider.com/microsoft-limits-bing-chat-exchanges-and-conversation-lengths-2023-2 |
| businessinsider.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
Business Insider |
https://businessinsider.com/openais-latest-chatgpt-version-hides-training-on-copyrighted-material-2023-8 |
| c2pa.org |
Attestation across the AI Supply Chain Long Posts |
C2PA |
https://c2pa.org/ |
| cacm.acm.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Datasheets for Datasets |
https://cacm.acm.org/research/datasheets-for-datasets |
| carper.ai |
"Many Models" and "Track Changes" for AI: Some Thoughts on LLM Interfaces Long Posts |
model |
https://carper.ai/diff-models-a-new-way-to-edit-code |
| casmi.northwestern.edu |
Plural AI Data Alignment Long Posts |
efforts |
https://casmi.northwestern.edu/news/articles/2023/defining-safety-in-artificial-intelligence.html |
| cb4i.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
CBI |
https://cb4i.org/ |
| cbc.ca |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
implementing a news ban |
https://cbc.ca/news/business/meta-block-news-1.7174031 |
| cdn.governance.ai |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
Addressing the U.S. Labor Market Impacts of Advanced AI |
https://cdn.governance.ai/RFI_Labor_Impacts_March-2025_Sam_Manning.pdf |
| cdn.governance.ai |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://cdn.governance.ai/RFI_Labor_Impacts_March-2025_Sam_Manning.pdf |
https://cdn.governance.ai/RFI_Labor_Impacts_March-2025_Sam_Manning.pdf |
| cdn.openai.com |
"People First" Policy Ideas that Complement Each Other (through better data flow) Long Posts |
here |
https://cdn.openai.com/pdf/561e7512-253e-424b-9734-ef4098440601/Industrial%20Policy%20for%20the%20Intelligence%20Age.pdf |
| cdn.openai.com |
The Coding Agent Data Deal Long Posts |
OpenAI |
https://cdn.openai.com/pdf/a253471f-8260-40c6-a2cc-aa93fe9f142e/economic-research-chatgpt-usage-paper.pdf |
| chat.openanonymity.ai |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
oa-chat |
https://chat.openanonymity.ai/ |
| chatgpt.com |
Selling AGI like AG1: Will Consumers Push Back Against Proprietary Blends of Herbs and of Data? Long Posts |
share link |
https://chatgpt.com/share/675a540a-8208-800f-9a2f-c448eea49b71 |
| chatgptiseatingtheworld.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
documents |
https://chatgptiseatingtheworld.com/2025/01/15/kadrey-refiles-motion-to-file-third-amended-consolidated-complaint-with-partially-unredacted-exhibits-per-judge-chhabrias-order |
| cifar.ca |
Public AI, Data Appraisal, and Data Debates Long Posts |
Canadian AI Institute |
https://cifar.ca/ai |
| cip.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
digital commons |
https://cip.org/research/generative-ai-digital-commons |
| cip.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
here |
https://cip.org/research/generative-ai-digital-commons |
| cip.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
coverage |
https://cip.org/research/generative-ai-digital-commons |
| citizensandtech.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
https://citizensandtech.org/2020/08/collective-refusal/ |
https://citizensandtech.org/2020/08/collective-refusal |
| clarip.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
Dashboard Act |
https://clarip.com/blog/senate-dashboard-act |
| claude.ai |
The Coding Agent Data Deal Long Posts |
Data Privacy Controls page |
https://claude.ai/settings/data-privacy-controls |
| cloudflare.com |
Attestation across the AI Supply Chain Long Posts |
Cloudflare's public support of anti-scraping |
https://cloudflare.com/en-ca/press-releases/2025/cloudflare-just-changed-how-ai-crawlers-scrape-the-internet-at-large |
| cloudflare.com |
The Coding Agent Data Deal Long Posts |
deals |
https://cloudflare.com/en-ca/press/press-releases/2025/cloudflare-just-changed-how-ai-crawlers-scrape-the-internet-at-large |
| cnbc.com |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
surging |
https://cnbc.com/2026/02/28/anthropics-claude-apple-apps.html |
| cnbc.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
AI race |
https://cnbc.com/2025/01/23/scale-ai-ceo-says-china-has-quickly-caught-the-us-with-deepseek.html |
| code.claude.com |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
web |
https://code.claude.com/docs/en/claude-code-on-the-web |
| code.claude.com |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
Claude Code |
https://code.claude.com/docs/en/overview |
| code.claude.com |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
sandboxing |
https://code.claude.com/docs/en/sandboxing |
| code.claude.com |
The Coding Agent Data Deal Long Posts |
docs page |
https://code.claude.com/docs/en/data-usage |
| commoncrawl.github.io |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
list of common domains |
https://commoncrawl.github.io/cc-crawl-statistics/plots/domains |
| commoncrawl.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
CommonCrawl Data |
https://commoncrawl.org/the-data |
| commoncrawl.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
CommonCrawl |
https://commoncrawl.org/the-data |
| commoncrawl.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
Common Crawl |
https://commoncrawl.org/the-data |
| commoncrawl.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
Common Crawl |
https://commoncrawl.org/ |
| commoncrawl.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Common Crawl |
https://commoncrawl.org/ |
| commons.wikimedia.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Wikimedia Commons |
https://commons.wikimedia.org/wiki/File:Jheronimus_Bosch_011.jpg |
| commons.wikimedia.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Wikimedia Commons |
https://commons.wikimedia.org/wiki/File:Shichiri_Ferry_Boat.jpg |
| commons.wikimedia.org |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
Wikimedia Commons |
https://commons.wikimedia.org/wiki/File:BL_Royal_Vincent_of_Beauvais.jpg |
| commons.wikimedia.org |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
Wikimedia Commons |
https://commons.wikimedia.org/wiki/File:Hartford_Steam_Boiler_Inspection_and_Insurance_Co._ad.png |
| commons.wikimedia.org |
Google and TikTok rank bundles of information; ChatGPT ranks grains. Long Posts |
Wikimedia Commons. |
https://commons.wikimedia.org/wiki/File:Bonifacio_falaises_Grain_de_Sable.jpg |
| commons.wikimedia.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
Wikimedia Commons. |
https://commons.wikimedia.org/wiki/File:Operation_of_trains_and_station_work_and_telegraphy_(1914 |
| commons.wikimedia.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
Wikimedia Commons. |
https://commons.wikimedia.org/wiki/File:Sedan_Plowshare_Crater.jpg |
| commons.wikimedia.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
wikimedia commons |
https://commons.wikimedia.org/wiki/File:Book_of_Royal_Gemstones_WDL2839.jpg |
| commons.wikimedia.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
wikimedia commons |
https://commons.wikimedia.org/wiki/File:Isaac_Lea_collection_of_precious_stones._Miss_Margaret_W._Moodey_in_charge_LCCN2016892128.jpg |
| commons.wikimedia.org |
The Coding Agent Data Deal Long Posts |
Wikimedia Commons |
https://commons.wikimedia.org/wiki/Category:Patterns |
| commons.wikimedia.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
Rama |
https://commons.wikimedia.org/wiki/User:Rama |
| computerworld.com |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
In an AI-perfect world, it’s time to prove you’re human |
https://computerworld.com/article/4114605/in-an-ai-perfect-world-its-time-to-prove-youre-human.html |
| confer.to |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
confer.to |
https://confer.to/ |
| consensus.app |
The Paradox of Reuse, Language Models Edition Long Posts |
Consensus |
https://consensus.app/search |
| creativecommons.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
CC BY-SA 2.0 |
https://creativecommons.org/licenses/by-sa/2.0 |
| creativecommons.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
Creative Commons Licence |
https://creativecommons.org/licenses/by-sa/2.0 |
| creativecommons.org |
Tipping Points for Content Ecosystems Long Posts |
CC BY 2.0 |
https://creativecommons.org/licenses/by/2.0/deed.en |
| crfm.stanford.edu |
Attestation across the AI Supply Chain Long Posts |
Foundation Model Transparency Index |
https://crfm.stanford.edu/fmti/December-2025/index.html |
| crowddynamicslab.github.io |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
blog |
https://crowddynamicslab.github.io/collective/action,/machine/learning/2025/06/19/two-collectives |
| crtc.gc.ca |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Canada |
https://crtc.gc.ca/eng/industr/info.htm |
| cs.cornell.edu |
The Coding Agent Data Deal Long Posts |
clickthrough data |
https://cs.cornell.edu/~tj/publications/joachims_etal_05a.pdf |
| cs.stanford.edu |
The Coding Agent Data Deal Long Posts |
satisfaction |
https://cs.stanford.edu/people/ashton/pubs/audit.pdf |
| csss.uw.edu |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
CSSS |
https://csss.uw.edu/ |
| darioamodei.com |
Tipping Points for Content Ecosystems Long Posts |
post |
https://darioamodei.com/machines-of-loving-grace |
| data-workers.org |
Attestation across the AI Supply Chain Long Posts |
Data Workers' Inquiry |
https://data-workers.org/ |
| data-workers.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Data Workers’ Inquiry |
https://data-workers.org/ |
| data.stackexchange.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
raw data |
https://data.stackexchange.com/stackoverflow/query/1882534/questions-per-month |
| datadividendproject.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
project |
https://datadividendproject.com/ |
| datadividends.org |
"People First" Policy Ideas that Complement Each Other (through better data flow) Long Posts |
data dividends |
https://datadividends.org/ |
| datadividends.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
"data dividend" |
https://datadividends.org/ |
| datadividends.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
data dividend |
https://datadividends.org/ |
| datadividends.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
report |
https://datadividends.org/ |
| datalabelers.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Data Labelers Association |
https://datalabelers.org/about |
| dataleverage.leaflet.pub |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
attestation-forward data strategy |
https://dataleverage.leaflet.pub/3mizn5hsjg5vo |
| dataleverage.substack.com |
"People First" Policy Ideas that Complement Each Other (through better data flow) Long Posts |
longer |
https://dataleverage.substack.com/p/almost-everybody-including-both-data |
| dataleverage.substack.com |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
newsletter |
https://dataleverage.substack.com/p/is-zuckerberg-right-to-say-that-your |
| dataleverage.substack.com |
AI Labs Should Open Source Data Protection Technologies Long Posts |
post |
https://dataleverage.substack.com/p/live-by-the-free-content-for-training |
| dataleverage.substack.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
maps |
https://dataleverage.substack.com/p/ai-technologies-are-system-maps-and-you-are-a-cartographer |
| dataleverage.substack.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
post |
https://dataleverage.substack.com/p/how-do-we-know-our-ai-output-is-good |
| dataleverage.substack.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
newsletters |
https://dataleverage.substack.com/p/perplexity-ceos-interaction-with |
| dataleverage.substack.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Tipping Points for Content Ecosystems |
https://dataleverage.substack.com/p/tipping-points-for-content-ecosystems |
| dataleverage.substack.com |
April 2026 small points Meta Notes |
https://dataleverage.substack.com/p/attestation-across-the-ai-supply |
https://dataleverage.substack.com/p/attestation-across-the-ai-supply |
| dataleverage.substack.com |
April 2026 small points Meta Notes |
https://dataleverage.substack.com/p/measuring-relative-ai-alignment-in-terms-of-data-pipelines |
https://dataleverage.substack.com/p/measuring-relative-ai-alignment-in-terms-of-data-pipelines |
| dataleverage.substack.com |
Attestation across the AI Supply Chain Long Posts |
"data rules" |
https://dataleverage.substack.com/p/almost-everybody-including-both-data |
| dataleverage.substack.com |
Attestation across the AI Supply Chain Long Posts |
series |
https://dataleverage.substack.com/p/evaluation-data-leverage-advances |
| dataleverage.substack.com |
Attestation across the AI Supply Chain Long Posts |
of |
https://dataleverage.substack.com/p/how-do-we-know-our-ai-output-is-good |
| dataleverage.substack.com |
Attestation across the AI Supply Chain Long Posts |
earlier |
https://dataleverage.substack.com/p/selling-agi-like-ag1-will-the-market |
| dataleverage.substack.com |
Attestation across the AI Supply Chain Long Posts |
"quasi-enclosure" |
https://dataleverage.substack.com/p/the-paradox-of-reuse-in-2026-a-case |
| dataleverage.substack.com |
Attestation across the AI Supply Chain Long Posts |
"tipping points" |
https://dataleverage.substack.com/p/tipping-points-for-content-ecosystems |
| dataleverage.substack.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
alignment |
https://dataleverage.substack.com/p/measuring-relative-ai-alignment-in-terms-of-data-pipelines |
| dataleverage.substack.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
Subscribe now |
https://dataleverage.substack.com/subscribe |
| dataleverage.substack.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage |
https://dataleverage.substack.com/p/ai-artist-or-ai-art-thief-innovation-public-mandates-and-the-case-for-talking-in-terms-of-leverage |
| dataleverage.substack.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
AI Technologies are System Maps, and You are a Cartographer |
https://dataleverage.substack.com/p/ai-technologies-are-system-maps-and-you-are-a-cartographer |
| dataleverage.substack.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
Bing Rewards for the AI Age |
https://dataleverage.substack.com/p/bing-rewards-for-the-ai-age |
| dataleverage.substack.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) |
https://dataleverage.substack.com/p/chatgpt-is-awesome-and-scary-you-deserve-credit |
| dataleverage.substack.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
Plural AI Data Alignment |
https://dataleverage.substack.com/p/measuring-relative-ai-alignment-in-terms-of-data-pipelines |
| dataleverage.substack.com |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
post |
https://dataleverage.substack.com/p/google-and-tiktok-rank-bundles-of |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/ai-labs-could-open-source-data-protection |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
FAccT 2025: substack |
https://dataleverage.substack.com/p/algorithmic-collective-action-with |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/building-a-data-pipeworks-for-democratic |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/each-instance-of-ai-utility-stems |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/evaluation-data-leverage-advances |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/google-and-tiktok-rank-bundles-of |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/how-do-we-know-our-ai-output-is-good |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
longer substack |
https://dataleverage.substack.com/p/is-zuckerberg-right-to-say-that-your |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/live-by-the-free-content-for-training |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/many-models-and-track-changes-for |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack microblog |
https://dataleverage.substack.com/p/microblog-one-book-is-worth-006-benchmark |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/on-ai-driven-job-apocalypses-and |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/public-ai-data-appraisal-and-data |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/selling-agi-like-ag1-will-the-market |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/tipping-points-for-content-ecosystems |
| dataleverage.substack.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
substack |
https://dataleverage.substack.com/p/which-datasets-should-we-assume-are |
| dataleverage.substack.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
post 2 |
https://dataleverage.substack.com/p/each-instance-of-ai-utility-stems |
| dataleverage.substack.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
Each Instance of "AI Utility" stems from some human act(s) of information recording and ranking |
https://dataleverage.substack.com/p/each-instance-of-ai-utility-stems |
| dataleverage.substack.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
Eval Data Leverage post |
https://dataleverage.substack.com/p/evaluation-data-leverage-advances |
| dataleverage.substack.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
post 1 |
https://dataleverage.substack.com/p/google-and-tiktok-rank-bundles-of |
| dataleverage.substack.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
Selling AGI like AG1: Will Consumers Push Back Against Proprietary Blends of Herbs and of Data? |
https://dataleverage.substack.com/p/selling-agi-like-ag1-will-the-market |
| dataleverage.substack.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
eval data leverage |
https://dataleverage.substack.com/p/evaluation-data-leverage-advances |
| dataleverage.substack.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
Tipping Points |
https://dataleverage.substack.com/p/tipping-points-for-content-ecosystems |
| dataleverage.substack.com |
Perplexity CEO's Interaction with Striking New York Times Workers Does Not Reflect Well on the AI Industry Long Posts |
before |
https://dataleverage.substack.com/p/measuring-relative-ai-alignment-in-terms-of-data-pipelines |
| dataleverage.substack.com |
Perplexity CEO's Interaction with Striking New York Times Workers Does Not Reflect Well on the AI Industry Long Posts |
argument |
https://dataleverage.substack.com/p/will-the-new-york-times-data-strike |
| dataleverage.substack.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
cartographic labor |
https://dataleverage.substack.com/p/ai-technologies-are-system-maps-and-you-are-a-cartographer |
| dataleverage.substack.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
early |
https://dataleverage.substack.com/p/dont-give-openai-all-the-credit-for |
| dataleverage.substack.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
here |
https://dataleverage.substack.com/p/ai-artist-or-ai-art-thief-innovation-public-mandates-and-the-case-for-talking-in-terms-of-leverage |
| dataleverage.substack.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
clear data rules |
https://dataleverage.substack.com/p/almost-everybody-including-both-data |
| dataleverage.substack.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
post |
https://dataleverage.substack.com/p/attestation-across-the-ai-supply |
| dataleverage.substack.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
https://dataleverage.substack.com/p/ai-technologies-are-system-maps-and-you-are-a-cartographer |
https://dataleverage.substack.com/p/ai-technologies-are-system-maps-and-you-are-a-cartographer |
| dataleverage.substack.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
post |
https://dataleverage.substack.com/p/almost-everybody-including-both-data |
| dataleverage.substack.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
here |
https://dataleverage.substack.com/p/how-collective-bargaining-for-information |
| dataleverage.substack.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
post |
https://dataleverage.substack.com/p/the-coding-agent-data-deal |
| dataleverage.substack.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
paradox of reuse |
https://dataleverage.substack.com/p/the-paradox-of-reuse-language-models-edition |
| dataleverage.substack.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
map-making |
https://dataleverage.substack.com/p/ai-technologies-are-system-maps-and-you-are-a-cartographer |
| dataleverage.substack.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
guesses |
https://dataleverage.substack.com/p/chatgpt-is-awesome-and-scary-you-deserve-credit |
| dataleverage.substack.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
posts |
https://dataleverage.substack.com/p/measuring-relative-ai-alignment-in-terms-of-data-pipelines |
| dataleverage.substack.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
post |
https://dataleverage.substack.com/p/reddit-stackoverflow-and-europe-all |
| dataleverage.substack.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
Subscribe now |
https://dataleverage.substack.com/subscribe |
| datalevers.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
only |
https://datalevers.org/ |
| datalevers.org |
Bing Rewards for the AI Age Long Posts |
data itself as a lever |
https://datalevers.org/ |
| datalevers.org |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
datalevers.org |
https://datalevers.org/ |
| datalevers.org |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
website |
https://datalevers.org/ |
| datalicenses.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
here |
https://datalicenses.org/ |
| datalicenses.org |
Attestation across the AI Supply Chain Long Posts |
Data licensing |
https://datalicenses.org/ |
| datalicenses.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
data licenses |
https://datalicenses.org/?sort=recent |
| dataprovenance.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Data Provenance Initiative |
https://dataprovenance.org/ |
| dataprovenance.org |
Attestation across the AI Supply Chain Long Posts |
data provenance |
https://dataprovenance.org/ |
| dataprovenance.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
provenance |
https://dataprovenance.org/ |
| dataprovenance.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
Data Provenance Initiative |
https://dataprovenance.org/ |
| devclass.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
DevClass |
https://devclass.com/2026/01/05/dramatic-drop-in-stack-overflow-questions-as-devs-look-elsewhere-for-help |
| developers.google.com |
The Coding Agent Data Deal Long Posts |
page |
https://developers.google.com/gemini-code-assist/resources/privacy-notices |
| developers.google.com |
The Coding Agent Data Deal Long Posts |
here |
https://developers.google.com/gemini-code-assist/resources/privacy-notices |
| devin.ai |
Selling AGI like AG1: Will Consumers Push Back Against Proprietary Blends of Herbs and of Data? Long Posts |
Devin |
https://devin.ai/pricing |
| diff.wikimedia.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
Wikimedia |
https://diff.wikimedia.org/2025/10/17/new-user-trends-on-wikipedia |
| digifesto.com |
AI Technologies are System Maps, and You are a Cartographer Long Posts |
critiques |
https://digifesto.com/2018/12/06/data-isnt-labor-because-using-search-engines-is-really-easy |
| digifesto.com |
AI Technologies are System Maps, and You are a Cartographer Long Posts |
retort |
https://digifesto.com/2018/12/06/data-isnt-labor-because-using-search-engines-is-really-easy |
| digifesto.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
post |
https://digifesto.com/2018/12/06/data-isnt-labor-because-using-search-engines-is-really-easy |
| digital-strategy.ec.europa.eu |
Attestation across the AI Supply Chain Long Posts |
regulatory transparency requirements |
https://digital-strategy.ec.europa.eu/en/faqs/navigating-ai-act |
| direct.mit.edu |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Data Statements for NLP |
https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00041/43452/Data-Statements-for-Natural-Language-Processing |
| dl.acm.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
data strike |
https://dl.acm.org/doi/10.1145/3308558.3313742 |
| dl.acm.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
early |
https://dl.acm.org/doi/10.1145/3442188.3445922 |
| dl.acm.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
data strikes |
https://dl.acm.org/doi/10.1145/3308558.3313742 |
| dl.acm.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
data leverage |
https://dl.acm.org/doi/10.1145/3442188.3445885 |
| dl.acm.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
and |
https://dl.acm.org/doi/10.1145/3449177 |
| dl.acm.org |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
“data strikes” |
https://dl.acm.org/doi/10.1145/3308558.3313742 |
| dl.acm.org |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
“conscious data contribution” |
https://dl.acm.org/doi/10.1145/3449177 |
| dl.acm.org |
AI Technologies are System Maps, and You are a Cartographer Long Posts |
power |
https://dl.acm.org/doi/10.1145/3442188.3445885 |
| dl.acm.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
ACM DL |
https://dl.acm.org/doi/10.1145/3531146.3534637 |
| dl.acm.org |
Bing Rewards for the AI Age Long Posts |
key source |
https://dl.acm.org/doi/abs/10.1145/3449078 |
| dl.acm.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
action |
https://dl.acm.org/doi/10.1145/3442188.3445885 |
| dl.acm.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
Fallacy of AI Functionality |
https://dl.acm.org/doi/abs/10.1145/3531146.3533158 |
| dl.acm.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
"data leverage" |
https://dl.acm.org/doi/10.1145/3442188.3445885 |
| dl.acm.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
paper |
https://dl.acm.org/doi/10.1145/3449177 |
| dl.acm.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
data strikes |
https://dl.acm.org/citation.cfm?id=3313742 |
| dl.acm.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
data strike |
https://dl.acm.org/doi/10.1145/3308558.3313742 |
| dl.acm.org |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
labour |
https://dl.acm.org/doi/10.1145/3593013.3594070 |
| dl.acm.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
2020 FAccT paper: ACM DL |
https://dl.acm.org/doi/10.1145/3442188.3445885 |
| dl.acm.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
concerns |
https://dl.acm.org/doi/full/10.1145/3613904.3642703 |
| dl.acm.org |
Perplexity CEO's Interaction with Striking New York Times Workers Does Not Reflect Well on the AI Industry Long Posts |
here |
https://dl.acm.org/doi/10.1145/3593013.3594070 |
| dl.acm.org |
Plural AI Data Alignment Long Posts |
work |
https://dl.acm.org/doi/abs/10.1145/3306618.3314250 |
| dl.acm.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
paper |
https://dl.acm.org/doi/10.1145/2441776.2441923 |
| dl.acm.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
data strike |
https://dl.acm.org/doi/10.1145/3308558.3313742 |
| dl.acm.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
search engine results |
https://dl.acm.org/doi/abs/10.1145/3449078 |
| dl.acm.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
research |
https://dl.acm.org/doi/10.1145/3308558.3313742 |
| dl.acm.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
data leverage power |
https://dl.acm.org/doi/pdf/10.1145/3449177 |
| docs.github.com |
Bing Rewards for the AI Age Long Posts |
access |
https://docs.github.com/en/copilot/quickstart |
| docs.langchain.com |
The Coding Agent Data Deal Long Posts |
systems |
https://docs.langchain.com/oss/python/langchain/retrieval |
| docs.openwebui.com |
"Many Models" and "Track Changes" for AI: Some Thoughts on LLM Interfaces Long Posts |
Open WebUI |
https://docs.openwebui.com/ |
| doi.org |
Attestation across the AI Supply Chain Long Posts |
Castro Fernandez on auditable data-sharing arrangements |
https://doi.org/10.1145/3589317 |
| doi.org |
Bing Rewards for the AI Age Long Posts |
Kittur et al |
https://doi.org/10.1145/2441776.2441923 |
| doi.org |
Bing Rewards for the AI Age Long Posts |
wages |
https://doi.org/10.1145/3476060 |
| doi.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://doi.org/10.1257/pandp.20251045 |
https://doi.org/10.1257/pandp.20251045 |
| doi.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
https://doi.org/10.5281/zenodo.13914560 |
https://doi.org/10.5281/zenodo.13914560 |
| doi.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
real world |
https://doi.org/10.1111/jems.12421 |
| drinkag1.com |
Selling AGI like AG1: Will Consumers Push Back Against Proprietary Blends of Herbs and of Data? Long Posts |
AG1 |
https://drinkag1.com/ |
| duck.ai |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
Duck.ai |
https://duck.ai/ |
| duckduckgo.com |
Bing Rewards for the AI Age Long Posts |
bangs |
https://duckduckgo.com/bangs |
| eckhartarnold.de |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
daunting |
https://eckhartarnold.de/papers/2014_Social_Simulations/Whats_wrong_with_social_simulations.html |
| economicpossibility.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
sectoral bargaining |
https://economicpossibility.org/insights/codetermination-is-not-a-standalone-institution-rather-it-is-part-of-a-broader-in |
| economics.mit.edu |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
Simple Macroeconomics of AI |
https://economics.mit.edu/sites/default/files/2024-04/The%20Simple%20Macroeconomics%20of%20AI.pdf |
| economics.mit.edu |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://economics.mit.edu/sites/default/files/2024-04/The%20Simple%20Macroeconomics%20of%20AI.pdf |
https://economics.mit.edu/sites/default/files/2024-04/The%20Simple%20Macroeconomics%20of%20AI.pdf |
| economist.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
poems |
https://economist.com/science-and-technology/2020/08/08/a-new-ai-language-model-generates-poetry-and-prose |
| eleuther.ai |
Attestation across the AI Supply Chain Long Posts |
EleutherAI |
https://eleuther.ai/ |
| eleuther.ai |
Which datasets should we assume are "in all the AI models"? Long Posts |
EleutherAI |
https://eleuther.ai/projects/training-large-language-models |
| en.m.wikipedia.org |
Tipping Points for Content Ecosystems Long Posts |
Wikimedia Commons |
https://en.m.wikipedia.org/wiki/File:Permafrost_in_Herschel_Island_002.jpg |
| en.wikipedia.org |
"Many Models" and "Track Changes" for AI: Some Thoughts on LLM Interfaces Long Posts |
diff |
https://en.wikipedia.org/wiki/Diff |
| en.wikipedia.org |
AI Labs Should Open Source Data Protection Technologies Long Posts |
Trap streets |
https://en.wikipedia.org/wiki/Trap_street |
| en.wikipedia.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
free-culture |
https://en.wikipedia.org/wiki/Free-culture_movement |
| en.wikipedia.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Codes |
https://en.wikipedia.org/wiki/News_Media_Bargaining_Code |
| en.wikipedia.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
open knowledge |
https://en.wikipedia.org/wiki/Open_knowledge |
| en.wikipedia.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
history of statistical software |
https://en.wikipedia.org/wiki/Rexer’s_Annual_Data_Miner_Survey |
| en.wikipedia.org |
Bing Rewards for the AI Age Long Posts |
carbon dividend |
https://en.wikipedia.org/wiki/Carbon_fee_and_dividend |
| en.wikipedia.org |
Bing Rewards for the AI Age Long Posts |
MMOs |
https://en.wikipedia.org/wiki/Eve_Online |
| en.wikipedia.org |
Bing Rewards for the AI Age Long Posts |
Gardens by the Bay |
https://en.wikipedia.org/wiki/Gardens_by_the_Bay |
| en.wikipedia.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
cybernetics |
https://en.wikipedia.org/wiki/Cybernetics |
| en.wikipedia.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
1950s |
https://en.wikipedia.org/wiki/The_Human_Use_of_Human_Beings |
| en.wikipedia.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
Wikimedia Commons |
https://en.wikipedia.org/wiki/Reaper_(Van_Gogh_series |
| en.wikipedia.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
regulation |
https://en.wikipedia.org/wiki/California_Consumer_Privacy_Act |
| en.wikipedia.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
privacy |
https://en.wikipedia.org/wiki/General_Data_Protection_Regulation |
| en.wikipedia.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
diffs |
https://en.wikipedia.org/wiki/Diff |
| en.wikipedia.org |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
Wikimedia Commons |
https://en.wikipedia.org/wiki/File:Petardsketch2.jpg |
| en.wikipedia.org |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
hoist |
https://en.wikipedia.org/wiki/Hoist_with_his_own_petard |
| en.wikipedia.org |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
hoisted with its own petard |
https://en.wikipedia.org/wiki/Hoist_with_his_own_petard |
| en.wikipedia.org |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
live by the sword, and thusly die by the sword |
https://en.wikipedia.org/wiki/Live_by_the_sword,_die_by_the_sword |
| en.wikipedia.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
agent-based model |
https://en.wikipedia.org/wiki/Agent-based_model |
| en.wikipedia.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
platform |
https://en.wikipedia.org/wiki/2023_Reddit_API_controversy |
| en.wikipedia.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
Ahmad al-Tifashi |
https://en.wikipedia.org/wiki/Ahmad_al-Tifashi |
| en.wikipedia.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
summarizing |
https://en.wikipedia.org/wiki/Arrow_information_paradox |
| en.wikipedia.org |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
tragedy of the commons |
https://en.wikipedia.org/wiki/Tragedy_of_the_commons |
| en.wikipedia.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
article |
https://en.wikipedia.org/wiki/BERT_(language_model |
| en.wikipedia.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Wikipedia |
https://en.wikipedia.org/wiki/Wikipedia:Featured_article_criteria |
| en.wikipedia.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
club goods |
https://en.wikipedia.org/wiki/Club_good |
| en.wikipedia.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
enclosure |
https://en.wikipedia.org/wiki/Enclosure |
| en.wikipedia.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
knowledge commons |
https://en.wikipedia.org/wiki/Knowledge_commons |
| en.wikipedia.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
tragedy of the commons |
https://en.wikipedia.org/wiki/Tragedy_of_the_commons |
| en.wikipedia.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
Jacquard loom |
https://en.wikipedia.org/wiki/Jacquard_machine |
| en.wikipedia.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
The Right to Be Forgotten |
https://en.wikipedia.org/wiki/Right_to_be_forgotten |
| en.wikipedia.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
strikebreakers |
https://en.wikipedia.org/wiki/Strikebreaker |
| en.wikipedia.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
my favorite |
https://en.wikipedia.org/wiki/The_Room |
| en.wikipedia.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
The Second Machine Age |
https://en.wikipedia.org/wiki/The_Second_Machine_Age |
| en.wikipedia.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
WGA |
https://en.wikipedia.org/wiki/Writers_Guild_of_America |
| en.wikipedia.org |
Tipping Points for Content Ecosystems Long Posts |
1% rule |
https://en.wikipedia.org/wiki/1%25_rule |
| en.wikipedia.org |
Tipping Points for Content Ecosystems Long Posts |
tipping points in natural ecosystems |
https://en.wikipedia.org/wiki/Tipping_points_in_the_climate_system |
| en.wikipedia.org |
Tipping Points for Content Ecosystems Long Posts |
length |
https://en.wikipedia.org/wiki/Wikipedia:Size_of_Wikipedia |
| en.wikipedia.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
complementary events |
https://en.wikipedia.org/wiki/Complementary_event |
| en.wikipedia.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
instance |
https://en.wikipedia.org/wiki/Complementary_event |
| en.wikipedia.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
foundation |
https://en.wikipedia.org/wiki/Foundation_model |
| en.wikipedia.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
enclosure |
https://en.wikipedia.org/wiki/Enclosure |
| en.wikipedia.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
AI research |
https://en.wikipedia.org/wiki/GPT-2 |
| en.wikipedia.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
prosumers |
https://en.wikipedia.org/wiki/Prosumer |
| euronews.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
and |
https://euronews.com/culture/2023/03/27/from-lawsuits-to-tech-hacks-heres-how-artists-are-fighting-back-against-ai-image-generatio |
| evalevalai.com |
Attestation across the AI Supply Chain Long Posts |
"Evaleval" |
https://evalevalai.com/about |
| evalevalai.com |
Attestation across the AI Supply Chain Long Posts |
"Every Eval Ever" |
https://evalevalai.com/projects/every-eval-ever |
| evalevalai.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
EvalEval Coalition |
https://evalevalai.com/ |
| exploringai.org |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
data napkin math |
https://exploringai.org/ |
| facctconference.org |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
FAccT |
https://facctconference.org/ |
| fair.work |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Fairwork’s work on fair AI supply chains |
https://fair.work/en/fw/certification |
| firstmonday.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
heteromation |
https://firstmonday.org/ojs/index.php/fm/article/view/5331/4090 |
| floriantramer.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
can |
https://floriantramer.com/publications/diffusion23 |
| floriantramer.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
verbatim |
https://floriantramer.com/publications/verbatim22 |
| forbes.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
overhyped |
https://forbes.com/sites/robtoews/2020/07/19/gpt-3-is-amazingand-overhyped |
| forbes.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
op-ed |
https://forbes.com/sites/benjaminwolff/2022/12/31/why-the-creative-economy-shouldnt-fear-generative-ai?sh=5dae644d1fd5 |
| forum.effectivealtruism.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://forum.effectivealtruism.org/posts/xoX936hEvpxToeuLw |
https://forum.effectivealtruism.org/posts/xoX936hEvpxToeuLw |
| forum.effectivealtruism.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
post |
https://forum.effectivealtruism.org/posts/xoX936hEvpxToeuLw/estimating-the-substitutability-between-compute-and |
| ft.com |
AI Labs Should Open Source Data Protection Technologies Long Posts |
Financial Times |
https://ft.com/content/a0dfedd1-5255-4fa9-8ccc-1fe01de87ea6 |
| ft.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
The Financial Times |
https://ft.com/content/a0dfedd1-5255-4fa9-8ccc-1fe01de87ea6 |
| genlaw.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
GenLaw |
https://genlaw.org/ |
| geograph.org.uk |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
Roger McLachlan |
https://geograph.org.uk/profile/1205 |
| geograph.org.uk |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
reuse |
https://geograph.org.uk/reuse.php?id=382118 |
| gist.github.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
post |
https://gist.github.com/veekaybee/6f8885e9906aa9c5408ebe5c7e870698 |
| github.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
GitHub |
https://github.com/Responsible-Dataset-Sharing/easy-dataset-share |
| github.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
proposal |
https://github.com/creativecommons/cc-signals |
| github.com |
Attestation across the AI Supply Chain Long Posts |
evalstats |
https://github.com/ianarawjo/evalstats |
| github.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
many competing AI models |
https://github.com/manymodels/manymodels |
| github.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
open source effort |
https://github.com/jcpeterson/openwebtext |
| github.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
here |
https://github.com/nickmvincent/UGCValueRoundup/blob/main/wikipedia.md |
| github.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
InstructGPT Model Card |
https://github.com/openai/following-instructions-human-feedback |
| github.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
model card |
https://github.com/openai/following-instructions-human-feedback |
| github.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
model card |
https://github.com/openai/following-instructions-human-feedback/blob/main/model-card.md |
| github.com |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
Agentwatch |
https://github.com/nickmvincent/agentwatch |
| github.com |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
repo |
https://github.com/nickmvincent/extenote |
| github.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
here |
https://github.com/jcpeterson/openwebtext |
| github.com |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
needed |
https://github.com/nickmvincent/data_napkin_math |
| github.com |
Google and TikTok rank bundles of information; ChatGPT ranks grains. Long Posts |
Some Semi-Serious Naming Proposals to Improve AI Discourse |
https://github.com/nickmvincent/blogs/blob/main/microblogs/2025-05-17_three_terms.md |
| github.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
blogs |
https://github.com/nickmvincent/blogs |
| github.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
here |
https://github.com/nickmvincent/blogs/blob/main/ideas.md |
| github.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
synthetic data |
https://github.com/nickmvincent/blogs/blob/main/microblogs/2025-05-17_three_terms.md |
| github.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
GitHub repo |
https://github.com/nickmvincent/paidf_consultation |
| github.com |
Tipping Points for Content Ecosystems Long Posts |
enough |
https://github.com/nickmvincent/data_napkin_math |
| github.com |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
here |
https://github.com/nickmvincent/public-talks/tree/main/2025-10_csss_gabm |
| github.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
NLP research papers |
https://github.com/nickmvincent/UGCValueRoundup/blob/main/wikipedia.md |
| github.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
) |
https://github.com/nickmvincent/UGCValueRoundup/blob/main/wikipedia.md |
| google-gemini.github.io |
The Coding Agent Data Deal Long Posts |
documentation page |
https://google-gemini.github.io/gemini-cli/docs/tos-privacy.html |
| gradual-disempowerment.ai |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
essay |
https://gradual-disempowerment.ai/ |
| gs.statcounter.com |
Attestation across the AI Supply Chain Long Posts |
1% of worldwide desktop browser share |
https://gs.statcounter.com/browser-market-share/desktop/worldwide |
| hai.stanford.edu |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
benchmark saturation |
https://hai.stanford.edu/ai-index/2025-ai-index-report |
| hbr.org |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
workslop |
https://hbr.org/2025/09/ai-generated-workslop-is-destroying-productivity |
| hbr.org |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
intermediary |
https://hbr.org/2018/09/a-blueprint-for-a-better-digital-society |
| hbr.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
piece |
https://hbr.org/2021/03/ai-should-augment-human-intelligence-not-replace-it |
| hbr.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://hbr.org/2021/03/ai-should-augment-human-intelligence-not-replace-it |
https://hbr.org/2021/03/ai-should-augment-human-intelligence-not-replace-it |
| hbs.edu |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
HBR |
https://hbs.edu/faculty/Pages/item.aspx?num=50951 |
| hbs.edu |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
value |
https://hbs.edu/ris/Publication%20Files/24-038_51f8444f-502c-4139-8bf2-56eb4b65c58a.pdf |
| help.openai.com |
The Coding Agent Data Deal Long Posts |
here |
https://help.openai.com/en/articles/11369540-using-codex-with-your-chatgpt-plan |
| help.openai.com |
The Coding Agent Data Deal Long Posts |
article |
https://help.openai.com/en/articles/5722486-how-your-data-is-used-to-improve-model-performance |
| history.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
instrumental |
https://history.com/news/strikes-labor-movement |
| historytoday.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
been |
https://historytoday.com/archive/head-head/what-have-strikes-achieved |
| hls.harvard.edu |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Mary Gray and Siddharth Suri’s *Ghost Work* |
https://hls.harvard.edu/today/the-hidden-labor-supporting-algorithms |
| hollywoodreporter.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
article |
https://hollywoodreporter.com/business/business-news/writers-strike-ai-chatgpt-1235478681 |
| huggingface.co |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
here |
https://huggingface.co/datasets/nickmvincent/coding-agent-transcripts-1 |
| huggingface.co |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
weights |
https://huggingface.co/deepseek-ai/DeepSeek-R1 |
| huggingface.co |
Which datasets should we assume are "in all the AI models"? Long Posts |
FineWeb |
https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1 |
| hyperdimensional.co |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
Dean Ball |
https://hyperdimensional.co/p/clawed |
| icml.cc |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
favor |
https://icml.cc/virtual/2025/poster/40125 |
| imgflip.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
Imgflip |
https://imgflip.com/memegenerator/255177692/astronaut-meme-always-has-been-template |
| in-toto.io |
Attestation across the AI Supply Chain Long Posts |
in-toto |
https://in-toto.io/docs/what-is-in-toto |
| inet.ox.ac.uk |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
works |
https://inet.ox.ac.uk/publications/large-language-models-reduce-public-knowledge-sharing-on-online-q-a-platforms |
| instagram.com |
Selling AGI like AG1: Will Consumers Push Back Against Proprietary Blends of Herbs and of Data? Long Posts |
aim |
https://instagram.com/bryanjohnson_/p/C__8jp9yAd4 |
| intelligence-curse.ai |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
essay |
https://intelligence-curse.ai/ |
| jeffreybigham.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
reap the AI harvest |
https://jeffreybigham.com/blog/2019/the-coming-ai-autumnn.html |
| jessicahullman.substack.com |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
Living the metascience dream (or nightmare) with AI for science |
https://jessicahullman.substack.com/p/living-the-metascience-dream-or-nightmare |
| joinreboot.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
post |
https://joinreboot.org/i/162295663/a-site-for-every-soliloquy |
| joinreboot.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://joinreboot.org/p/macrodoses-7 |
https://joinreboot.org/p/macrodoses-7 |
| journals.openedition.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
challenging |
https://journals.openedition.org/cybergeo/1035?lang=en |
| jstor.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
1960s |
https://jstor.org/stable/1705998?seq=2 |
| jstor.org |
Public AI, Data Appraisal, and Data Debates Long Posts |
work |
https://jstor.org/stable/1911865?seq=1 |
| karriekarahalios.com |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
Karrie Karahalios |
https://karriekarahalios.com/ |
| katecrawford.net |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
of |
https://katecrawford.net/ |
| knightcolumbia.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
essay |
https://knightcolumbia.org/content/ai-as-normal-technology |
| knightcolumbia.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://knightcolumbia.org/content/ai-as-normal-technology |
https://knightcolumbia.org/content/ai-as-normal-technology |
| lambdalabs.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
estimated |
https://lambdalabs.com/blog/demystifying-gpt-3 |
| latimes.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
LA Times op-ed |
https://latimes.com/opinion/story/2023-05-05/ai-writers-strike-worker-protections-entertainment |
| lesswrong.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
post |
https://lesswrong.com/posts/GAv4DRGyDHe2orvwB/gradual-disempowerment-concrete-research-projects |
| lexology.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
whether |
https://lexology.com/library/detail.aspx?g=0adc3f5a-23f4-422e-a375-7ad5e7bf6709 |
| licenses.ai |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
licensing |
https://licenses.ai/ |
| licenses.ai |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
RAIL |
https://licenses.ai/blog/2023/1/17/rail-initiative-call-for-participation |
| link.springer.com |
Plural AI Data Alignment Long Posts |
scholarship |
https://link.springer.com/article/10.1007/s11023-020-09539-2 |
| loc.gov |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
link |
https://loc.gov/item/2003664100 |
| lukedrago.substack.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://lukedrago.substack.com/p/the-intelligence-curse |
https://lukedrago.substack.com/p/the-intelligence-curse |
| machinelearning.apple.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
Apple’s |
https://machinelearning.apple.com/research/apple-intelligence-foundation-language-models |
| mako.cc |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
concentrated |
https://mako.cc/copyrighteous/editor-to-reader-ratios-on-wikipedia |
| mako.cc |
The Paradox of Reuse, Language Models Edition Long Posts |
estimate |
https://mako.cc/copyrighteous/editor-to-reader-ratios-on-wikipedia |
| marginalrevolution.com |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
post |
https://marginalrevolution.com/marginalrevolution/2025/02/deep-research.html |
| marketoonist.com |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
marketoonist |
https://marketoonist.com/2023/03/ai-written-ai-read.html |
| marketoonist.com |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
here |
https://marketoonist.com/faq |
| marsh.com |
Attestation across the AI Supply Chain Long Posts |
Marsh |
https://marsh.com/en-gb/services/risk-analytics/expertise/ai-system-risk-analysis.html |
| math.stackexchange.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
counting |
https://math.stackexchange.com/questions/3351186/why-should-i-consider-the-complementary-probability-if-i-can-do-it-directly |
| mcgill.ca |
Selling AGI like AG1: Will Consumers Push Back Against Proprietary Blends of Herbs and of Data? Long Posts |
controversial |
https://mcgill.ca/oss/article/critical-thinking-health-and-nutrition/you-probably-dont-need-green-ag1-smoothie |
| mechanize.work |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
Mechanize |
https://mechanize.work/ |
| mercor.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Mercor |
https://mercor.com/research |
| meta.stackexchange.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
post |
https://meta.stackexchange.com/questions/388401/new-blog-post-from-our-ceo-prashanth-community-is-the-future-of-ai |
| meta.stackexchange.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
Stack Overflow |
https://meta.stackexchange.com/questions/388401/new-blog-post-from-our-ceo-prashanth-community-is-the-future-of-ai |
| meta.stackexchange.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
serious |
https://meta.stackexchange.com/questions/333089/stack-exchange-and-stack-overflow-have-moved-to-cc-by-sa-4-0 |
| meta.stackexchange.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
contention |
https://meta.stackexchange.com/questions/344491/an-update-on-creative-commons-licensing |
| meta.stackexchange.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
collective bargaining |
https://meta.stackexchange.com/questions/391847/moderation-strike-results-of-negotiations?cb=1 |
| meta.wikimedia.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
Wikimedia Enterprise |
https://meta.wikimedia.org/wiki/Wikimedia_Enterprise |
| meta.wikimedia.org |
The Paradox of Reuse, Language Models Edition Long Posts |
program |
https://meta.wikimedia.org/wiki/Wikimedia_Enterprise |
| meta.wikimedia.org |
Which datasets should we assume are "in all the AI models"? Long Posts |
Wikimedia Enterprise |
https://meta.wikimedia.org/wiki/Wikimedia_Enterprise |
| metr.org |
Attestation across the AI Supply Chain Long Posts |
METR |
https://metr.org/ |
| metr.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
work |
https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks |
| metr.org |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
Task-Completion Time Horizons of Frontier AI Models |
https://metr.org/time-horizons |
| miba.dev |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
Human Context Protocol |
https://miba.dev/assets/publications/HCP_ArXiv_2025.pdf |
| microsoft.com |
Attestation across the AI Supply Chain Long Posts |
datasheets |
https://microsoft.com/en-us/research/project/datasheets-for-datasets |
| microsoft.com |
Bing Rewards for the AI Age Long Posts |
credits |
https://microsoft.com/en-us/azure-academic-research |
| microsoft.com |
Bing Rewards for the AI Age Long Posts |
Microsoft Rewards |
https://microsoft.com/en-us/rewards |
| microsoft.com |
The Coding Agent Data Deal Long Posts |
implicit measures for web search |
https://microsoft.com/en-us/research/publication/evaluating-implicit-measures-improve-web-search |
| microsoft.com |
The Coding Agent Data Deal Long Posts |
dwell time |
https://microsoft.com/en-us/research/publication/modeling-dwell-time-to-predict-click-level-satsifaction |
| microsoft.com |
Tipping Points for Content Ecosystems Long Posts |
synthetic data |
https://microsoft.com/en-us/research/publication/textbooks-are-all-you-need-ii-phi-1-5-technical-report |
| mitpress.mit.edu |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
Obfuscation |
https://mitpress.mit.edu/9780262529860/obfuscation |
| mturk.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
MTurk |
https://mturk.com/ |
| nature.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
their |
https://nature.com/articles/d41586-022-04383-z |
| nature.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
over |
https://nature.com/articles/s43586-022-00172-0.epdf?sharing_token=20oCMhzni41xvDUut2OItdRgN0jAjWel9jnR3ZoTv0OwjbZm_FCT7gsPxkyDixLb1Sapyw-rKunjdUM-MQsb2Df0fuyC5afG4elbIDnGjYVTr4j3hlrQ7YmaASLl3Q0UKi5thaNq9gvVPV-cT8IZm9wh7kXFdLAzLh60tNgS2gE%3D |
| nature.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Data Provenance Initiative |
https://nature.com/articles/s42256-024-00878-8 |
| nature.com |
Tipping Points for Content Ecosystems Long Posts |
tipping points |
https://nature.com/articles/s41559-019-0797-2 |
| nbcnews.com |
AI Labs Should Open Source Data Protection Technologies Long Posts |
David Sacks |
https://nbcnews.com/tech/tech-news/openai-says-deepseek-may-inapproriately-used-data-rcna189872 |
| nbcnews.com |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
be |
https://nbcnews.com/think/opinion/remote-testing-monitored-ai-failing-students-forced-undergo-it-ncna1246769 |
| nber.org |
Attestation across the AI Supply Chain Long Posts |
Arrow on the economics of information |
https://nber.org/books-and-chapters/rate-and-direction-inventive-activity-economic-and-social-factors/economic-welfare-and-allocation-resources-invention |
| nber.org |
Attestation across the AI Supply Chain Long Posts |
economic properties of information |
https://nber.org/books-and-chapters/rate-and-direction-inventive-activity-economic-and-social-factors/economic-welfare-and-allocation-resources-invention |
| nber.org |
Tipping Points for Content Ecosystems Long Posts |
displaced |
https://nber.org/system/files/working_papers/w24174/w24174.pdf |
| neurips.cc |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
track |
https://neurips.cc/Conferences/2026/CallForEvaluationsDatasets |
| news.ycombinator.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
split |
https://news.ycombinator.com/item?id=33998112 |
| news.ycombinator.com |
AI Labs Should Open Source Data Protection Technologies Long Posts |
Hackernews |
https://news.ycombinator.com/item?id=42865527 |
| news.ycombinator.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
discussion |
https://news.ycombinator.com/item?id=46709320 |
| newsletter.semianalysis.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
We Have No Moat, And Neither Does OpenAI |
https://newsletter.semianalysis.com/p/google-we-have-no-moat-and-neither |
| newyorker.com |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
dignity |
https://newyorker.com/science/annals-of-artificial-intelligence/there-is-no-ai |
| nicholas.carlini.com |
"Many Models" and "Track Changes" for AI: Some Thoughts on LLM Interfaces Long Posts |
post |
https://nicholas.carlini.com/writing/2024/how-i-use-ai.html |
| nickmvincent.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
policy paper |
https://nickmvincent.com/static/canada_publicai.pdf |
| nickmvincent.com |
Attestation across the AI Supply Chain Long Posts |
data dividends |
https://nickmvincent.com/static/eaamo_data_dividends.pdf |
| nickmvincent.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
contributing to Wikipedia, |
https://nickmvincent.com/static/WikiSerp2020.pdf |
| nickmvincent.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
user-generated data in search engine results |
https://nickmvincent.com/static/icwsm2019_ugcinsearch_arxiv.pdf |
| nickmvincent.com |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
economic inequality |
https://nickmvincent.com/static/eaamo_data_dividends.pdf |
| nickmvincent.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
paper |
https://nickmvincent.com/static/cbi_paper.pdf |
| nickmvincent.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://nickmvincent.com/static/cbi_paper.pdf |
https://nickmvincent.com/static/cbi_paper.pdf |
| nickmvincent.com |
The Paradox of Reuse, Language Models Edition Long Posts |
do |
https://nickmvincent.com/static/wikiserp_cscw.pdf |
| nickmvincent.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
papers |
https://nickmvincent.com/ |
| nickmvincent.github.io |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
docs |
https://nickmvincent.github.io/extenote |
| nickmvincent.github.io |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
GitHub pages |
https://nickmvincent.github.io/paidf_consultation/01c_pipeworks.html |
| nickmvincent.github.io |
Tipping Points for Content Ecosystems Long Posts |
Data Napkin Math Project |
https://nickmvincent.github.io/data_napkin_math |
| nmvg.mataroa.blog |
Bing Rewards for the AI Age Long Posts |
responsible |
https://nmvg.mataroa.blog/blog/chatgpt-is-awesome-and-scary-you-deserve-credit |
| nmvg.mataroa.blog |
Bing Rewards for the AI Age Long Posts |
paradox of reuse |
https://nmvg.mataroa.blog/blog/the-paradox-of-reuse-language-models-edition |
| nmvg.mataroa.blog |
Bing Rewards for the AI Age Long Posts |
paradox of re-use |
https://nmvg.mataroa.blog/blog/the-paradox-of-reuse-language-models-edition |
| notion.so |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
The Paradox of Reuse, Language Models Edition |
https://notion.so/Data-Leverage-Recap-December-2022-April-2023-e1faadd001364ca18180995eeadcb223 |
| nowpublishers.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
differential privacy |
https://nowpublishers.com/article/Details/TCS-042 |
| npr.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
emphatically |
https://npr.org/2023/08/16/1194202562/new-york-times-considers-legal-action-against-openai-as-copyright-tensions-swirl |
| npr.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
NPR |
https://npr.org/2025/09/05/nx-s1-5529404/anthropic-settlement-authors-copyright-ai |
| nytimes.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
posturing |
https://nytimes.com/2023/04/18/technology/reddit-ai-openai-google.html |
| nytimes.com |
Bing Rewards for the AI Age Long Posts |
public |
https://nytimes.com/2021/07/07/opinion/google-utility-antitrust-technology.html |
| nytimes.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://www.nytimes.com/2025/05/30/podcasts/hardfork-ai-jobpocalypse.html |
https://nytimes.com/2025/05/30/podcasts/hardfork-ai-jobpocalypse.html |
| nytimes.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
topic |
https://nytimes.com/2025/05/30/technology/ai-jobs-college-graduates.html |
| nytimes.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://www.nytimes.com/2025/05/30/technology/ai-jobs-college-graduates.html |
https://nytimes.com/2025/05/30/technology/ai-jobs-college-graduates.html |
| nytimes.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
Reddit |
https://nytimes.com/2023/04/18/technology/reddit-ai-openai-google.html |
| nytimes.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
article |
https://nytimes.com/2023/07/18/magazine/wikipedia-ai-chatgpt.html |
| nytimes.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
The Daily Podcast |
https://nytimes.com/2023/09/10/podcasts/the-daily/wikipedia-ai.html |
| oag.ca.gov |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
The Right to Delete |
https://oag.ca.gov/privacy/ccpa |
| ojs.aaai.org |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
data |
https://ojs.aaai.org/index.php/ICWSM/article/view/7347 |
| ojs.aaai.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
computational social science |
https://ojs.aaai.org/index.php/ICWSM/article/view/7347 |
| open.spotify.com |
Tipping Points for Content Ecosystems Long Posts |
podcast |
https://open.spotify.com/episode/2G4UlFmVjwMizRl1jMUPxf?si=1f41d2d0afcb40b2 |
| openai.com |
"People First" Policy Ideas that Complement Each Other (through better data flow) Long Posts |
here |
https://openai.com/index/industrial-policy-for-the-intelligence-age |
| openai.com |
AI Labs Should Open Source Data Protection Technologies Long Posts |
techniques |
https://openai.com/index/understanding-the-source-of-what-we-see-and-hear-online |
| openai.com |
Attestation across the AI Supply Chain Long Posts |
"Introducing ChatGPT Health" |
https://openai.com/index/introducing-chatgpt-health |
| openai.com |
Attestation across the AI Supply Chain Long Posts |
OpenAI's health launch as an example of product-level signaling around physician involvement and evaluation |
https://openai.com/index/introducing-chatgpt-health |
| openai.com |
Bing Rewards for the AI Age Long Posts |
paywall |
https://openai.com/blog/chatgpt-plus |
| openai.com |
Bing Rewards for the AI Age Long Posts |
OpenAI pricing |
https://openai.com/pricing |
| openai.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
governs |
https://openai.com/blog/democratic-inputs-to-ai |
| openai.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
post |
https://openai.com/blog/chatgpt |
| openai.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
ChatGPT |
https://openai.com/blog/chatgpt |
| openai.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
The ChatGPT Blog Post |
https://openai.com/blog/chatgpt |
| openai.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
InstructGPT blog post |
https://openai.com/blog/instruction-following |
| openai.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
blog post |
https://openai.com/blog/instruction-following |
| openai.com |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
release |
https://openai.com/index/introducing-deep-research |
| openai.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
OpenAI |
https://openai.com/policies/row-terms-of-use |
| openai.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://openai.com/charter |
https://openai.com/charter |
| openai.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
highly autonomous systems that outperform humans at most economically valuable work |
https://openai.com/charter |
| openai.com |
Plural AI Data Alignment Long Posts |
Alignment page post |
https://openai.com/blog/our-approach-to-alignment-research |
| openai.com |
Plural AI Data Alignment Long Posts |
post |
https://openai.com/blog/planning-for-agi-and-beyond |
| openai.com |
Public AI, Data Appraisal, and Data Debates Long Posts |
OpenAI |
https://openai.com/index/openai-and-reddit-partnership |
| openai.com |
Selling AGI like AG1: Will Consumers Push Back Against Proprietary Blends of Herbs and of Data? Long Posts |
AGI |
https://openai.com/index/planning-for-agi-and-beyond |
| openai.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
HealthBench |
https://openai.com/index/healthbench |
| openai.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
medicine |
https://openai.com/index/introducing-chatgpt-health |
| openai.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
(1) transfer |
https://openai.com/index/language-models-are-few-shot-learners |
| openai.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
HealthBench Professional / ChatGPT for Clinicians |
https://openai.com/index/making-chatgpt-better-for-clinicians |
| openai.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
(2) scaling |
https://openai.com/index/scaling-laws-for-neural-language-models |
| openai.com |
The Paradox of Reuse, Language Models Edition Long Posts |
ChatGPT |
https://openai.com/blog/chatgpt |
| openai.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
research |
https://openai.com/research/gpts-are-gpts |
| openaireview.org |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
peer review death spiral |
https://openaireview.org/blog.html |
| openaireview.org |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
OpenAIReview |
https://openaireview.org/blog.html |
| openhands.dev |
The Coding Agent Data Deal Long Posts |
Open Hands |
https://openhands.dev/ |
| openreview.net |
Attestation across the AI Supply Chain Long Posts |
Raji et al. on the limits of "general" benchmarks |
https://openreview.net/forum?id=j6NxpQbREA1 |
| openreview.net |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
memorization |
https://openreview.net/forum?id=TatRHT_1cK |
| openrouter.ai |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
OpenRouter |
https://openrouter.ai/ |
| openwebui.com |
"Many Models" and "Track Changes" for AI: Some Thoughts on LLM Interfaces Long Posts |
efforts |
https://openwebui.com/f/maxkerkula/mixture_of_agents |
| oreilly.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
answer |
https://oreilly.com/radar/what-does-copyright-say-about-generative-models |
| palewi.re |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
follow suit |
https://palewi.re/docs/news-homepages/openai-gptbot-robotstxt.html |
| papers.ssrn.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
Zargham and Shorish 2023 |
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4569037 |
| papers.ssrn.com |
Tipping Points for Content Ecosystems Long Posts |
data rivers |
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4388928 |
| paperswithcode.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
WebText |
https://paperswithcode.com/dataset/webtext |
| paperswithcode.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
here |
https://paperswithcode.com/dataset/webtext |
| partnershiponai.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Partnership on AI’s responsible sourcing work |
https://partnershiponai.org/workstream/responsible-sourcing |
| perkinscoie.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
proposed |
https://perkinscoie.com/en/news-insights/the-latest-on-the-eus-proposed-artificial-intelligence-act.html |
| perplexity.ai |
Perplexity CEO's Interaction with Striking New York Times Workers Does Not Reflect Well on the AI Industry Long Posts |
Perplexity |
https://perplexity.ai/ |
| pewresearch.org |
Perplexity CEO's Interaction with Striking New York Times Workers Does Not Reflect Well on the AI Industry Long Posts |
polling |
https://pewresearch.org/politics/2024/02/01/labor-unions |
| pewresearch.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
Pew |
https://pewresearch.org/short-reads/2025/07/22/google-users-are-less-likely-to-click-on-links-when-an-ai-summary-appears-in-the-results |
| phenomenalworld.org |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
Viljoen |
https://phenomenalworld.org/analysis/data-as-property |
| pile.eleuther.ai |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
The Pile |
https://pile.eleuther.ai/ |
| pile.eleuther.ai |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
The Pile |
https://pile.eleuther.ai/ |
| platform.openai.com |
Tipping Points for Content Ecosystems Long Posts |
1 token = 3/4 word |
https://platform.openai.com/tokenizer |
| plurality.institute |
Plural AI Data Alignment Long Posts |
plurality |
https://plurality.institute/ |
| plurality.institute |
Plural AI Data Alignment Long Posts |
plurality research |
https://plurality.institute/ |
| polarislab.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
database |
https://polarislab.org/ai-law-tracker.html |
| policykit.org |
Bing Rewards for the AI Age Long Posts |
PolicyKit |
https://policykit.org/ |
| press.princeton.edu |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
Radical Markets |
https://press.princeton.edu/books/paperback/9780691196060/radical-markets |
| probml.github.io |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
here |
https://probml.github.io/pml-book/book1.html |
| proceedings.mlr.press |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
link |
https://proceedings.mlr.press/v89/jia19a.html |
| proceedings.mlr.press |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
link |
https://proceedings.mlr.press/v97/ghorbani19c.html |
| proceedings.mlr.press |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
work |
https://proceedings.mlr.press/v80/agarwal18a.html |
| proceedings.neurips.cc |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
link |
https://proceedings.neurips.cc/paper/2019/hash/a78482ce76496fcf49085f2190e675b4-Abstract.html |
| prolific.com |
Attestation across the AI Supply Chain Long Posts |
via Prolific |
https://prolific.com/pricing |
| psagroup.org |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
"Expert Language Model Trainers" |
https://psagroup.org/blogposts/78 |
| psagroup.org |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
here |
https://psagroup.org/blogposts/62 |
| psagroup.org |
The Paradox of Reuse, Language Models Edition Long Posts |
disproportionate role |
https://psagroup.org/blogposts/78 |
| publicai.co |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
publicai.co |
https://publicai.co/ |
| publicai.network |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
public AI |
https://publicai.network/ |
| publicai.network |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
public AI network website |
https://publicai.network/ |
| publicai.network |
Public AI, Data Appraisal, and Data Debates Long Posts |
website |
https://publicai.network/ |
| pubmed.ncbi.nlm.nih.gov |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
contexts |
https://pubmed.ncbi.nlm.nih.gov/33982031 |
| quitgpt.org |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
QuitGPT |
https://quitgpt.org/ |
| radicalmarkets.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
http://radicalmarkets.com/chapters/data-as-labor/frequently-asked-questions/ |
https://radicalmarkets.com/chapters/data-as-labor/frequently-asked-questions |
| radicalxchange.org |
AI Labs Should Open Source Data Protection Technologies Long Posts |
here |
https://radicalxchange.org/media/blog/three-pathways-to-distributed-power-in-the-ai-economy |
| radicalxchange.org |
AI Technologies are System Maps, and You are a Cartographer Long Posts |
data dignity |
https://radicalxchange.org/concepts/data-dignity |
| radicalxchange.org |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
pluralistic |
https://radicalxchange.org/media/blog/why-i-am-a-pluralist |
| radicalxchange.org |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
data |
https://radicalxchange.org/wiki/data-dignity |
| radicalxchange.org |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
data dignity |
https://radicalxchange.org/concepts/data-dignity |
| radicalxchange.org |
The Coding Agent Data Deal Long Posts |
data labor |
https://radicalxchange.org/wiki/data-dignity |
| radicalxchange.org |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
data dignity |
https://radicalxchange.org/concepts/data-dignity |
| raulcastrofernandez.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
DataStation |
https://raulcastrofernandez.com/papers/data_station_paper-11.pdf |
| raulcastrofernandez.com |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
escrow |
https://raulcastrofernandez.com/papers/data_station_paper-11.pdf |
| raulcastrofernandez.com |
Public AI, Data Appraisal, and Data Debates Long Posts |
Data-Sharing Consortia |
https://raulcastrofernandez.com/papers/data-sharing-consortia-escrow.pdf |
| reddit |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
arbitrary reddit post just to get a sense of how people are engaging on other platforms |
https://reddit/ |
| redditinc.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
terms |
https://redditinc.com/policies/user-agreement |
| research.brickroad.network |
Attestation across the AI Supply Chain Long Posts |
fragmented |
https://research.brickroad.network/neurips2025-data-deals |
| research.brickroad.network |
Attestation across the AI Supply Chain Long Posts |
here |
https://research.brickroad.network/neurips2025-data-deals |
| research.google |
Attestation across the AI Supply Chain Long Posts |
model cards |
https://research.google/pubs/model-cards-for-model-reporting |
| research.google |
Attestation across the AI Supply Chain Long Posts |
Model cards |
https://research.google/pubs/model-cards-for-model-reporting |
| research.google |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Model Cards |
https://research.google/pubs/model-cards-for-model-reporting |
| researcher-help.prolific.com |
Attestation across the AI Supply Chain Long Posts |
at least $8/hr, and typically more like $12/hr |
https://researcher-help.prolific.com/en/article/2273bd |
| researchgate.net |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
Hess and Ostrom |
https://researchgate.net/publication/239919282_Introduction_An_Overview_of_the_Knowledge_Commons |
| reuters.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Reuters |
https://reuters.com/technology/artificial-intelligence/nyt-sends-ai-startup-perplexity-cease-desist-notice-over-content-use-wsj-reports-2024-10-15 |
| reuters.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Reuters |
https://reuters.com/world/us/us-appeals-court-rejects-copyrights-ai-generated-art-lacking-human-creator-2025-03-18 |
| reuters.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
arguing |
https://reuters.com/legal/litigation/tech-companies-face-tough-ai-copyright-questions-2025-2024-12-27 |
| reuters.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
investment |
https://reuters.com/technology/chinas-deepseek-sets-off-ai-market-rout-2025-01-27 |
| reuters.com |
Public AI, Data Appraisal, and Data Debates Long Posts |
Google |
https://reuters.com/technology/reddit-ai-content-licensing-deal-with-google-sources-say-2024-02-22 |
| reuters.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
browsing-enabled ChatGPT |
https://reuters.com/technology/openai-says-chatgpt-can-now-browse-internet-2023-09-27 |
| saiph.org |
Attestation across the AI Supply Chain Long Posts |
Saiph Savage |
https://saiph.org/ |
| scale.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Scale |
https://scale.com/ |
| science.org |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
Science |
https://science.org/content/blog-post/evaluation-deep-research-performance |
| science.org |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
paper |
https://science.org/doi/10.1126/science.adj0998 |
| scriptorium |
Each Instance of "AI Utility" Stems from Some Human Act(s) of Information Recording and Ranking Long Posts |
scriptorium |
https://scriptorium/ |
| sec.gov |
Attestation across the AI Supply Chain Long Posts |
S-1 |
https://sec.gov/Archives/edgar/data/1713445/000162828024011789/reddit-sx1a3.htm |
| shared-references.pages.dev |
Coding agents are (1) a big deal, (2) very relevant to data leverage, and (3) able to help build tools that support data leverage! Long Posts |
examples |
https://shared-references.pages.dev/projects |
| simile.ai |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
Simile.ai |
https://simile.ai/ |
| simonwillison.net |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
spilled |
https://simonwillison.net/2025/Apr/30/criticism-of-the-chatbot-arena |
| simonwillison.net |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
some |
https://simonwillison.net/2025/Apr/30/criticism-of-the-chatbot-arena |
| site.spawning.ai |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
ai.txt |
https://site.spawning.ai/spawning-ai-txt |
| skylion007.github.io |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
here |
https://skylion007.github.io/OpenWebTextCorpus |
| slideshare.net |
The Paradox of Reuse, Language Models Edition Long Posts |
talk |
https://slideshare.net/dartar/the-sum-of-all-human-knowledge-in-the-age-of-machines-a-new-research-agenda-for-wikimedia-icwsm-15 |
| slsa.dev |
Attestation across the AI Supply Chain Long Posts |
SLSA |
https://slsa.dev/spec/v1.2 |
| smithsonianmag.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
stealing |
https://smithsonianmag.com/smart-news/is-popular-photo-app-lensas-ai-stealing-from-artists-180981281 |
| socialist.net |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
new |
https://socialist.net/marx-s-capital-chapters-15-the-machine |
| spec.c2pa.org |
Attestation across the AI Supply Chain Long Posts |
C2PA |
https://spec.c2pa.org/specifications/specifications/2.3/ai-ml/ai_ml.html |
| sr.ithaka.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Data Deals Tracker |
https://sr.ithaka.org/our-work/generative-ai-licensing-agreement-tracker |
| stacker.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
in |
https://stacker.com/business-economy/30-victories-workers-rights-won-organized-labor-over-years |
| stackexchange.com |
Tipping Points for Content Ecosystems Long Posts |
ratio |
https://stackexchange.com/about |
| stackexchange.com |
Tipping Points for Content Ecosystems Long Posts |
higher |
https://stackexchange.com/about |
| stackexchange.com |
Tipping Points for Content Ecosystems Long Posts |
reported |
https://stackexchange.com/about |
| stackoverflow.blog |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
statements |
https://stackoverflow.blog/2023/04/17/community-is-the-future-of-ai |
| stackoverflow.blog |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
blog post |
https://stackoverflow.blog/2023/04/17/community-is-the-future-of-ai |
| stackoverflow.blog |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
an array |
https://stackoverflow.blog/2025/12/02/introducing-stack-overflow-ai-assist-a-tool-for-the-modern-developer |
| stackoverflow.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
terms |
https://stackoverflow.com/legal/terms-of-service |
| storage.courtlistener.com |
Public AI, Data Appraisal, and Data Debates Long Posts |
documents |
https://storage.courtlistener.com/recap/gov.uscourts.cand.415175/gov.uscourts.cand.415175.391.14.pdf |
| support.anthropic.com |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
help page |
https://support.anthropic.com/en/articles/8525154-claude-is-providing-incorrect-or-misleading-responses-what-s-going-on |
| support.brave.app |
Attestation across the AI Supply Chain Long Posts |
Brave Rewards |
https://support.brave.app/hc/en-us/articles/360027276731-Brave-Rewards-FAQ |
| surgehq.ai |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
Surge |
https://surgehq.ai/ |
| swashapp.io |
Attestation across the AI Supply Chain Long Posts |
Swash |
https://swashapp.io/about |
| techcrunch.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
not |
https://techcrunch.com/2021/01/12/ftc-settlement-with-ever-orders-data-and-ais-deleted-after-facial-recognition-pivot |
| techcrunch.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
agreements |
https://techcrunch.com/2024/05/06/stack-overflow-signs-deal-with-openai-to-supply-data-to-its-models |
| techcrunch.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
Why DeepSeek’s new AI model thinks it’s ChatGPT |
https://techcrunch.com/2024/12/27/why-deepseeks-new-ai-model-thinks-its-chatgpt |
| techcrunch.com |
Perplexity CEO's Interaction with Striking New York Times Workers Does Not Reflect Well on the AI Industry Long Posts |
amount |
https://techcrunch.com/2024/11/04/perplexity-ceo-offers-ai-companys-services-to-replace-striking-nyt-staff |
| techcrunch.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
of |
https://techcrunch.com/2024/07/29/apple-says-it-took-a-responsible-approach-to-training-its-apple-intelligence-models?guccounter=1 |
| techcrunch.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
data transparency |
https://techcrunch.com/2023/05/11/eu-ai-act-mep-committee-votes?guccounter=1 |
| technologyreview.com |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
participation |
https://technologyreview.com/2026/02/10/1132577/a-quitgpt-campaign-is-urging-people-to-cancel-chatgpt-subscriptions |
| technologyreview.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
name |
https://technologyreview.com/2022/09/16/1059598/this-artist-is-dominating-ai-generated-art-and-hes-not-happy-about-it |
| technologyreview.com |
Bing Rewards for the AI Age Long Posts |
goods |
https://technologyreview.com/2018/06/27/141776/lets-make-private-data-into-a-public-good |
| technologyreview.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
art |
https://technologyreview.com/2023/10/23/1082189/data-poisoning-artists-fight-generative-ai |
| technologyreview.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
bloviator. |
https://technologyreview.com/2020/08/22/1007539/gpt3-openai-language-generator-artificial-intelligence-ai-opinion |
| technologyreview.com |
Evaluation Data Leverage: Advances like "Deep Research" Highlight a Looming Opportunity for Bargaining Power Long Posts |
distressing |
https://technologyreview.com/2020/08/07/1006132/software-algorithms-proctoring-online-tests-ai-ethics |
| technologyreview.com |
The Paradox of Reuse, Language Models Edition Long Posts |
raised |
https://technologyreview.com/2022/03/29/1048439/chatbots-replace-search-engine-terrible-idea |
| technologyreview.com |
Tipping Points for Content Ecosystems Long Posts |
concentrated |
https://technologyreview.com/2024/12/18/1108796/this-is-where-the-data-to-build-ai-comes-from |
| techtarget.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
opinions |
https://techtarget.com/searchsoftwarequality/news/252528379/ChatGPT-writes-code-but-wont-replace-developers |
| techtarget.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
data dignity |
https://techtarget.com/searchenterpriseai/definition/data-dignity |
| telegraph.co.uk |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
property land grab |
https://telegraph.co.uk/business/2023/08/21/internets-original-sin-ai-nightmare |
| the-independent.com |
Algorithmic Collective Action With Two Collectives [crosspost] Long Posts |
empirical examples |
https://the-independent.com/arts-entertainment/music/news/taylor-swift-fearless-fans-b1829051.html |
| theatlantic.com |
The Paradox of Reuse, Language Models Edition Long Posts |
article |
https://theatlantic.com/newsletters/archive/2022/12/why-the-rise-of-ai-is-the-most-important-story-of-the-year/672308 |
| theguardian.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
followed suit |
https://theguardian.com/technology/2023/aug/25/new-york-times-cnn-and-abc-block-openais-gptbot-web-crawler-from-scraping-content |
| theguardian.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
The Guardian |
https://theguardian.com/technology/2020/sep/18/wikipedia-edits-have-massive-impact-on-tourism-say-economists |
| theguardian.com |
Bing Rewards for the AI Age Long Posts |
nationalized search engine |
https://theguardian.com/commentisfree/2017/aug/30/nationalise-google-facebook-amazon-data-monopoly-platform-public-interest |
| theguardian.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
op-eds |
https://theguardian.com/commentisfree/2020/sep/08/robot-wrote-this-article-gpt-3 |
| theguardian.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
theft |
https://theguardian.com/commentisfree/2025/sep/10/tech-companies-are-stealing-our-books-music-and-films-for-ai-its-brazen-theft-and-must-be-stopped |
| theguardian.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
The Guardian |
https://theguardian.com/media/2026/jan/12/publishers-fear-ai-search-summaries-and-chatbots-mean-end-of-traffic-era |
| themultiplicity.ai |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
themultiplicity.ai |
https://themultiplicity.ai/ |
| theodi.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
data trusts |
https://theodi.org/insights/projects/defining-a-data-trust |
| theregister.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
slop PRs |
https://theregister.com/2026/02/03/github_kill_switch_pull_requests_ai |
| theverge.com |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
article |
https://theverge.com/2024/9/25/24254042/mark-zuckerberg-creators-value-ai-meta |
| theverge.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
stated |
https://theverge.com/2023/8/21/23840705/new-york-times-openai-web-crawler-ai-gpt |
| theverge.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
updating robots.txt |
https://theverge.com/2023/8/21/23840705/new-york-times-openai-web-crawler-ai-gpt |
| theverge.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
lawsuit |
https://theverge.com/2022/11/8/23446821/microsoft-openai-github-copilot-class-action-lawsuit-ai-copyright-violation-training-data |
| theverge.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
stance |
https://theverge.com/2023/3/15/23640180/openai-gpt-4-launch-closed-research-ilya-sutskever-interview |
| theverge.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
astonishing |
https://theverge.com/21346343/gpt-3-explainer-openai-examples-errors-agi-potential |
| theverge.com |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
interview |
https://theverge.com/2024/9/25/24254042/mark-zuckerberg-creators-value-ai-meta |
| theverge.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
everybody is doing it |
https://theverge.com/2025/1/14/24343692/meta-lawsuit-copyright-lawsuit-llama-libgen |
| theverge.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
legal |
https://theverge.com/23444685/generative-ai-copyright-infringement-legal-fair-use-training-data |
| theverge.com |
The Coding Agent Data Deal Long Posts |
providers |
https://theverge.com/2024/5/22/24162782/openai-licensing-deal-wall-street-journal-news-corp |
| theverge.com |
The Paradox of Reuse, Language Models Edition Long Posts |
pay artists |
https://theverge.com/2022/10/25/23422359/shutterstock-ai-generated-art-openai-dall-e-partnership-contributors-fund-reimbursement |
| theverge.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
legal |
https://theverge.com/2023/1/16/23557098/generative-ai-art-copyright-legal-lawsuit-stable-diffusion-midjourney-deviantart |
| theverge.com |
Tipping Points for Content Ecosystems Long Posts |
coverage |
https://theverge.com/2024/10/16/24268209/anthropic-ai-dario-amodei-agi-funding-blog |
| time.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://time.com/7289692/when-ai-replaces-workers |
https://time.com/7289692/when-ai-replaces-workers |
| time.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
What Happens When AI Replaces Workers? |
https://time.com/7289692/when-ai-replaces-workers |
| time.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
https://time.com/7290751/ai-future-of-work-essay |
https://time.com/7290751/ai-future-of-work-essay |
| time.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
published |
https://time.com/7290751/ai-future-of-work-essay |
| tn.boell.org |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
commons governance problems |
https://tn.boell.org/en/2023/04/19/5-elinor-ostrom-et-les-huit-principes-de-gestion-des-communs |
| towardsdatascience.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
BookCorpus |
https://towardsdatascience.com/dirty-secrets-of-bookcorpus-a-key-dataset-in-machine-learning-6ee2927e8650 |
| twitter.com |
"Many Models" and "Track Changes" for AI: Some Thoughts on LLM Interfaces Long Posts |
people |
https://twitter.com/d3mondev/status/1814068102897787165 |
| twitter.com |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
Spawning |
https://twitter.com/spawning_/status/1603126330261897217 |
| twitter.com |
Bing Rewards for the AI Age Long Posts |
costs of ChatGPT |
https://twitter.com/sama/status/1599671496636780546 |
| twitter.com |
ChatGPT is Awesome and Scary: You Deserve Credit for the Good Parts (and Might Help Fix the Bad Parts) Long Posts |
tweet |
https://twitter.com/elonmusk/status/1599291104687374338 |
| twitter.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
evidence |
https://twitter.com/DominikGutt/status/1636732846948663298?s=20 |
| twitter.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
point |
https://twitter.com/peternixey/status/1640002493630369792?s=20 |
| twitter.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
discussions |
https://twitter.com/random_walker/status/1648322180558606338?s=20 |
| twitter.com |
Don’t give OpenAI all the credit for GPT-3: You might have helped create the latest “astonishing” advance in AI too Long Posts |
code |
https://twitter.com/mattshumer_/status/1287125015528341506 |
| twitter.com |
The Paradox of Reuse, Language Models Edition Long Posts |
underlying training data |
https://twitter.com/nickmvincent/status/1598478685019189248 |
| twitter.com |
The Paradox of Reuse, Language Models Edition Long Posts |
claims |
https://twitter.com/sytelus/status/1598164765385555976 |
| twitter.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
tweet |
https://twitter.com/katiekilkenny7/status/1653811340107194368 |
| unsplash.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
David Smooke |
https://unsplash.com/@smooke |
| unsplash.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
Unsplash |
https://unsplash.com/photos/En_wELYYhD4 |
| unsplash.com |
Bing Rewards for the AI Age Long Posts |
Victor from Unsplash |
https://unsplash.com/photos/c53HvA-blYQ |
| unsplash.com |
Data Leverage Recap: December 2022 - April 2023 Long Posts |
Unsplash contributor Photoholgic |
https://unsplash.com/photos/RGvwatYi0-Q |
| unsplash.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
Unsplash |
https://unsplash.com/ |
| unsplash.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
Kelly Sikkema |
https://unsplash.com/@kellysikkema |
| unsplash.com |
Is Zuckerberg right to say that your specific creative work has no value to AI? Long Posts |
Unsplash |
https://unsplash.com/ |
| unsplash.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
Unsplash |
https://unsplash.com/ |
| unsplash.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
USGS |
https://unsplash.com/@usgs |
| unsplash.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
Unsplash |
https://unsplash.com/ |
| unsplash.com |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
Myke Simon |
https://unsplash.com/@myke_simon |
| unsplash.com |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
Unsplash |
https://unsplash.com/ |
| unsplash.com |
Two natural allies of a "Data Transparency" agenda: capabilities forecasters and social simulators Long Posts |
Hans |
https://unsplash.com/@hansphoto |
| unsplash.com |
Will the New York Times Data Strike Have a Large Impact on ChatGPT? Long Posts |
unsplash |
https://unsplash.com/photos/lEBDlbXLEgs |
| upload.wikimedia.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
version |
https://upload.wikimedia.org/wikipedia/commons/1/19/ICWSM15_Wikimedia_Talk.pdf |
| usenix.org |
How do we know our AI output is good? Double checks, bar charts, vibes, and training data. Long Posts |
training data extraction |
https://usenix.org/conference/usenixsecurity21/presentation/carlini-extracting |
| users.ssc.wisc.edu |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
collective action |
https://users.ssc.wisc.edu/~oliver/PROTESTS/ArticleCopies/OliverMarwellCritMassI.pdf |
| vanityfair.com |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
article |
https://vanityfair.com/news/story/meta-ai-lawsuit |
| virginia-eubanks.com |
A Harbinger of the Future of Content? The New York Times Starts a Data Strike Long Posts |
variety |
https://virginia-eubanks.com/automating-inequality |
| vmst.io |
AI Artist or AI Art Thief? Innovation, Public Mandates, and the Case for Talking in Terms of Leverage Long Posts |
begun |
https://vmst.io/@selzero/109512557990367884 |
| vox.com |
Building a Data Pipeworks for Democratic AI: From Human Knowledge to Records to AI Systems Long Posts |
automation |
https://vox.com/future-perfect/23787024/power-progress-book-ai-history-future-economy-daron-acemoglu-simon-johnson |
| washingtonpost.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Washington Post |
https://washingtonpost.com/technology/2025/02/25/chegg-google-ai-lawsuit |
| washingtonpost.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
Washington Post |
https://washingtonpost.com/technology/2025/08/08/wikipedia-ai-generated-mistakes-editors |
| weval.org |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
WeVal |
https://weval.org/ |
| wga.org |
The WGA Strike is a Canary in the Coal Mine for AI Labor Concerns Long Posts |
asks |
https://wga.org/uploadedfiles/members/member_info/contract-2023/WGA_proposals.pdf |
| whitehouse.gov |
Attestation across the AI Supply Chain Long Posts |
public procurement guidance |
https://whitehouse.gov/wp-content/uploads/2024/03/M-24-10-Advancing-Governance-Innovation-and-Risk-Management-for-Agency-Use-of-Artificial-Intelligence.pdf |
| wired.com |
Live by the free-content-for-training sword, die by the free-content-for-training sword Long Posts |
policy |
https://wired.com/story/deepseek-ai-china-privacy-data |
| wsj.com |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
internal |
https://wsj.com/tech/ai/openai-ceo-altman-defends-pentagon-work-to-staff-calls-backlash-really-painful-76d769ec |
| wsj.com |
Bing Rewards for the AI Age Long Posts |
decided |
https://wsj.com/articles/microsoft-puts-caps-on-new-bing-usage-after-ai-chatbot-offered-unhinged-responses-39c3252f |
| wsj.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
here |
https://wsj.com/articles/ai-chatgpt-dall-e-microsoft-rutkowski-github-artificial-intelligence-11675466857?mod=article_inline |
| wsj.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
coverage |
https://wsj.com/articles/chatgpt-ai-artificial-intelligence-openai-personal-writing-5328339a |
| wsj.com |
Reddit, StackOverflow, and Europe: All Trending Towards Data Dignity Long Posts |
reported |
https://wsj.com/articles/europe-to-chatgpt-disclose-your-sources-863ef330 |
| x.com |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
tweet |
https://x.com/AndrewCurran_/status/1914045840265789540 |
| x.com |
[microblog] One book is worth "0.06%" benchmark points to AI; is "no different from noise". What gives? Long Posts |
tweet |
https://x.com/giffmana/status/1914245144422776906 |
| x.com |
AI is driving the cost of polish down; some musings on fancy versus terse artifacts Long Posts |
tweet |
https://x.com/alexolegimas/status/2037173080741523742?s=20 |
| x.com |
AI Labs Should Open Source Data Protection Technologies Long Posts |
responded |
https://x.com/kevinroose/status/1884488496649494810 |
| x.com |
Almost Everybody -- Including Both Data Creators and AI Companies -- Stands to Benefit from Clearer "Data Rules". Long Posts |
tweet |
https://x.com/IvanVendrov/status/1988134758199578993?s=20 |
| x.com |
April 2026 small points Meta Notes |
https://x.com/nickmvincent/status/2048833224047296622 |
https://x.com/nickmvincent/status/2048833224047296622 |
| x.com |
How collective bargaining for information, public AI, and HCI research all fit together Long Posts |
Tweet |
https://x.com/iamtrask/status/1971197830258950236 |
| x.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
figures |
https://x.com/BernieSanders/status/1930613586331635830 |
| x.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
tweet |
https://x.com/DavidDuvenaud/status/1928682244052496446 |
| x.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
tweet |
https://x.com/bcmerchant/status/1929654074875826315 |
| x.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
summary |
https://x.com/luke_drago_/status/1915376929542111353 |
| x.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
tweet |
https://x.com/Hesamation/status/2011251156467794250?s=20 |
| x.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
tweet |
https://x.com/alexolegimas/status/2011432956984996294 |
| x.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
contention |
https://x.com/tszzl/status/2011506847036162177 |
| x.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
expressed interest |
https://x.com/JeffDean/status/1732461004901282238 |
| x.com |
Which datasets should we assume are "in all the AI models"? Long Posts |
discussion |
https://x.com/JesseDodge/status/1732444597593203111?s=20 |
| yalelawjournal.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
A Relational Theory of Data Governance |
https://yalelawjournal.org/feature/a-relational-theory-of-data-governance |
| yalelawjournal.org |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
A Relational Theory of Data Governance |
https://yalelawjournal.org/pdf/131.2_Viljoen_1n12myx5.pdf |
| ybrikman.com |
The Paradox of Reuse in 2026: A Case of Quasi-Enclosure, or "Subsidized Club Goods that Sort of Look Like Public Goods" Long Posts |
blog post |
https://ybrikman.com/blog/2026/01/21/gen-ai-snake-eating-its-own-tail |
| youtube.com |
A Short Guide to Data Strikes and Conscious Data Contribution in the Context of 2026 Frontier AI Long Posts |
coverage |
https://youtube.com/watch?v=Zj35mEtwUvY |
| youtube.com |
On AI-driven Job Apocalypses and Collective Bargaining for Information Long Posts |
podcast |
https://youtube.com/watch?v=nG_jGZuRBxs |
| youtube.com |
The AI "Evaluation Crisis" Is an Opportunity to Get Data Flow Right Long Posts |
“AI’s original sin” |
https://youtube.com/watch?v=CJWPezMVNdQ |