R&D Tax Credit Process & Documentation

Minutes Read

Published

October 7, 2025

Updated

October 7, 2025

R&D Tax Credit Guide for SaaS AI and Machine Learning Startups

Learn how to document R&D tax credit for AI startups by properly tracking machine learning project expenses and qualifying development activities for the IRS.

Glencoyne Editorial Team

The Glencoyne Editorial Team is composed of former finance operators who have managed multi-million-dollar budgets at high-growth startups, including companies backed by Y Combinator. With experience reporting directly to founders and boards in both the UK and the US, we have led finance functions through fundraising rounds, licensing agreements, and periods of rapid scaling.

Need clarity on your startup’s finances? Contact Glencoyne.

Understanding the R&D Tax Credit for AI and Machine Learning Startups

For an early-stage AI or machine learning startup, every dollar of runway counts. While you focus on building your product and finding market fit, a significant source of non-dilutive capital often goes overlooked: the US R&D tax credit. For qualifying startups, this incentive can provide up to $500,000 in cash back annually. Crucially, this is not just a future tax deduction. For startups with limited or no income tax liability, the credit can be claimed against payroll taxes, resulting in a direct cash refund from the IRS.

This article provides a practical framework for how to document R&D tax credit for AI startups, enabling you to claim the credit you have earned without derailing your engineering team. We will address common confusion over qualifying R&D activities for startups, the documentation required, and the risk of misclassifying expenses.

The R&D Credit in Plain English

At its core, the R&D tax credit is a government incentive designed to encourage innovation within the United States. For pre-seed to Series B startups, its most powerful feature is the ability to offset payroll taxes. This means that even if you are not yet profitable, you can receive a quarterly cash refund, directly improving your cash flow and extending your runway.

The process involves identifying your qualified research activities and the expenses associated with them, known as Qualified Research Expenses (QREs). These calculations are documented and claimed using IRS Form 6765. The reality for most startups is more pragmatic: you do not need an enterprise-grade system, but you do need a consistent, lightweight process. The goal is to translate the technical work your team is already doing into a language the IRS understands, turning your innovation into a tangible financial asset. This is a crucial piece of US startup tax benefits that founders should not ignore.

Qualifying R&D Activities for Startups: The IRS Four-Part Test for AI

The central challenge for many founders is determining which of their team's activities count as qualified research. The IRS uses a four-part test to make this determination, and confusion over these requirements often leads startups to either underclaim or misclassify their work. Let's translate this test specifically for AI and machine learning software development.

1. Permitted Purpose Test
Your research must aim to create a new or improved business component, which can be a product, process, software, technique, or formula. For an AI startup, this could be developing a novel algorithm for a recommendation engine, creating a proprietary data labeling process, or improving the performance of a natural language processing model to handle industry-specific terminology.

2. Elimination of Uncertainty Test
This is the most critical test for software engineering tax incentives. You must demonstrate that you sought to eliminate uncertainty concerning the capability, method, or appropriate design of your product. This is a technical uncertainty, not a market uncertainty like 'will customers buy it?'. For example, you might be uncertain if a new neural network architecture can achieve a 95% accuracy rate on your proprietary dataset, whether a model can be optimized to run with acceptable latency for real-time user interaction, or how to overcome data sparsity issues in your training set.

3. Process of Experimentation Test
You must show a systematic process of evaluating one or more alternatives to eliminate that uncertainty. This involves modeling, simulation, systematic trial and error, or other evaluative methods. For an ML team, this is your daily work. It includes A/B testing different models, adjusting hyperparameters, trying different data augmentation techniques, performing feature engineering, and comparing the performance of each iteration against a baseline. Documenting these experiments is key.

4. Technological in Nature Test
The experimentation process must fundamentally rely on principles of the physical or biological sciences, engineering, or computer science. For any AI/ML company, this test is generally the easiest to meet. Your work is grounded in computer science, mathematics, and statistics.

Example in Practice: Consider a SaaS startup building a tool to detect fraud in financial transactions. Their permitted purpose is the fraud detection software itself. The technical uncertainty is whether their custom-built graph neural network can identify complex fraud rings more effectively than existing logistic regression models. Their process of experimentation involves training both models on the same dataset, comparing precision and recall metrics, and iterating on the network's architecture to improve results. This work is fundamentally technological in nature, relying on principles of computer science and machine learning.

Avoiding the Internal Use Software (IUS) Trap

A critical distinction exists for Internal Use Software (IUS), which is software developed for your own general and administrative functions, like finance, HR, or internal project management. IUS is subject to a much stricter 'high threshold of innovation' test, requiring the software to be unique, novel, and involve significant economic risk. For most AI-driven SaaS companies, software developed as part of a service you offer to customers is not considered IUS. However, if you are building a highly custom internal tool, be prepared for this higher standard of scrutiny.

How to Document R&D Tax Credit for AI Startups Without Driving Engineers Crazy

An R&D credit claim is only as strong as its documentation. The IRS requires 'contemporaneous documentation', meaning it should be created as the work happens, not months later. This is where many startups stumble, fearing a heavy, bureaucratic process. The key is to integrate documentation into the tools your engineers already use, such as Jira, Linear, and GitHub. This approach minimizes disruption and creates a robust audit trail.

Integrate with Project Management Tools

A scenario we repeatedly see is teams leveraging their existing ticketing systems. The simplest method is to create a tag, label, or custom field (e.g., R&D-Eligible) in Jira or Linear. Engineers apply this tag to tasks, stories, or epics that involve solving a technical uncertainty. For instance, a ticket titled 'Spike: Evaluate new vector database for query latency' would be tagged, while a ticket like 'Update button color on login page' would not. This practice creates a clear, auditable link between time spent and qualified activities, forming the foundation of your AI software development tax documentation.

Leverage Your Version Control System

Your Git history is a powerful and often overlooked source of documentation. The quality of commit messages can make or break their usefulness for an R&D claim. Vague messages create ambiguity, while descriptive ones provide clear evidence of experimentation.

Poor Commit Message: feat: updated model
High-Quality Commit Message: feat(model): Experiment with GELU activation to resolve vanishing gradient. ReLU test yielded 78% accuracy; GELU test yielded 81%. Committing GELU version.

The high-quality message clearly describes a process of experimentation, linking a specific code change to the elimination of a technical uncertainty. Encourage your engineers to explain the 'why' behind their changes, not just the 'what'. This transforms a simple log into a powerful narrative for an R&D study.

Calculating Your Qualified Research Expenses (QREs)

Once you know which activities qualify, the next step is to calculate the associated costs. For US companies, QREs generally fall into three categories. Correctly tracking R&D costs for AI is essential, as misclassifying these expenses is a primary reason startups lose out on the credit or face audit challenges.

1. Wages

This category includes the taxable wages of employees in the United States who are directly performing, supervising, or supporting qualified R&D activities. This often includes software engineers, data scientists, and product managers involved in technical experimentation. To simplify tracking, the IRS provides a 'Substantially All' rule. This rule allows you to include 100% of an employee's wages in your QREs if they spend 80% or more of their time on qualified R&D.

Numeric Example: An engineer listed in your QuickBooks payroll has an annual salary of $160,000. Your project records from Jira show she spent 85% of her time developing a new machine learning algorithm (a qualified activity) and 15% on customer support (a non-qualified activity). Because her qualified time exceeds the 80% threshold, you can include her entire $160,000 salary in your QRE calculation.

2. Contractor Costs

You can include payments to contractors conducting R&D on your behalf. The critical rule here is geography. Contractor fees only qualify if the research work is performed within the United States. This is a frequent mistake for startups with distributed teams. Payments to an offshore development team, no matter how innovative their work, are not eligible for the US R&D credit.

3. Cloud Computing Costs

This is a significant and relatively new category for machine learning project expenses. As of 2022, costs for cloud computing services (e.g., AWS, GCP, Azure) used for development and testing are includable as QREs. This covers expenses for services used in the direct conduct of R&D, such as compute power for model training, servers for a staging environment, or data storage for research datasets. The distinction is crucial: costs for hosting your live production application for customers are not qualified. To separate these costs, use resource tagging in your cloud provider's console. For example, in AWS you can tag all resources related to development with a tag like Environment:R&D. You can then run a cost report filtered by this tag to easily isolate your qualified cloud spend.

Actionable Next Steps for Founders

Successfully claiming the R&D credit doesn't require a dedicated finance team or a complex accounting system. It requires a pragmatic, consistent process built upon the tools your team already uses. For founders worried about capital, this process translates into direct cash that can fund another hire or extend your operational life.

Here are your immediate next steps:

Review the Four-Part Test with Your Team: Hold a brief session to discuss which of your current projects involve technical uncertainty and a process of experimentation. Identify and agree upon your core qualifying R&D activities for startups.
Implement a Tagging System: In Jira, Linear, or your chosen project management tool, create a simple tag like RD-Eligible. Train your engineering team to apply it consistently to work that meets the criteria discussed.
Improve Commit Message Hygiene: Share examples of high-quality Git commit messages and encourage engineers to write descriptive notes that explain the 'why' behind their technical experiments. This provides invaluable contemporaneous evidence.
Isolate Your R&D Cloud Costs: If you have not already, implement a resource tagging strategy in AWS, GCP, or Azure to separate R&D and staging environments from production. This will make calculating your cloud QREs straightforward and defensible.

By embedding these simple habits into your workflow, you create a defensible and auditable record of your innovation. This effort provides a direct financial return, turning your team's hard work into a valuable source of non-dilutive funding when you file IRS Form 6765.

Frequently Asked Questions

Q: Can our startup claim the R&D credit if we are not yet profitable?
A: Yes. Qualifying startups can elect to apply up to $500,000 of the R&D credit against their payroll tax liability, even with no income tax to offset. This results in a direct cash refund from the IRS, making it a powerful source of non-dilutive funding for pre-revenue or unprofitable companies.

Q: What are the most common documentation mistakes AI startups make?
A: The most common mistakes include relying on vague Git commit messages, failing to consistently tag R&D tasks in project management tools like Jira, and not properly separating R&D cloud computing costs from production costs. These omissions make it difficult to substantiate a claim during an audit.

Q: Does refactoring existing code qualify for the R&D tax credit?
A: It depends on the purpose. If refactoring is part of a process of experimentation to resolve a specific technical uncertainty, such as improving performance to meet a critical latency threshold, it may qualify. However, routine code cleanup done simply to improve readability or maintainability typically does not.

This content shares general information to help you think through finance topics. It isn’t accounting or tax advice and it doesn’t take your circumstances into account. Please speak to a professional adviser before acting. While we aim to be accurate, Glencoyne isn’t responsible for decisions made based on this material.

Or Learn more