How AI is Reshaping Technical Writing: Metadata • Fluid Topics

Mar 26, 2025 | Reading Time: 4 minutes

In the first article of this series, “How AI is Reshaping Technical Writing: Structure”, we explored structured content — what it is, how to do it, and why it’s crucial to the Content Value Path. The Content Value Path is our solution to mitigating the challenges technical documentation teams face when looking to deliver content to new technologies like AI applications.

In this second part of our three-part series, we’re diving into the next core element: metadata. Read on to uncover what metadata is, how it connects to structure from part one, what to consider when choosing metadata, and how automations can simplify this second step.

What is Metadata?

Metadata is data that describes other data.

There are several types of metadata with different purposes and added value:

Content Management Metadata: This includes tags such as the date a topic was created, content author, and publishing status.
Editorial Metadata: They describe the context and applicability of the content and, as such, are part of the content itself. This could be the product version the content applies to, the type of task described (i.e. installation, maintenance, etc.), or the required level of expertise.

Metadata can consist of dates, numbers, free keywords or, as we typically see in documentation, labels taken from controlled lists (i.e. flat or hierarchical lists used for taxonomies).

Additionally, teams may attach metadata to both topics and maps. In the case of maps, the metadata applies to all topics within the map.

different metadata types at the top of documents

How Do Metadata Relate to Content Structure?

The link between structure and metadata is simple. Metadata must unambiguously apply to all content within a topic to be meaningful and usable. Take, for example, a topic that contains information on both the installation and maintenance of a product. In this case, the topic needs to split into two distinct topics, with one containing information about the product installation and the other containing information about the product maintenance. By doing so, teams label topics more accurately so users and chatbots can find the information most relevant to their needs.

How Do You Choose Metadata?

Naturally, one of the biggest questions is how to choose metadata and with which values. There are many options and recommended best practices available, yet no universal answer. Each company’s ideal metadata depends on their products, their content, and how they intend for users to interact with their content.

Here are some typical use cases:

Use metadata to create filters, also called facets, in your enterprise search engine so users can refine their queries. For example, users could choose to only see search results pertaining to a specific product or version. This improves search result relevance, helping users get the answers they need faster.
Use in-product help, also called contextual help. These embedded tools use contextual information linked to the exact version of the product and its configuration to provide users with the right content as quickly as possible.

Include a QR code directly on machines for on-site interventions. Then, the engineer, operator, or technician simply scans the code to access the exact maintenance documentation for that machine. This works because the QR code connects to a list of all metadata related to the machine’s subsystems and conditions, then uses that list to filter content.

For companies starting at the beginning of the metadata process, or for those who have chosen some metadata but aren’t sure how to proceed, we recommend you follow the steps below.

First, develop some use cases involving storytelling and characters based on your customer profiles. Think about where typical users encounter a problem or a question related to your product.
Next, identify the metadata they would need for support in these scenarios. Which search filter criteria would be necessary to extract the most relevant content?
Align the metadata with the content. Here, you may have to adapt your content’s granularity, as mentioned in the first article on Structure.

If this last step feels somewhat daunting or even beyond your reach because you have thousands or even tens of thousands of topics, don’t worry. You’re not alone, and technology is here to help.

How to Automate Content Metadata Tagging?

Today’s automatic classification algorithms use the latest technological advances in Artificial Intelligence, and they are extremely accurate. They learn from a representative set of manually labeled topics (the supervised learning phase). Then, they proceed with tagging metadata on their own (the automatic classification phase). Consequently, with just a few hundred pre-labeled topics, you can tag thousands or even millions of topics in a matter of minutes.

Your team can also extend this method to tag content from any other sources (wikis, knowledge bases) to benefit from fully aligned documentation. Here, it’s important to note the need for topics with the right level of granularity. The more focused the topic’s content is, the more precise the algorithm’s labeling will be. This is particularly important for topics used for training the system. When done correctly, automatic tagging is equally as accurate as human-led labeling.

Continuing on the Content Value Path

Metadata and structure form two of the three essential pillars to developing a Content Value Path that prepares your content for new technologies like AI projects. Metadata has a direct effect on how easy it is for users to find the right content when they need it. In parallel, metadata impacts how relevant and personalized AI-generated responses are in user-facing tools like chatbots. While getting started may take some planning and manual tagging, new technologies exist to facilitate the metadata labeling process.

If you missed part one of this series on Structure, it’s not too late to go back and read up on the first requirement for modern documentation success. Otherwise, don’t miss the third article “How AI is Reshaping Technical Writing: Semantic Enablement” where we tackle the final component of the Content Value Path.

About The Author

Fabrice Lacroix

Fabrice is Fluid Topics visionary thinker. By tirelessly meeting clients, prospects and partners, he is sensing the needs of the market and fueling his creativity to invent the functions that makes Fluid Topics the market leading solution for technical content dynamic delivery.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
Font Awesome	1 years 30 days	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
Instant page	1 years 30 days	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
lang	session	This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
pll_language	1 year	The pll _language cookie is used by Polylang to remember the language selected by the user when returning to the website, and also to get the language information when not available in another way.
Polylang	1 years 19 days 15 minutes	This cookie is set by Polylang plugin for WordPress powered websites. The cookie stores the language code of the last browsed page.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_132D9JVQ7V	2 years	This cookie is installed by Google Analytics.
_gat_UA-58488653-1	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
_gcl_au	3 months	Provided by Google Tag Manager to experiment advertisement efficiency of websites using their services.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_hjAbsoluteSessionInProgress	30 minutes	Hotjar sets this cookie to detect the first pageview session of a user. This is a True/False flag set by the cookie.
_hjFirstSeen	30 minutes	Hotjar sets this cookie to identify a new user’s first session. It stores a true/false value, indicating whether it was the first time Hotjar saw this user.
_hjIncludedInPageviewSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's pageview limit.
_hjIncludedInSessionSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's daily session limit.
_hjTLDTest	session	To determine the most generic cookie path that has to be used instead of the page hostname, Hotjar sets the _hjTLDTest cookie to store different URL substring alternatives until it fails.
CONSENT	16 years 3 months 8 days 15 hours	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
Pardot		The cookie is set when the visitor is logged in as a Pardot user.

Cookie	Duration	Description
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
NID	6 months	NID cookie, set by Google, is used for advertising purposes; to limit the number of times the user sees an ad, to mute unwanted ads, and to measure the effectiveness of ads.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
_hjSession_2701483	30 minutes	No description
_hjSessionUser_2701483	1 year	No description
_lfa_test_cookie_stored	past	No description
AnalyticsSyncHistory	1 month	No description
BIGipServerab08web-nginx-app_https	session	No description
cookielawinfo-checkbox-functional	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
FT_LOCALES	1 year	No description
FT_SESSION	1 month	No description
li_gc	2 years	No description
lpv880192	30 minutes	No description
route	12 hours	No description available.
UserMatchHistory	1 month	Linkedin - Used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
visitor_id880192	10 years	No description
visitor_id880192-hash	10 years	No description

What is Metadata?

How Do Metadata Relate to Content Structure?

How Do You Choose Metadata?

How to Automate Content Metadata Tagging?

Continuing on the Content Value Path

Looking for our logo?