Blog Tags for Trainings #633

stevepiercy · 2022-08-31T08:13:41Z

Consider using blog-style tags for trainings for searching.

See an example implementation at: https://sphinx-tags.readthedocs.io/en/latest/quickstart.html

acsr · 2025-03-14T10:31:20Z

@stevepiercy I second this approach. Can we simply add sphinx-tags as extension to overcome the hen egg problem. And then add a more concrete example how to use that option in the MyST Reference section. If you can suggest a different order of execution (eg. testing in a branch) let me know. I think this cannot break stuff, is easy to remove and would help a lot. We can even hide tags using CSS until release of a serious amount of tags.

acsr · 2025-03-14T10:39:59Z

By the way: Suggesting tags with some guidance is a typical task for a dumb LLM based AI-Agent in the postprocessing of content.

Take this content by Markdown URL list as given (nothing else)
Take these existing tags as given: [List]
Go through the listet content items/urls by using the rendered versions as well – one by one and add tags in the Markdown source.
Suggest missing tags and offer them as a second step for review.
Add the suggested tags except those of a given exclude list to the Markdown as a second step.
Create one pull request per task (needs some refinement)
Write a proper change log per session for content editors
Write a news style changelog for consumers of the trainings to catch up with latest changes

Note: Using AI here does not rise any copyright issues at all.

stevepiercy · 2025-03-14T11:24:09Z

I don't think tags add value, given the PLIP plone/Products.CMFPlone#4097.

As such, I personally have zero interest in doing the work to implement this extension, as it subverts the long-term goal. I think we should close this issue.

acsr · 2025-03-14T12:28:19Z

I cannot get exactly where the PLIP plone/Products.CMFPlone#4097 adresses the purpose of tags.

Tags are part of SEO and information architecture in knowledge management. Even with a upcoming Nuclia search engine.

Why? Tags (or keywords) come into consideration when the actual content lacks terms that help to reflect all synonyms or relationships or contexts the written content is relevant for. There are also usecases where a term belongs to a particular namespace or its would help to insert it into different taxonomies. It avoids bloating the content itself. There are tags that are synonym to terms and tags that are synthetic creations to to take them out of existing namespaces and make them unique.

If you can point me to the exact point in your PLIP that adresses this purpose before closing this would help my understanding.

stevepiercy · 2025-03-14T20:15:58Z

You didn't mention SEO until now. In any case, we don't have a problem with SEO. https://duckduckgo.com/?t=ffab&q=plone+training&atb=v190-1&ia=web

We barely have enough interest from authors to contribute trainings, much less maintain them, and even less to create and maintain tags. It would fall to me, and I don't have the interest to do this. I have much, much higher priorities, like having Nuclia search results, as explained in the PLIP. A good search tool would obviate the need for tags. The problem is easily finding content, and a good search beats tags every time.

I also don't have the interest to mentor a first-timer through the process.

I have negative interest in bringing AI into the picture to "do it for us". It just sounds like more work, to place the responsibility of curation onto an AI. Its output would still require curation by a human.

If this is something that you feel strongly enough to take on and maintain or mentor a first-timer through, please do. It's very low priority.

acsr · 2025-03-14T21:28:51Z

I have negative interest in bringing AI into the picture to "do it for us". It just sounds like more work, to place the responsibility of curation onto an AI. Its output would still require curation by a human.

I agree with your personal priorities.

Actually a good search like algolia, typesense or nuclia need to use machine learning aka AI to improve synonym or context improvement. To make this happen you need to map synonyms not available in the content to their counterparts. In Algolia this was a manual task years ago similar to adding tags. Leaving this to a "good" search engine is uncurated AI as well. The difference when adding tags, is that the assignment is visible and allows to visit aggregated similar content. Much closer than current Sphinx results.

SEO needs in fact no special mentioning, since searching relates to finding (at least a bit).
All AI or classic full text search is failing if the content lacks proper markup or content or the engine lacks access to context. Metadata is king. Tags are metadata. AI succeeds in semantic context resolution by using much longer distance vector chains.

In any case, we don't have a problem with SEO. https://duckduckgo.com/?t=ffab&q=plone+training&atb=v190-1&ia=web

An example when a search engine finds obvious content is no proof how to find stuff those search engines do not discover due to lacking context.

Thanks for leaving this at low prio. We need to see how fast nuclia comes into range and if this vaporizes as you expect. I am open.

The increasing quality of results by LLMs creating Python code has only one origin: The excellent Sphinx documentation markup of the standard library.

stevepiercy · 2025-03-14T22:07:38Z

See the work that went into configuring Nuclia to do exactly that in the open pull requests in Training linked in the PLIP.

We use https://pypi.org/project/sphinxext-opengraph/ as well for SEO. Full list of extensions: https://6.docs.plone.org/contributing/documentation/themes-and-extensions.html#extensions

stevepiercy added the 04 type: enhancement label Aug 31, 2022

acsr added the 99 use case for AI assistance Generative AI ist critical for content creation. Marks use cases for assistant technology. label Mar 14, 2025

stevepiercy added the 14 prio: low label Mar 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Blog Tags for Trainings #633

Blog Tags for Trainings #633

stevepiercy commented Aug 31, 2022

acsr commented Mar 14, 2025 •

edited

Loading

acsr commented Mar 14, 2025

stevepiercy commented Mar 14, 2025

acsr commented Mar 14, 2025 •

edited

Loading

stevepiercy commented Mar 14, 2025

acsr commented Mar 14, 2025

stevepiercy commented Mar 14, 2025

Blog Tags for Trainings #633

Blog Tags for Trainings #633

Comments

stevepiercy commented Aug 31, 2022

acsr commented Mar 14, 2025 • edited Loading

acsr commented Mar 14, 2025

stevepiercy commented Mar 14, 2025

acsr commented Mar 14, 2025 • edited Loading

stevepiercy commented Mar 14, 2025

acsr commented Mar 14, 2025

stevepiercy commented Mar 14, 2025

acsr commented Mar 14, 2025 •

edited

Loading

acsr commented Mar 14, 2025 •

edited

Loading