Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove Google Analytics from the Pinot Website #14840

Open
niallkp opened this issue Jan 18, 2025 · 1 comment
Open

Remove Google Analytics from the Pinot Website #14840

niallkp opened this issue Jan 18, 2025 · 1 comment

Comments

@niallkp
Copy link

niallkp commented Jan 18, 2025

Hi Pinot Team

The ASF Privacy Policy[1][2] does not permit the use of Google Analytics on any ASF websites and the ASF Infra team will soon enforce a Content Security Policy(CSP) that will block access to external trackers:

Please could you remove the use of the Google Analytics from the Pinoit website (I will submit a PR request shortly to do that)?

The ASF hosts its own Matomo instance to provide projects with analytics and you can request a tracking id for your project by sending a mail to privacy AT apache.org.

Additionally I would recommend reviewing any external resources loaded by your website. The Content Security Policy will prevent any resources being loaded from 3rd Party providers that the ASF does not have a Data Processing Agreement (DPA) with. On the 1st February Infra will begin a temporary "brownout" when the CSP will be turned on for a short period. This will allow projects to check which parts, if any, of their websites will stop working. The Privacy FAQ answers a number of questions about which external providers are permitted or not:

Thanks

Niall

[1] https://privacy.apache.org/policies/website-policy.html
[2] https://privacy.apache.org/faq/committers.html#can-i-use-google-analytics

@niallkp
Copy link
Author

niallkp commented Jan 29, 2025

Thanks for merging the PR - much appreciated. However, I have just re-scanned the new-static-prod branch of the pinot repo and there are 129 files that still use Google Analytics.

  • File .htaccess has a Content-Security-Policy
  • There are 128 html & text file with references to google-analytics.com and googletagmanger.com:
about.html
about.txt
blog.html
blog.txt
download.html
download.txt
powered-by.html
powered-by.txt
who-uses.html
who-uses.txt
who_uses.html
who_uses.txt
blog/2015/05/16/LinkedIn-Scaling.html
blog/2015/05/16/LinkedIn-Scaling.txt
blog/2015/06/10/Open-Sourcing-Pinot.html
blog/2015/06/10/Open-Sourcing-Pinot.txt
blog/2017/09/17/Restaurant-Manager.html
blog/2017/09/17/Restaurant-Manager.txt
blog/2019/01/09/LinkedIn-IntroThirdEye.html
blog/2019/01/09/LinkedIn-IntroThirdEye.txt
blog/2019/06/14/LinkedIn-StarTree.html
blog/2019/06/14/LinkedIn-StarTree.txt
blog/2019/07/11/LinkedIn-AutoTune.html
blog/2019/07/11/LinkedIn-AutoTune.txt
blog/2020/01/15/Pinot-Presto-SQL.html
blog/2020/01/15/Pinot-Presto-SQL.txt
blog/2020/02/20/LinkedIn-Thirdeye.html
blog/2020/02/20/LinkedIn-Thirdeye.txt
blog/2020/04/10/DevBlog-AnalyzeGitEvents.html
blog/2020/04/10/DevBlog-AnalyzeGitEvents.txt
blog/2020/06/25/LinkedIn-SmartAlerts.html
blog/2020/06/25/LinkedIn-SmartAlerts.txt
blog/2020/06/29/LinkedIn-TalentInsight.html
blog/2020/06/29/LinkedIn-TalentInsight.txt
blog/2020/07/14/LinkedIn-BatchRealtime.html
blog/2020/07/14/LinkedIn-BatchRealtime.txt
blog/2020/07/28/DevBlog-AnomalyDetection.html
blog/2020/07/28/DevBlog-AnomalyDetection.txt
blog/2020/07/28/DevBlog-DevUpStack.html
blog/2020/07/28/DevBlog-DevUpStack.txt
blog/2020/08/08/DevBlog-IngestPlugins.html
blog/2020/08/08/DevBlog-IngestPlugins.txt
blog/2020/08/08/DevBlog-PinotMonitoring.html
blog/2020/08/08/DevBlog-PinotMonitoring.txt
blog/2020/08/08/DevBlog-ScalarUDFs.html
blog/2020/08/08/DevBlog-ScalarUDFs.txt
blog/2020/08/08/DevBlog-SLAApps.html
blog/2020/08/08/DevBlog-SLAApps.txt
blog/2020/10/16/DevBlog-TwitterTrollAnalysis.html
blog/2020/10/16/DevBlog-TwitterTrollAnalysis.txt
blog/2020/10/20/Uber-Operating.html
blog/2020/10/20/Uber-Operating.txt
blog/2020/12/01/LinkedIn-Lamda.html
blog/2020/12/01/LinkedIn-Lamda.txt
blog/2021/01/08/DevBlog-DebeziumCDC.html
blog/2021/01/08/DevBlog-DebeziumCDC.txt
blog/2021/02/02/DevBlog-PrestoPinot.html
blog/2021/02/02/DevBlog-PrestoPinot.txt
blog/2021/04/08/DevBlog-UpsertsIntro.html
blog/2021/04/08/DevBlog-UpsertsIntro.txt
blog/2021/04/16/LinkedIn-Theta.html
blog/2021/04/16/LinkedIn-Theta.txt
blog/2021/04/27/DevBlog-PinotInRetailChain.html
blog/2021/04/27/DevBlog-PinotInRetailChain.txt
blog/2021/04/29/Uber-Charon.html
blog/2021/04/29/Uber-Charon.txt
blog/2021/06/13/DevBlog-Geospatial.html
blog/2021/06/13/DevBlog-Geospatial.txt
blog/2021/06/16/LinkedIn-TextAnalytics.html
blog/2021/06/16/LinkedIn-TextAnalytics.txt
blog/2022/04/04/Announcing-Apache-Pinot-0-10.html
blog/2022/04/04/Announcing-Apache-Pinot-0-10.txt
blog/2022/08/02/GapFill-Function-For-Time-Series-Datasets-In-Pinot.html
blog/2022/08/02/GapFill-Function-For-Time-Series-Datasets-In-Pinot.txt
blog/2022/11/08/Apache Pinot-How-do-I-see-my-indexes.html
blog/2022/11/08/Apache Pinot-How-do-I-see-my-indexes.txt
blog/2022/11/17/Apache Pinot-Inserts-from-SQL.html
blog/2022/11/17/Apache Pinot-Inserts-from-SQL.txt
blog/2022/11/22/Apache-Pinot-Timestamp-Indexes.html
blog/2022/11/22/Apache-Pinot-Timestamp-Indexes.txt
blog/2022/11/28/Apache-Pinot-Pausing-Real-Time-Ingestion.html
blog/2022/11/28/Apache-Pinot-Pausing-Real-Time-Ingestion.txt
blog/2023/01/29/Apache-Pinot-Deduplication-on-Real-Time-Tables.html
blog/2023/01/29/Apache-Pinot-Deduplication-on-Real-Time-Tables.txt
blog/2023/02/21/Apache-Pinot-0-12-Configurable-Time-Boundary.html
blog/2023/02/21/Apache-Pinot-0-12-Configurable-Time-Boundary.txt
blog/2023/03/30/Apache-Pinot-0-12-Consumer-Record-Lag.html
blog/2023/03/30/Apache-Pinot-0-12-Consumer-Record-Lag.txt
blog/2023/05/11/Geospatial-Indexing-in-Apache-Pinot.html
blog/2023/05/11/Geospatial-Indexing-in-Apache-Pinot.txt
blog/2023/05/16/star-tree-indexes-in-apache-pinot-part-1-understanding-the-impact-on-query-performance.html
blog/2023/05/16/star-tree-indexes-in-apache-pinot-part-1-understanding-the-impact-on-query-performance.txt
blog/2023/05/18/apache-pinot-tutorial-for-getting-started-a-step-by-step-guide.html
blog/2023/05/18/apache-pinot-tutorial-for-getting-started-a-step-by-step-guide.txt
blog/2023/05/23/change-data-capture-with-apache-pinot-how-does-it-work.html
blog/2023/05/23/change-data-capture-with-apache-pinot-how-does-it-work.txt
blog/2023/05/30/how-to-ingest-streaming-data-from-kafka-to-apache-pinot.html
blog/2023/05/30/how-to-ingest-streaming-data-from-kafka-to-apache-pinot.txt
blog/2023/06/01/real-time-mastodon-usage-with-apache-kafka-apache-pinot-and-streamlit.html
blog/2023/06/01/real-time-mastodon-usage-with-apache-kafka-apache-pinot-and-streamlit.txt
blog/2023/07/12/star-tree-index-in-apache-pinot-part-3-understanding-the-impact-in-real-customer.html
blog/2023/07/12/star-tree-index-in-apache-pinot-part-3-understanding-the-impact-in-real-customer.txt
blog/2023/08/04/segment-compaction-for-upsert-enabled-tables-in-apache-pinot-3f30657aa077.html
blog/2023/08/04/segment-compaction-for-upsert-enabled-tables-in-apache-pinot-3f30657aa077.txt
blog/2023/09/19/Annoucing-Apache-Pinot-1-0.html
blog/2023/09/19/Annoucing-Apache-Pinot-1-0.txt
blog/2024/08/07/AWSxPinot.html
blog/2024/08/07/AWSxPinot.txt
blog/page/1.html
blog/page/1.txt
blog/page/10.html
blog/page/10.txt
blog/page/2.html
blog/page/2.txt
blog/page/3.html
blog/page/3.txt
blog/page/4.html
blog/page/4.txt
blog/page/5.html
blog/page/5.txt
blog/page/6.html
blog/page/6.txt
blog/page/7.html
blog/page/7.txt
blog/page/8.html
blog/page/8.txt
blog/page/9.html
blog/page/9.txt

@Jackie-Jiang Jackie-Jiang reopened this Jan 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants