Import documents into CMS through RSS feeds

Hi,

I am trying to import data from the RSS feeds generated by my WordPress site. My requirements are:

  • Set up a cron job that imports the feeds into documents in Hippo CMS once every day.

  • The documents will all have the same document type, with fields that map to the RSS feed elements.

So, is there a way to import or create the documents dynamically in Hippo CMS using RSS feeds as an external source?

Thanks in advance!

RK

Yes, there are many ways to do it.
At a high level, you basically have two options:

  1. Implement a REST service in Hippo that imports data from the RSS feeds and is invoked periodically.
  2. Or implement a Groovy updater script that imports data from the RSS feeds and is run periodically.

For option #1, you can choose either the Repository JAX-RS Component solution [1] or the HST Plain JAX-RS Service solution [2]. With both approaches, you have to configure a cron job somewhere externally.
At the implementation level, the Repository JAX-RS Component approach does not let you use HST-2 APIs (e.g., HST-2’s WorkflowPersistenceManager [3]), unlike the HST Plain JAX-RS Service approach. You need to stick with either the low-level JCR API or the Repository Workflow API instead. In that case, Content-EXIM’s DocumentManager helps a lot, as it provides a high-level abstraction for you. [4]

For option #2, take a look at [5]. You can schedule the script with the forge module [6]. As the scheduler works in the repository itself, this option is more convenient and cluster-safe.
With this option, you can’t use the HST-2 API either. You need to stick with either the JCR API or the Repository Workflow API, possibly complemented by the Content-EXIM core library (e.g., DocumentManager).
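
Whichever option you pick, the import first has to read the feed items into plain values that can be mapped to the document fields. Here is a minimal sketch of that step only, using just the JDK's built-in XML parser. The names RssFeedParser and FeedItem are hypothetical, the feed is assumed to be plain RSS 2.0 (as WordPress typically produces), and writing the values into the repository (via JCR, the Repository Workflow API, or Content-EXIM's DocumentManager) is deliberately left out:

```java
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

/** Hypothetical helper (not part of Hippo): reads RSS 2.0 items into plain values. */
final class RssFeedParser {

    /** Hypothetical holder for the feed values that would map to document fields. */
    static final class FeedItem {
        final String title;
        final String link;
        final String description;
        final String pubDate;

        FeedItem(String title, String link, String description, String pubDate) {
            this.title = title;
            this.link = link;
            this.description = description;
            this.pubDate = pubDate;
        }
    }

    /** Parses every <item> element of the feed; missing child elements become "". */
    static List<FeedItem> parse(InputStream in) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // The feed is external input, so reject DOCTYPE declarations (guards against XXE).
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(in);

        List<FeedItem> items = new ArrayList<>();
        NodeList itemNodes = doc.getElementsByTagName("item");
        for (int i = 0; i < itemNodes.getLength(); i++) {
            Element item = (Element) itemNodes.item(i);
            items.add(new FeedItem(
                    childText(item, "title"),
                    childText(item, "link"),
                    childText(item, "description"),
                    childText(item, "pubDate")));
        }
        return items;
    }

    /** Returns the trimmed text of the first matching descendant element, or "" if absent. */
    private static String childText(Element parent, String tagName) {
        NodeList nodes = parent.getElementsByTagName(tagName);
        return nodes.getLength() == 0 ? "" : nodes.item(0).getTextContent().trim();
    }
}
```

In a real job, the InputStream would come from the feed URL, and each FeedItem would then be turned into a document of your document type with one of the APIs mentioned above.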

Regards,

Woonsan

[1] Repository JAX-RS Service (Bloomreach Experience Manager documentation)
[2] RESTful API Support - Plain JAX-RS Services (Bloomreach Experience Manager documentation)
[3] Hst Component Persistable annotation and workflow (Bloomreach Experience Manager documentation)
[4] https://onehippo-forge.github.io/content-export-import/tutorials-import-docs.html
[5] Updater Scripts (Bloomreach Experience Manager documentation)
[6] https://onehippo-forge.github.io/hippo-utilities/repository/updater-exec-sched-job.html