Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft Nuclia Documentation #1529

Draft
wants to merge 3 commits into
base: 6.0
Choose a base branch
from
Draft
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
85 changes: 85 additions & 0 deletions nuclia.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,85 @@
# Replacing Sphinx Search with Nuclia Search in Plone
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

## Overview
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

This documentation outlines the process of replacing Sphinx search in a Plone site with Nuclia search. The goal is to explain the approach taken and provide instructions for replicating the process in another project.
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

## Approach

### Data Indexing and Processing
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

To achieve the transition, the following steps were taken:
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

1. **Generate Nuclia Sync Data:**
- Iterate through `.md` files in the repository [docs](https://github.com/plone/documentation/tree/6.0/docs) .
- Calculate the hash of each file and store the mapping in `nuclia_sync.json`.

2. **Extract Headings:**
- Use regular expressions to extract headings from Markdown files.
- Store the extracted headings and slugs for document URLs.

3. **Upload Documents:**
- Utilize the `NucliaUpload` class from the `nuclia` SDK.
- Generated slugs, Extracted headings, Nuclia `API_KEY` and constructed URLs for `.md` can be passed in the class method.

4. **Synchronize Documents:**
- Compare hashes to determine whether to upload, update, or delete documents, when changes are made.
- Handle document deletions using the `NucliaKB` class.

5. **GitHub Actions Workflow:**
- Created a synchronization workflow triggered by a `push` event.
- Defined the steps for checking out code, setting up the Python environment, running the sync script, and committing changes back to the repository.
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

## Replication Steps

Follow these steps to replicate the process in your own project:

### Set Up
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

4 spaces to indent things.

Suggested change
### Set Up
### Pre-requisites
- See {ref}`installation` for requirements to build Plone 6 Documentation.
- See `Training <inv:training:std:lable:installation>` for requirements to build Plone Training.
- [insert requirements for Nuclia and anything else as additional list items]


Copy this in your terminal to clone plone/documentation

```bash
git clone `<repository-url>`

pip install -q -r requirements-initial.txt

pip install -q -r requirements.txt
```
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

### Global Configuration
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

1. **PUBLIC_URL:**
- Set the URL of your website
2. **API Key:**
- Obtain the API key from your nuclia Knowledge Box.
3. **Knowledge Base URL:**
- Set up the variable to define the Nuclia knowledge box URL.
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

### Indexing and Syncing
justdaksh marked this conversation as resolved.
Show resolved Hide resolved

1. **Generate Nuclia Sync Data:**

```bash
python3 upload.py
```

2. **Sync Documents:**

- Run the `sync` function to upload, update, or delete documents in Nuclia.

justdaksh marked this conversation as resolved.
Show resolved Hide resolved
### GitHub Actions

- **Workflow Setup:**
Modify the GitHub Actions workflow `nuclia_sync.yml` to match your repository structure.

>## Usage Notes
>
>- Ensure that the API key and knowledge base URL are correctly configured.
>- Regularly update the sync process to keep the knowledge base up to date.
>- Troubleshoot issues by checking API key validity and document URLs.

### Conclusion

Replacing Sphinx search with Nuclia search brings improved search functionality to Plone sites. Feel free to reach out for assistance or clarification on any aspect of this documentation.

Happy syncing and searching!
justdaksh marked this conversation as resolved.
Show resolved Hide resolved
Loading