Using the AI Chat
LME's AI chat lets you interact with a large language model (LLM) directly from the dashboard. You can ask security questions, get AI analysis of alerts, and query the LME documentation. Everything runs on your own infrastructure — no data is sent externally unless you configure a cloud model.
How to Send a Message
- Type your question in the text area at the bottom of the chat panel
- Press Enter to send (or click the arrow button)
- Your message appears as a blue bubble on the right
- A thinking animation plays while the AI processes
- The AI response appears as a gray bubble on the left, with text streaming in word by word
Keyboard shortcuts:
- Enter — send the message
- Shift+Enter — insert a new line (for multi-line messages)
- Escape — clear the search bar (in the alerts area)
RAG Mode: Documentation-Grounded Answers
RAG (Retrieval-Augmented Generation) is a mode that searches the LME documentation before answering. This grounds the AI's response in real documentation rather than general knowledge.
How RAG works
When you send a message:
- Your question is converted to a vector embedding by the embeddings server
- The embedding is compared against all LME documentation chunks stored in pgvector
- The top 10 most relevant documentation passages are retrieved (filtered by a minimum similarity threshold of 0.55 and a minimum length of 200 characters)
- Those passages are included as context in the prompt sent to the LLM
- The AI answers based on the documentation and its own training data
- Source cards appear below the response showing:
- Section title of each matched document
- URL to the documentation page (clickable, opens in a new tab)
- Similarity percentage (how relevant the match was)
- A short excerpt from the matched passage
If no relevant documentation is found the LLM will provide a link to the LME docs website.
Analyzing Alerts with AI
From the Alert Card
- Go to the Alerts view
- Find the alert you want to investigate
- Click the "Analyze" button on the alert card
- An "Analyzing..." indicator appears
- The AI receives the full alert JSON (trimmed to 4KB) and responds with exactly three sections:
- What happened — a plain-English explanation of the alert
- Risk — how serious this is
- Action — specific steps you should take
- The analysis appears in a colored box directly below the alert card
This works on all four alert tabs (Kibana, Wazuh, Sysmon, Windows Defender).
From the Chat Panel
You can also paste alert data into the chat:
- Click "Details" on an alert card to expand the JSON
- Copy the JSON
- Paste it into the chat with a question like: "Analyze this alert and tell me if it's a real threat"
- The AI breaks down the alert and gives you guidance
This is useful when you want a follow-up conversation about a specific alert.
Clearing Conversations
Click the "Clear" button at the top of the chat panel to reset the conversation history and start fresh.
Conversation history is also cleared when you refresh the page.
Tips for Better Results
Be Specific
| Less effective | More effective |
|---|---|
| "Help with alerts" | "I see 50 Sysmon Event ID 1 alerts from PC-04 in the last hour — is this normal?" |
| "Explain this" | "What does this Wazuh level 12 alert mean and should I be concerned?" |
| "How does LME work" | "How does LME collect Windows event logs from endpoints?" |
Provide Context When Analyzing Alerts
Include as much context as you can:
- Use the Analyze button (sends the full alert automatically) rather than manually describing the alert
- Mention whether this is a one-time event or recurring
- Mention what kind of environment this is (production, test, etc.)
Understand Model Limitations
The default local model (LFM2.5 1.2B) is small and fast but has limited reasoning. For complex analysis:
- Download a larger local model (7B+ parameters) — see Managing Models
- Connect a cloud model (GPT-4, Claude) for the best quality
- See Managing Models for instructions
Privacy
- Local models: All processing happens on your LME server. No data leaves your network.
- Cloud models: If you configure a cloud model, your prompts (including alert data) are sent to that cloud provider. Only use cloud models if your organization's data handling policies allow it.
- RAG data: The documentation in pgvector comes from the public LME docs website. No sensitive data is stored in the RAG database.