Monday, June 23, 2025

What are @classmethod, @staticmethod and regular instance methods in Python?

In Python, @classmethod is a decorator used to define a method that is bound to the class and not the instance. It’s part of Python’s method types alongside @staticmethod and regular instance methods.

What is @classmethod?

A @classmethod takes the class itself (cls) as its first argument, instead of self.

It can access and modify class state that applies across all instances.


class Book:

    count = 0


    def __init__(self, title):

        self.title = title

        Book.count += 1


    @classmethod

    def get_count(cls):

        return cls.count


Book("A Tale of Two Cities")

Book("1984")


print(Book.get_count())  # Output: 2



Here, get_count() is a class method used to read a class-level variable, not tied to a single instance.



🧠 When to Use @classmethod

Factory methods (from_*) that return class instances:



class User:

    def __init__(self, name, age):

        self.name = name

        self.age = age


    @classmethod

    def from_string(cls, user_str):

        name, age = user_str.split(',')

        return cls(name, int(age))


user = User.from_string("Alice,30")


Accessing or modifying class-level state

Supporting inheritance-aware behavior (when called on a subclass, cls is that subclass)

What is @staticmethod?

A @staticmethod receives neither self nor cls. It is a plain function namespaced inside a class, useful for utility logic that belongs with the class but doesn't touch class or instance state:


class MathUtils:

    @staticmethod

    def square(x):

        return x * x


print(MathUtils.square(4))  # Output: 16
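
For comparison, a regular instance method is bound to a specific object and receives it as self:


class Page:
    def __init__(self, text):
        self.text = text

    def word_count(self):  # regular instance method: works with this object's state
        return len(self.text.split())


print(Page("It was the best of times").word_count())  # Output: 6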


Summary

Use @classmethod when you need access to the class (cls)

Use @staticmethod for utility functions that don’t need class or instance

Use regular instance methods for behavior tied to an object



references:

OpenAI 


Saturday, June 21, 2025

What is Gecko Evaluator

The world of generative AI is moving fast, with models like Lyria, Imagen, and Veo now capable of producing stunningly realistic and imaginative images and videos from simple text prompts. However, evaluating these models is still a steep challenge. Traditional human evaluation, while the gold standard, can be slow and costly, hindering rapid development cycles.

To address this, we're thrilled to introduce Gecko, now available through Google Cloud’s Vertex AI Evaluation Service. Gecko is a rubric-based and interpretable autorater for evaluating generative AI models that empowers developers with a more nuanced, customizable, and transparent way to assess the performance of image and video generation models.

The challenge of evaluating generative models with auto-raters

Creating useful, performant auto-raters becomes harder as generation quality improves. While specialised models can be efficient, they lack the interpretability developers need to understand model behavior and pinpoint areas for improvement. For instance, when evaluating how accurately a generated image depicts a prompt, a single score doesn't reveal why a model succeeded or failed.

Gecko offers a fine-grained, interpretable, and customizable auto-rater. This Google DeepMind research paper shows that such an auto-rater can reliably evaluate image and video generation across a range of skills, reducing the dependency on costly human judgment. Notably, beyond its interpretability, Gecko exhibits strong performance and has already been instrumental in benchmarking the progress of leading models like Imagen.

Gecko makes evaluation interpretable with its clear, step-by-step rubric-based approach. Let's take an example and use Gecko to evaluate the generated media of a cup of coffee and a croissant on a table.

Step 1: Semantic prompt decomposition.

Gecko leverages a Gemini model to first break down the input text prompt into key semantic elements that need to be verified in the generated media. This includes identifying entities, their attributes, and the relationships between them.

For the running example, the prompt is broken down into keywords: Steaming, cup of coffee, croissant, table.

Step 2: Question generation.

Based on the decomposed prompt, the Gemini model then generates a series of question-answer pairs. These questions are specifically designed to probe the generated image or video for the presence and accuracy of the identified elements and relationships. Optionally, Gemini can provide justifications for why a particular answer is correct, further enhancing transparency.

Let’s take a look at the running example and generate question-answer pairs for each keyword. For the keyword Steaming, the question-answer pair is ‘is the cup of coffee steaming? [‘yes’, ‘no’]’ with the ground-truth answer ‘yes’.

Step 3: Scoring

Finally, the Gemini model scores the generated media against each question-answer pair. These individual scores are then aggregated to produce a final evaluation score.

For the running example, all questions were found to be correct, giving a perfect final score.

Evaluate with Gecko on Vertex AI

Gecko is now available via the Gen AI Evaluation Service in Vertex AI, empowering you to evaluate image or video generative models. Here's how you can get started with Gecko evaluation for images and videos on Vertex AI:

First, you'll need to set up configurations for both rubric generation and rubric validation.


# Rubric Generation

rubric_generation_config = RubricGenerationConfig(

    prompt_template=RUBRIC_GENERATION_PROMPT,

    parsing_fn=parse_json_to_qa_records,

)

# Rubric Validation

pointwise_metric = PointwiseMetric(

    metric="gecko_metric",

    metric_prompt_template=RUBRIC_VALIDATOR_PROMPT,

    custom_output_config=CustomOutputConfig(

        return_raw_output=True,

        parsing_fn=parse_rubric_results,

    ),

)

# Rubric Metric

rubric_based_gecko = RubricBasedMetric(

    generation_config=rubric_generation_config,

    critique_metric=pointwise_metric,

)


Next, prepare your dataset for evaluation. This involves creating a Pandas DataFrame with columns for your prompts and the corresponding generated images or videos.


import pandas as pd


prompts = [

    "steaming cup of coffee and a croissant on a table",

    "steaming cup of coffee and toast in a cafe",

    # ... more prompts

]

images = [

    '{"contents": [{"parts": [{"file_data": {"mime_type": "image/png", "file_uri": "gs://cloud-samples-data/generative-ai/evaluation/images/coffee.png"}}]}]}',

    '{"contents": [{"parts": [{"file_data": {"mime_type": "image/png", "file_uri": "gs://cloud-samples-data/generative-ai/evaluation/images/coffee.png"}}]}]}',

    # ... more image URIs

]

eval_dataset = pd.DataFrame(

    {

        "prompt": prompts,

        "image": images, # or "video": videos for video evaluation

    }

)


Now, you can generate the rubrics based on your prompts using the configured rubric_based_gecko metric.
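
A minimal sketch of the rubric-generation step, assuming the RubricBasedMetric exposes a generate_rubrics helper as in the Vertex AI Gen AI Evaluation samples (check the SDK docs for the exact method name in your version):


dataset_with_rubrics = rubric_based_gecko.generate_rubrics(eval_dataset)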


Finally, run the evaluation using the generated rubrics and your dataset. The evaluate method of EvalTask will use the rubric validator to score the generated content.


eval_task = EvalTask(

    dataset=dataset_with_rubrics,

    metrics=[rubric_based_gecko],

)

eval_result = eval_task.evaluate(response_column_name="image") # or "video"
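
Once the run completes, the returned result object typically exposes aggregate scores and a per-row table; the attribute names below follow the Gen AI Evaluation SDK but may differ by version:


print(eval_result.summary_metrics)       # aggregate Gecko scores (assumed attribute)
print(eval_result.metrics_table.head())  # per-prompt rubric outcomes (assumed attribute)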



What are Chirp 3 HD Voices?

Chirp 3 HD voices are the latest generation of Google's Text-to-Speech technology. Powered by cutting-edge LLMs, they deliver an unparalleled level of realism and emotional resonance.


def synthesize_text():

    """Synthesizes speech from the input string of text."""

    from google.cloud import texttospeech


    text = "Hello there."

    client = texttospeech.TextToSpeechClient()


    input_text = texttospeech.SynthesisInput(text=text)


    # Note: the voice can also be specified by name.

    # Names of voices can be retrieved with client.list_voices().

    voice = texttospeech.VoiceSelectionParams(

        language_code="en-US",

        name="en-US-Chirp3-HD-Charon",

    )


    audio_config = texttospeech.AudioConfig(

        audio_encoding=texttospeech.AudioEncoding.MP3

    )


    response = client.synthesize_speech(

        input=input_text,

        voice=voice,

        audio_config=audio_config,

    )


    # The response's audio_content is binary.

    with open("output.mp3", "wb") as out:

        out.write(response.audio_content)

        print('Audio content written to file "output.mp3"')



Scripting and prompting tips

Creating engaging and natural-sounding audio from text requires understanding the nuances of spoken language and translating them into script form. The following tips will help you craft scripts that sound authentic and capture the chosen tone.


Understanding the Goal: Natural Speech

The primary objective is to make the synthesized voice sound as close to a natural human speaker as possible. This involves:


Mimicking Natural Pacing: How quickly or slowly someone speaks.

Creating Smooth Flow: Ensuring seamless transitions between sentences and phrases.

Adding Realistic Pauses: Incorporating pauses for emphasis and clarity.

Capturing Conversational Tone: Making the audio sound like a real conversation.

Key Techniques for Natural Speech

Punctuation for Pacing and Flow


Periods (.): Indicate a full stop and a longer pause. Use them to separate complete thoughts and create clear sentence boundaries.

Commas (,): Signal shorter pauses within sentences. Use them to separate clauses, list items, or introduce brief breaks for breath.

Ellipses (...): Represent a longer, more deliberate pause. They can indicate trailing thoughts, hesitation, or a dramatic pause.

Example: "And then... it happened."

Hyphens (-): Can be used to indicate a brief pause or a sudden break in thought.

Example: "I wanted to say - but I couldn't."

Incorporating Pauses and Disfluencies


Strategic Pauses: Use ellipses, commas, or hyphens to create pauses in places where a human speaker would naturally pause for breath or emphasis.

Disfluencies (Ums and Uhs): While some Text-to-Speech models handle disfluencies automatically, understanding their role is crucial. They add authenticity and make the speech sound less robotic. Even if the model adds them, being aware of where they would naturally occur in human speech helps you understand the overall flow of your script.

Experimentation and Iteration


Re-synthesizing: Don't be afraid to re-synthesize the same message with the same voice multiple times. Minor tweaks to punctuation, spacing, or word choice can significantly impact the final audio (see the sketch after this list).

Listen Critically: Pay close attention to the pacing, flow, and overall tone of the synthesized audio. Identify areas that sound unnatural and adjust your script accordingly.

Voice Variation: If the system allows for it, try using different voices to see which one best suits your script and chosen tone.
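
To make that iteration loop concrete, here is a minimal sketch that re-synthesizes a few punctuation variants of the same line, reusing the client setup from the synthesize_text example above (the variant texts and output file names are illustrative):


from google.cloud import texttospeech

client = texttospeech.TextToSpeechClient()
voice = texttospeech.VoiceSelectionParams(
    language_code="en-US", name="en-US-Chirp3-HD-Charon"
)
audio_config = texttospeech.AudioConfig(
    audio_encoding=texttospeech.AudioEncoding.MP3
)

# A few punctuation variants of the same message; listen and compare the pacing.
variants = {
    "flat": "The product is now available. We have new features. It is very exciting.",
    "paced": "The product is now available... and we've added some exciting new features.",
    "casual": "It's, well, it's very exciting.",
}

for name, script in variants.items():
    response = client.synthesize_speech(
        input=texttospeech.SynthesisInput(text=script),
        voice=voice,
        audio_config=audio_config,
    )
    with open(f"variant_{name}.mp3", "wb") as out:
        out.write(response.audio_content)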

Practical Scripting Tips


Read Aloud: Before synthesizing, read your script aloud. This will help you identify awkward phrasing, unnatural pauses, and areas that need adjustment.

Write Conversationally: Use contractions (e.g., "it's," "we're") and informal language to make the script sound more natural.

Consider the Context: The tone and pacing of your script should match the context of the audio. A formal presentation will require a different approach than a casual conversation.

Break Down Complex Sentences: Long, convoluted sentences can be difficult for TTS engines to handle. Break them down into shorter, more manageable sentences.

Sample Script Improvements


Original Script (Robotic): "The product is now available. We have new features. It is very exciting."


Improved Script (Natural): "The product is now available... and we've added some exciting new features. It's, well, it's very exciting."


Original Script (Robotic): "This is an automated confirmation message. Your reservation has been processed. The following details pertain to your upcoming stay. Reservation number is 12345. Guest name registered is Anthony Vasquez Arrival date is March 14th. Departure date is March 16th. Room type is Deluxe Suite. Number of guests is 1 guest. Check-in time is 3 PM. Check-out time is 11 AM. Please note, cancellation policy requires notification 48 hours prior to arrival. Failure to notify within this timeframe will result in a charge of one night's stay. Additional amenities included in your reservation are: complimentary Wi-Fi, access to the fitness center, and complimentary breakfast. For any inquiries, please contact the hotel directly at 855-555-6689 Thank you for choosing our hotel."


Improved Script (Natural): "Hi Anthony Vasquez! We're so excited to confirm your reservation with us! You're all set for your stay from March 14th to March 16th in our beautiful Deluxe Suite. That's for 1 guest. Your confirmation number is 12345, just in case you need it.


So, just a quick reminder, check-in is at 3 PM, and check-out is at, well, 11 AM.


Now, just a heads-up about our cancellation policy… if you need to cancel, just let us know at least 48 hours before your arrival, okay? Otherwise, there'll be a charge for one night's stay.


And to make your stay even better, you'll have complimentary Wi-Fi, access to our fitness center, and a delicious complimentary breakfast each morning!


If you have any questions at all, please don't hesitate to call us at 855-555-6689. We can't wait to welcome you to the hotel!"


Explanation of Changes:


The ellipses (...) create a pause for emphasis.

"and we've" uses a contraction for a more conversational tone.

"It's, well, it's very exciting" adds a small amount of disfluency, and emphasis.

"Okay?" friendly reminder softens tone.

By following these guidelines, you can create text-to-audio scripts that sound natural, engaging, and human-like. Remember that practice and experimentation are key to mastering this skill.




What is Graph Data Science (GDS) in Neo4j?

 Graph Data Science (GDS) in Neo4j refers to a powerful library and framework that allows data scientists to leverage the inherent connectedness of graph data to gain deeper insights, improve predictions, and enhance machine learning models. It goes beyond simple querying to apply advanced analytical techniques directly on your graph.

Here's a breakdown of what GDS is and why it's so valuable:

What is Neo4j Graph Data Science (GDS)?

At its core, Neo4j GDS is a library of highly optimized graph algorithms, graph transformations, and machine learning pipelines that operate directly within or in conjunction with a Neo4j graph database. It's designed to support the full data science workflow, from data preparation and feature engineering to model training and deployment, all within the context of graph structures.

Key Components and Concepts of GDS:

  1. Graph Algorithms: This is the heart of GDS. It provides efficient, parallel implementations of a wide range of algorithms categorized into:

    • Centrality Algorithms: (e.g., PageRank, Betweenness Centrality, Degree Centrality) Identify the most important or influential nodes in a network.
    • Community Detection Algorithms: (e.g., Louvain, Label Propagation, Connected Components) Discover groups or clusters of densely connected nodes.
    • Similarity Algorithms: (e.g., Node Similarity, Jaccard Similarity) Find how similar nodes or relationships are to each other.
    • Pathfinding Algorithms: (e.g., Dijkstra, A*, Shortest Path) Find the shortest or most optimal paths between nodes.
    • Node Embedding Algorithms: (e.g., Node2Vec, GraphSAGE) Transform graph structures into numerical vector representations (embeddings) that capture the context and relationships of nodes, making them suitable for traditional machine learning models.
    • Link Prediction Algorithms: (e.g., Adamic-Adar, Preferential Attachment) Predict the likelihood of new connections forming between nodes.
    • Topological Algorithms: (e.g., Topological Sort for DAGs, Triangle Count) Analyze the structural properties of the graph.
  2. Graph Projections (In-Memory Graphs):

    • To run algorithms efficiently, GDS typically projects a portion of your Neo4j database into an optimized, in-memory graph format. This allows algorithms to run at high speed without constantly hitting the disk.
    • You can control which nodes, relationships, and properties are included in the projection, allowing you to focus on specific subgraphs relevant to your analysis.
  3. Machine Learning Pipelines:

    • GDS provides end-to-end pipelines for common graph machine learning tasks like node classification, link prediction, and node regression.
    • These pipelines streamline the process of feature engineering (using graph algorithms), training models (e.g., Logistic Regression, Random Forest), and making predictions directly on the graph.
  4. Integration with Data Ecosystems:

    • Cypher Procedures: Most GDS functionality is exposed through Cypher procedures, meaning you can call graph algorithms directly from your Cypher queries within the Neo4j Browser or via any Neo4j driver.
    • GDS Python Client (graphdatascience): This client library allows data scientists to interact with GDS directly from Python, enabling integration with popular Python data science tools and workflows.
    • Connectors: Neo4j provides connectors for integrating with data warehouses (Snowflake, BigQuery), BI tools (Power BI, Tableau), and other data platforms.
  5. Editions:

    • GDS is available in a Community Edition (open source with full algorithms but some operational limits) and an Enterprise Edition (optimized for large-scale production deployments, clustering, and advanced features).
    • Neo4j AuraDS: This is a fully managed cloud service that provides GDS capabilities without the need for self-hosting.

Typical GDS Workflow (a Python sketch follows this list):

  1. Load Data: Get your connected data into Neo4j.
  2. Project Graph: Create an in-memory graph projection (a subset or the whole graph) from your Neo4j database.
  3. Run Algorithm(s): Execute relevant graph algorithms on the projected graph.
  4. Analyze/Mutate/Write Back:
    • Stream: Get the results back immediately as a Cypher result set for analysis.
    • Mutate: Update the in-memory projected graph with the algorithm's results (e.g., add PageRank scores as a node property).
    • Write Back: Write the results (e.g., new node properties, relationships) back to the persistent Neo4j database for long-term storage or use in applications.
  5. (Optional) ML Pipelines: Use the algorithm outputs as features for machine learning pipelines to train models for predictions.
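
A minimal sketch of this workflow using the graphdatascience Python client mentioned above; the connection details, labels, and property names are placeholders to adapt to your own database:


from graphdatascience import GraphDataScience

# 1. Connect to Neo4j (placeholder URI and credentials).
gds = GraphDataScience("bolt://localhost:7687", auth=("neo4j", "password"))

# 2. Project an in-memory graph of Person nodes and KNOWS relationships.
G, _ = gds.graph.project("people", "Person", "KNOWS")

# 3. Run PageRank on the projection and stream the results for analysis.
scores = gds.pageRank.stream(G)
print(scores.sort_values("score", ascending=False).head())

# 4. Write the scores back to the database as a node property.
gds.pageRank.write(G, writeProperty="pagerank")

# Clean up the in-memory projection when done.
G.drop()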

Why is GDS Helpful?

  • Uncover Hidden Insights: Traditional data analysis often struggles with connected data. GDS algorithms can reveal patterns, structures, and influences that are invisible in tabular data.
  • Improve Predictions: Graph-based features (e.g., centrality scores, community memberships, embeddings) can significantly boost the accuracy of machine learning models for tasks like fraud detection, recommendation engines, customer churn prediction, and more.
  • Faster and More Scalable Analysis: GDS algorithms are highly optimized and parallelized, allowing for efficient analysis of large and complex graphs.
  • Native Graph Capabilities: It leverages the strengths of the graph database, where relationships are first-class citizens, making complex queries and multi-hop analysis intuitive and performant.
  • Operationalization: GDS supports the entire data science lifecycle, from exploration to deploying models in production.

In essence, Neo4j GDS empowers data scientists to unlock the full value of their connected data by providing a specialized toolkit for graph-native analytics and machine learning.

What would be a good entity relationship extraction chain based on an LLM?

extraction_prompt_template = """


You are an expert at extracting structured information from technical documentation to build a knowledge graph.

Your task is to identify entities and their relationships based on the provided text chunk.


**Entities to Identify (and their properties):**

- **Document**: The overall document. Properties: `title` (from metadata).

- **Section**: Main sections (e.g., "1. Introduction", "2. Device Discovery Feature Enhancement"). Properties: `title`, `order`.

- **SubSection**: Subsections (e.g., "2.1 Automated Gateway Discovery"). Properties: `title`, `order`.

- **Item**: Specific elements within sections/subsections like tables, code blocks, or figures. Properties: `type` (e.g., "Table", "Figure", "CodeBlock"), `title` (if present), `content` (extract relevant text content if short).

- **Concept**: Key technical terms, features, or ideas (e.g., "Device Discovery", "DUAP API", "Gateway Discovery"). Properties: `name`.

- **Person**: Named individuals (e.g., "Mr. David Chen", "Dr. Evelyn Reed", "Sujay"). Properties: `name`.

- **Team**: Departments or named groups (e.g., "Engineering Team", "Bank Team", "Infrastructure Operations"). Properties: `name`.

- **Vendor**: Third-party companies (e.g., "TechSolutions Inc."). Properties: `name`.

- **Project**: Named projects or initiatives (e.g., "Project Aurora", "Quantum Leap", "Project Zenith"). Properties: `name`.

- **Platform**: Specific software/hardware platforms (e.g., "Core Services"). Properties: `name`.

- **Role**: Job titles or specific roles (e.g., "Chief Technology Officer", "Sponsor"). Properties: `title`.



**Relationship Types (all in CAPS, directional, with example properties if applicable):**

- **Hierarchical/Structural:**

    - `HAS_SECTION`: (Document)-[:HAS_SECTION {{order: 1}}]->(Section) - *The LLM should generate the 'order' integer.*

    - `HAS_SUBSECTION`: (Section)-[:HAS_SUBSECTION {{order: 1}}]->(SubSection) - *The LLM should generate the 'order' integer.*

    - `CONTAINS_ITEM`: (SubSection)-[:CONTAINS_ITEM {{type: "Table", title: "Example Title"}}]->(Item) - *The LLM should generate the 'type' and 'title' strings.*

    - `NEXT_SECTION`: (Section)-[:NEXT_SECTION]->(Section) (for sequential flow of main sections)

    - `NEXT_SUBSECTION`: (SubSection)-[:NEXT_SUBSECTION]->(SubSection)


- **Conceptual/Semantic:**

    - `DISCUSSES`: (Section/SubSection/Item)-[:DISCUSSES]->(Concept)

    - `IMPACTS`: (Concept)-[:IMPACTS]->(Concept)

    - `UTILIZES`: (Concept/Project)-[:UTILIZES]->(Concept/Platform)


- **Organizational/Responsibility:**

    - `LED_BY`: (Project/Team)-[:LED_BY]->(Person) OR (Project)-[:LED_BY]->(Team)

    - `REPORTS_TO`: (Person)-[:REPORTS_TO]->(Person)

    - `SPONSORS`: (Person)-[:SPONSORS]->(Project)

    - `INCLUDES_MODULE`: (Project)-[:INCLUDES_MODULE]->(Module)

    - `CRITICAL_FOR`: (Module)-[:CRITICAL_FOR]->(Project)

    - `DEVELOPED_BY`: (Module)-[:DEVELOPED_BY]->(Team/Vendor)

    - `PROVIDES_SUPPORT_FOR`: (Vendor)-[:PROVIDES_SUPPORT_FOR]->(Project)

    - `MAINTAINED_BY`: (Platform)-[:MAINTAINED_BY]->(Team)

    - `UNDER_DEVELOPMENT_BY`: (Module)-[:UNDER_DEVELOPMENT_BY]->(Team)

    - `WILL_INTEGRATE_WITH`: (Module)-[:WILL_INTEGRATE_WITH]->(Project) (for future plans)


**Text to Analyze**

Document Title: {document_title}

Chunk Content: 

{text}


**Output Format:**

Return a single JSON object with two keys: "nodes" and "relationships".



```json

{{

  "nodes": [

    {{"id": "unique_id_or_name", "label": "NodeLabel", "properties": {{"prop1": "value1", "prop2": "value2"}}}},

    {{"id": "another_id", "label": "AnotherLabel", "properties": {{"prop_a": "value_a", "content": "extracted content"}}}}

  ],


  "relationships": [

    {{"source_id": "source_node_id", "target_id": "target_node_id", "type": "REL_TYPE", "properties": {{"order": 1}}}},

    {{"source_id": "another_source_id", "target_id": "another_target_id", "type": "ANOTHER_REL_TYPE", "properties": {{}}}}

  ]

}}

```

"""


from langchain_core.prompts import ChatPromptTemplate


extraction_prompt = ChatPromptTemplate.from_messages(

    [

        ("system", extraction_prompt_template),

        ("human", "Extract the knowledge graph from the provided text."),

    ]

)
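
To complete the chain, here is a minimal LCEL sketch that pipes this prompt into a chat model and parses the JSON block; the model choice and the fence-stripping helper are assumptions, so adapt them to your setup:


import json
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o", temperature=0)  # assumed model; any chat model works
extraction_chain = extraction_prompt | llm

def extract_graph(document_title: str, text: str) -> dict:
    """Run the chain on one chunk and parse the JSON payload it returns."""
    response = extraction_chain.invoke(
        {"document_title": document_title, "text": text}
    )
    raw = response.content.strip()
    # The prompt asks for a fenced ```json block; strip the fence if present.
    raw = raw.removeprefix("```json").removesuffix("```").strip()
    return json.loads(raw)

nodes_and_rels = extract_graph("Design Doc", "Project Aurora is led by Dr. Evelyn Reed...")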




Wednesday, June 18, 2025

What is Veo3

Veo 3 is Google’s latest AI video generation model, announced at Google I/O 2025. It transforms text or image prompts into high-definition videos, now with native audio integration. This means Veo 3 can generate synchronized dialogue, ambient sounds, and background music, producing clips that feel remarkably lifelike.


At the moment, Veo 3 is only available in the U.S. and only through Flow, Google’s new AI-powered filmmaking interface. To access it, you’ll need an AI Ultra plan, which costs $250/month (about $272 with tax).


Creating an Ad

For my first test, I wanted to create a one-shot ad for a fictional mint brand called Mintro. The idea: something short, punchy, and memorable. I imagined an awkward, relatable moment—something that could work as a quick scroll-stopper.


Here’s the setup: two work colleagues stuck in a crowded elevator, face-to-face, the kind of space where confidence (and fresh breath) matters. To break the tension, one drops a line that’s equal parts tragic and hilarious:


“I once sneezed in the all-hands and clicked ‘share screen’ at the same time. No survivors.”


Then the ad would cut to the Mintro logo, along with the tagline:


“Approved for elevator talk.”


If you want to follow along, use the visual instructions in this image to create a video with Veo 3:


Veo 3 delivers something that feels fundamentally new: coherent, sound-enabled video from natural language prompts. That alone sets it apart from everything else I’ve tested.


Sure, it has its flaws—prompt drift, lack of full Veo 3 access in key tools like Scene Builder, and occasional visual glitches—but the core experience is genuinely exciting.


What stands out is how close it already feels to a usable creative pipeline. With a bit of editing and some careful prompting, you can go from idea to storyboard to a working short project in under a few hours. Add in character consistency (even if it’s a bit fragile), audio baked into the output, and support for modular workflows, and this starts to look like a serious tool.


Veo 3 Best Practices

When you first get access to Veo 3 through Flow, you’ll start with 12,500 credits. Each video generation consumes a chunk of that total—150 credits per generation with Veo 3—so it’s worth being strategic from the start.


My advice: think carefully about your prompts and generate only one output at a time. You’ll need to spread those credits out across the month, and each generation takes time—often 2 to 3 minutes or more. That makes iteration relatively slow, so trial-and-error isn’t cheap or fast.


For prompt crafting, Google provides a Vertex AI video generation prompt guide that offers insights into structuring effective prompts for Veo. This guide emphasizes the importance of clear, descriptive prompts and provides examples to help you get started.


If you’re looking for additional guidance, the Runway Gen-3 Alpha Prompting Guide is a valuable resource. It offers detailed strategies for crafting prompts that yield high-quality video outputs, which can also be beneficial when working with Veo 3.


Agent Development Kit

 Agent Development Kit (ADK) is a flexible and modular framework for developing and deploying AI agents. While optimized for Gemini and the Google ecosystem, ADK is model-agnostic, deployment-agnostic, and is built for compatibility with other frameworks. ADK was designed to make agent development feel more like software development, to make it easier for developers to create, deploy, and orchestrate agentic architectures that range from simple tasks to complex workflows.



pip install google-adk


1. Set up Environment & Install ADK


Create & Activate Virtual Environment (Recommended)


python -m venv .venv
source .venv/bin/activate   # macOS/Linux; on Windows CMD use .venv\Scripts\activate.bat


pip install google-adk


2. Create Agent Project


You will need to create the following project structure:


parent_folder/

    multi_tool_agent/

        __init__.py

        agent.py

        .env


Create the folder multi_tool_agent:


mkdir multi_tool_agent/


__init__.py


Now create an __init__.py file in the folder:


echo "from . import agent" > multi_tool_agent/__init__.py


Your __init__.py should now look like this:


multi_tool_agent/__init__.py


from . import agent


agent.py


Create an agent.py file in the same folder:


touch multi_tool_agent/agent.py


Copy and paste the following code into agent.py:


import datetime

from zoneinfo import ZoneInfo

from google.adk.agents import Agent


def get_weather(city: str) -> dict:

    """Retrieves the current weather report for a specified city.


    Args:

        city (str): The name of the city for which to retrieve the weather report.


    Returns:

        dict: status and result or error msg.

    """

    if city.lower() == "new york":

        return {

            "status": "success",

            "report": (

                "The weather in New York is sunny with a temperature of 25 degrees"

                " Celsius (77 degrees Fahrenheit)."

            ),

        }

    else:

        return {

            "status": "error",

            "error_message": f"Weather information for '{city}' is not available.",

        }



def get_current_time(city: str) -> dict:

    """Returns the current time in a specified city.


    Args:

        city (str): The name of the city for which to retrieve the current time.


    Returns:

        dict: status and result or error msg.

    """


    if city.lower() == "new york":

        tz_identifier = "America/New_York"

    else:

        return {

            "status": "error",

            "error_message": (

                f"Sorry, I don't have timezone information for {city}."

            ),

        }


    tz = ZoneInfo(tz_identifier)

    now = datetime.datetime.now(tz)

    report = (

        f'The current time in {city} is {now.strftime("%Y-%m-%d %H:%M:%S %Z%z")}'

    )

    return {"status": "success", "report": report}



root_agent = Agent(

    name="weather_time_agent",

    model="gemini-2.0-flash",

    description=(

        "Agent to answer questions about the time and weather in a city."

    ),

    instruction=(

        "You are a helpful agent who can answer user questions about the time and weather in a city."

    ),

    tools=[get_weather, get_current_time],

)


Create a .env file in the same folder:


touch multi_tool_agent/.env



3. Set up the model

Your agent's ability to understand user requests and generate responses is powered by a Large Language Model (LLM). Your agent needs to make secure calls to this external LLM service, which requires authentication credentials. Without valid authentication, the LLM service will deny the agent's requests, and the agent will be unable to function.


Get an API key from Google AI Studio.

When using Python, open the .env file located inside (multi_tool_agent/) and copy-paste the following code.


multi_tool_agent/.env


GOOGLE_GENAI_USE_VERTEXAI=FALSE

GOOGLE_API_KEY=PASTE_YOUR_ACTUAL_API_KEY_HERE



4. Run Your Agent


Using the terminal, navigate to the parent directory of your agent project (e.g. using cd ..):


parent_folder/      <-- navigate to this directory

    multi_tool_agent/

        __init__.py

        agent.py

        .env


There are multiple ways to interact with your agent:
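
One option is the terminal runner; the command below assumes the google-adk CLI, so run adk --help to confirm it for your installed version:


adk run multi_tool_agent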

Run the following command to launch the dev UI.


adk web


Step 1: Open the URL provided (usually http://localhost:8000 or http://127.0.0.1:8000) directly in your browser.


Step 2. In the top-left corner of the UI, you can select your agent in the dropdown. Select "multi_tool_agent".


If you do not see "multi_tool_agent" in the dropdown menu, make sure you are running adk web in the parent folder of your agent folder (i.e. the parent folder of multi_tool_agent).


Step 3. Now you can chat with your agent using the textbox:


Step 4. Using the Events tab on the left, you can inspect individual function calls, responses, and model responses by clicking on the actions:


On the Events tab, you can also click the Trace button to see the trace logs for each event, which show the latency of each function call:


Step 5. You can also enable your microphone and talk to your agent:


In order to use voice/video streaming in ADK, you will need to use Gemini models that support the Live API. You can find the model IDs that support the Gemini Live API in the documentation:


Google AI Studio: Gemini Live API

Vertex AI: Gemini Live API

You can then replace the model string in root_agent in the agent.py file you created earlier. Your code should look something like:



root_agent = Agent(

    name="weather_time_agent",

    model="replace-me-with-model-id", #e.g. gemini-2.0-flash-live-001

    ...



Details on GraphRAGExtractor

The GraphRAGExtractor class is designed to extract triples (subject-relation-object) from text and enrich them by adding descriptions for entities and relationships to their properties using an LLM.

This functionality is similar to that of the SimpleLLMPathExtractor, but it includes additional enhancements to handle entity and relationship descriptions. For guidance on implementation, you can look at similar existing extractors.

Here's a breakdown of its functionality:

Key Components:

llm: The language model used for extraction.

extract_prompt: A prompt template used to guide the LLM in extracting information.

parse_fn: A function to parse the LLM's output into structured data.

max_paths_per_chunk: Limits the number of triples extracted per text chunk.

num_workers: For parallel processing of multiple text nodes.

Main Methods:

__call__: The entry point for processing a list of text nodes.

acall: An asynchronous version of __call__ for improved performance.

_aextract: The core method that processes each individual node.

Extraction Process:

For each input node (chunk of text):

It sends the text to the LLM along with the extraction prompt.

The LLM's response is parsed to extract entities, relationships, descriptions for entities and relations.

Entities are converted into EntityNode objects. The entity description is stored in metadata.

Relationships are converted into Relation objects. The relationship description is stored in metadata.

These are added to the node's metadata under KG_NODES_KEY and KG_RELATIONS_KEY.

NOTE: In the current implementation, we are using only relationship descriptions. In the next implementation, we will utilize entity descriptions during the retrieval stage.


import asyncio

import nest_asyncio


nest_asyncio.apply()


from typing import Any, List, Callable, Optional, Union, Dict

from IPython.display import Markdown, display


from llama_index.core.async_utils import run_jobs

from llama_index.core.indices.property_graph.utils import (

    default_parse_triplets_fn,

)

from llama_index.core.graph_stores.types import (

    EntityNode,

    KG_NODES_KEY,

    KG_RELATIONS_KEY,

    Relation,

)

from llama_index.core.llms.llm import LLM

from llama_index.core.prompts import PromptTemplate

from llama_index.core.prompts.default_prompts import (

    DEFAULT_KG_TRIPLET_EXTRACT_PROMPT,

)

from llama_index.core.schema import TransformComponent, BaseNode

from llama_index.core.bridge.pydantic import BaseModel, Field



class GraphRAGExtractor(TransformComponent):

    """Extract triples from a graph.


    Uses an LLM and a simple prompt + output parsing to extract paths (i.e. triples) and entity, relation descriptions from text.


    Args:

        llm (LLM):

            The language model to use.

        extract_prompt (Union[str, PromptTemplate]):

            The prompt to use for extracting triples.

        parse_fn (callable):

            A function to parse the output of the language model.

        num_workers (int):

            The number of workers to use for parallel processing.

        max_paths_per_chunk (int):

            The maximum number of paths to extract per chunk.

    """


    llm: LLM

    extract_prompt: PromptTemplate

    parse_fn: Callable

    num_workers: int

    max_paths_per_chunk: int


    def __init__(

        self,

        llm: Optional[LLM] = None,

        extract_prompt: Optional[Union[str, PromptTemplate]] = None,

        parse_fn: Callable = default_parse_triplets_fn,

        max_paths_per_chunk: int = 10,

        num_workers: int = 4,

    ) -> None:

        """Init params."""

        from llama_index.core import Settings


        if isinstance(extract_prompt, str):

            extract_prompt = PromptTemplate(extract_prompt)


        super().__init__(

            llm=llm or Settings.llm,

            extract_prompt=extract_prompt or DEFAULT_KG_TRIPLET_EXTRACT_PROMPT,

            parse_fn=parse_fn,

            num_workers=num_workers,

            max_paths_per_chunk=max_paths_per_chunk,

        )


    @classmethod

    def class_name(cls) -> str:

        return "GraphExtractor"


    def __call__(

        self, nodes: List[BaseNode], show_progress: bool = False, **kwargs: Any

    ) -> List[BaseNode]:

        """Extract triples from nodes."""

        return asyncio.run(

            self.acall(nodes, show_progress=show_progress, **kwargs)

        )


    async def _aextract(self, node: BaseNode) -> BaseNode:

        """Extract triples from a node."""

        assert hasattr(node, "text")


        text = node.get_content(metadata_mode="llm")

        try:

            llm_response = await self.llm.apredict(

                self.extract_prompt,

                text=text,

                max_knowledge_triplets=self.max_paths_per_chunk,

            )

            entities, entities_relationship = self.parse_fn(llm_response)

        except ValueError:

            entities = []

            entities_relationship = []


        existing_nodes = node.metadata.pop(KG_NODES_KEY, [])

        existing_relations = node.metadata.pop(KG_RELATIONS_KEY, [])

        metadata = node.metadata.copy()

        for entity, entity_type, description in entities:

            metadata[

                "entity_description"

            ] = description  # Not used in the current implementation. But will be useful in future work.

            entity_node = EntityNode(

                name=entity, label=entity_type, properties=metadata

            )

            existing_nodes.append(entity_node)


        metadata = node.metadata.copy()

        for triple in entities_relationship:

            subj, obj, rel, description = triple

            subj_node = EntityNode(name=subj, properties=metadata)

            obj_node = EntityNode(name=obj, properties=metadata)

            metadata["relationship_description"] = description

            rel_node = Relation(

                label=rel,

                source_id=subj_node.id,

                target_id=obj_node.id,

                properties=metadata,

            )


            existing_nodes.extend([subj_node, obj_node])

            existing_relations.append(rel_node)


        node.metadata[KG_NODES_KEY] = existing_nodes

        node.metadata[KG_RELATIONS_KEY] = existing_relations

        return node


    async def acall(

        self, nodes: List[BaseNode], show_progress: bool = False, **kwargs: Any

    ) -> List[BaseNode]:

        """Extract triples from nodes async."""

        jobs = []

        for node in nodes:

            jobs.append(self._aextract(node))


        return await run_jobs(

            jobs,

            workers=self.num_workers,

            show_progress=show_progress,

            desc="Extracting paths from text",

        )
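
A usage sketch for the extractor above; the prompt template and parse function names are hypothetical placeholders you would define to match your LLM's output format, and llm is any LlamaIndex LLM (for example the OpenAI instance configured in the next section):


kg_extractor = GraphRAGExtractor(
    llm=llm,                                 # any LlamaIndex LLM
    extract_prompt=KG_TRIPLET_EXTRACT_TMPL,  # hypothetical PromptTemplate asking for triples + descriptions
    parse_fn=parse_fn,                       # hypothetical parser returning (entities, relationships)
    max_paths_per_chunk=2,
    num_workers=4,
)

# Running it over parsed nodes attaches EntityNode / Relation objects to each node's metadata.
nodes_with_kg = kg_extractor(nodes, show_progress=True)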



How to use GraphRAG with LLamaIndex ?

GraphRAG (Graphs + Retrieval Augmented Generation) combines the strengths of Retrieval Augmented Generation (RAG) and Query-Focused Summarization (QFS) to effectively handle complex queries over large text datasets. While RAG excels in fetching precise information, it struggles with broader queries that require thematic understanding, a challenge that QFS addresses but cannot scale well. GraphRAG integrates these approaches to offer responsive and thorough querying capabilities across extensive, diverse text corpora.

This notebook provides guidance on constructing the GraphRAG pipeline using the LlamaIndex PropertyGraph abstractions.

GraphRAG Approach

GraphRAG involves two steps:

Graph Generation - Creates a graph, then builds communities and their summaries over the given documents.

Answering the Query - Uses the community summaries created in step 1 to answer the query.

Graph Generation:

Source Documents to Text Chunks: Source documents are divided into smaller text chunks for easier processing.

Text Chunks to Element Instances: Each text chunk is analyzed to identify and extract entities and relationships, resulting in a list of tuples that represent these elements.

Element Instances to Element Summaries: The extracted entities and relationships are summarized into descriptive text blocks for each element using the LLM.

Element Summaries to Graph Communities: These entities, relationships and summaries form a graph, which is subsequently partitioned into communities using the Hierarchical Leiden algorithm to establish a hierarchical structure.

Graph Communities to Community Summaries: The LLM generates summaries for each community, providing insights into the dataset’s overall topical structure and semantics.

Answering the Query:

Community Summaries to Global Answers: The summaries of the communities are utilized to respond to user queries. This involves generating intermediate answers, which are then consolidated into a comprehensive global answer.

GraphRAG Pipeline Components

Here are the different components we implemented to build all of the processes mentioned above.

Source Documents to Text Chunks: Implemented using SentenceSplitter with a chunk size of 1024 and chunk overlap of 20 tokens.

Text Chunks to Element Instances AND Element Instances to Element Summaries: Implemented using GraphRAGExtractor.

Element Summaries to Graph Communities AND Graph Communities to Community Summaries: Implemented using GraphRAGStore.

Community Summaries to Global Answers: Implemented using GraphQueryEngine.

Let's check into each of these components and build GraphRAG pipeline.

Installation

graspologic provides the hierarchical_leiden algorithm used for building communities


!pip install llama-index graspologic numpy==1.24.4 scipy==1.12.0


Load Data

We will use a sample news article dataset retrieved from Diffbot, which Tomaz has conveniently made available on GitHub for easy access.


The dataset contains 2,500 samples; for ease of experimentation, we will use 50 of these samples, which include the title and text of news articles.


import pandas as pd

from llama_index.core import Document


news = pd.read_csv(

    "https://raw.githubusercontent.com/tomasonjo/blog-datasets/main/news_articles.csv"

)[:50]


news.head()



Prepare documents as required by LlamaIndex


documents = [

    Document(text=f"{row['title']}: {row['text']}")

    for i, row in news.iterrows()

]


Setup API Key and LLM


import os


os.environ["OPENAI_API_KEY"] = "sk-..."


from llama_index.llms.openai import OpenAI


llm = OpenAI(model="gpt-4")
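
With the LLM configured, the documents can be split into the 1024-token chunks (20-token overlap) described above before running the extractor:


from llama_index.core.node_parser import SentenceSplitter

splitter = SentenceSplitter(chunk_size=1024, chunk_overlap=20)
nodes = splitter.get_nodes_from_documents(documents)
print(f"Created {len(nodes)} text chunks")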




references:

https://docs.llamaindex.ai/en/stable/examples/cookbooks/GraphRAG_v1/

What is Project GraphRAG

"Naïve RAG is great for queries where an embedding nearest neighbour search will help you arrive at a result quickly," Larson explained. "In other words, naïve RAG is better at finding specific phrases rather than more abstract ideas and concepts. It is difficult for naïve RAG to retrieve all relevant parts of abstract ideas and concepts. It has no understanding of the dataset as a whole and can't reason holistically over it."


One question that a traditional naive RAG approach can answer is a query such as: 'How many models of Product XYZ are we currently selling to Customer ZYX?'


However, naive models do not work so well with deeper questions such as: "Tell me about all of my customers and give me a summary of the status for each."


"Naïve RAG will fall short on this type of question as it doesn't have the ability to holistically analyze the dataset," Larson continued.


GraphRAG enters the fray by improving on naive RAG approaches based on vector search, a method of information retrieval in which queries and documents are mathematically represented as vectors instead of plain text.


GraphRAG uses an LLM to automate the extraction of a "rich knowledge graph" from a collection of text documents. It reports on the semantic structure of the data before answering user queries by detecting "communities" of nodes and then creating a hierarchical summary of the data to provide an overview of a dataset, with each community able to summarise its entities and their relationships.


Larson said: "GraphRAG enables a variety of new scenarios that naïve RAG fails to address. We see enormous potential for business productivity as GraphRAG takes us beyond the limitations of naïve RAG, allowing us to reason holistically and to get past the limitations of vector search.


"For example, suppose I look at a tranche of enterprise project and design documents and ask the question: 'What are the major projects that are being worked on? Give me details of each project and a listing of everyone mentioned to be working on it.'"


In contrast to naive approaches, GraphRAG builds a memory representation of the dataset which allows it to "clearly see and reason over its contents and their relationships", Larson went on. "This allows you to ask questions like 'which are the most popular products across all of our customers' for which naïve RAG would struggle," he said.


Microsoft's own research found that GraphRAG "outperforms" RAG on comprehensiveness and diversity when using community summaries "at any level of the community hierarchy", with a win rate of between 70% and 80%.


One challenge around this is where you have a lot of files in your data that have very similar information. How do you help your RAG system find that data when the search is looking at files with very similar semantic information?




References:

https://www.microsoft.com/en-us/research/project/graphrag/

Friday, June 13, 2025

The nuances of Cypher queries - the magic of semicolons

Give a comparison between the two Cypher snippets below.

Below does not work

===================

CREATE (chen:Person {name: 'Mr. David Chen'})

SET chen.title = 'Chief Technology Officer' 

CREATE (reed:Person {name: 'Dr. Evelyn Reed'})

CREATE (reed)-[:REPORTS_TO]->(chen);


Below works 

=============

CREATE (chen:Person {name:'Mr. David Chen', title:'Chief Technology Officer'})

CREATE (reed:Person {name:'Dr. Evelyn Reed'})

CREATE (reed)-[:REPORTS_TO]->(chen);


The first version most likely fails because of a syntax or session/transaction handling issue. Let's examine common causes:


🔍 1. Use of Semicolon (;)

Cypher doesn’t always support semicolons inside a multi-statement script, depending on the environment you’re using (e.g., Neo4j Browser, cypher-shell, or a driver).

If you’re running this as a script or block of statements, the semicolon prematurely ends the transaction, and so the variable chen no longer exists in scope for the last line:

SET chen.title = 'Chief Technology Officer';  // <- Ends the statement

after this, the next CREATE starts a new statement, and chen is undefined.

Fixing options are:

Option 1: Remove the semicolons (if not required by your tool)

CREATE (chen:Person {name: 'Mr. David Chen'})
SET chen.title = 'Chief Technology Officer'
CREATE (reed:Person {name: 'Dr. Evelyn Reed'})
CREATE (reed)-[:REPORTS_TO]->(chen)

Option 2: Combine CREATE and SET in a single query block

CREATE (chen:Person {name: 'Mr. David Chen'})
WITH chen
SET chen.title = 'Chief Technology Officer'
CREATE (reed:Person {name: 'Dr. Evelyn Reed'})
CREATE (reed)-[:REPORTS_TO]->(chen)


What is TruLens for LLMs?

TruLens is an open-source Python library that provides tools for evaluating and tracking the performance of Large Language Model (LLM) based applications. It helps developers understand how their LLM apps are performing, identify areas for improvement, and make informed decisions about their LLM development process. 

Key Features of TruLens:

Instrumentation:

TruLens allows developers to add instrumentation to their LLM apps to monitor and track key metrics such as latency, cost, and token counts. 

Feedback Functions:

TruLens provides programmatic feedback functions that can be used to evaluate the quality of LLM outputs, including metrics like relevance, sentiment, and grounding. 

Tracing:

TruLens enables detailed tracing of LLM app execution, including app inputs and outputs, LLM calls, and retrieved context chunks. 

Evaluation:

TruLens provides tools for evaluating the performance of LLM apps across various quality metrics, allowing developers to compare different versions of their apps. 

Integrations:

TruLens integrates with popular LLM frameworks like LlamaIndex. 

LLM-as-a-Judge:

TruLens allows developers to leverage LLMs themselves to evaluate other LLM outputs, for example, to assess the relevance of the context to a question. 

Benefits of using TruLens:

Faster Iteration:

TruLens enables rapid iteration on LLM applications by providing feedback and tracing to quickly identify areas for improvement. 

Improved Quality:

TruLens helps developers understand how their LLM apps are performing and identify potential issues, leading to better quality LLM applications. 

Informed Decisions:

TruLens provides data-driven insights into LLM app performance, allowing developers to make informed decisions about cost, latency, and response quality. 

Reduced Hallucination:

TruLens helps developers evaluate and mitigate the issue of hallucination in LLM outputs, ensuring that the LLM provides accurate and grounded information. 

LLMOps:

TruLens plays a role in the LLMOps stack by providing tools for evaluating and tracking LLM experiments, helping to scale up human review efforts. 
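
As a rough illustration only, here is what instrumenting a LlamaIndex app with TruLens can look like. This sketch assumes the trulens_eval package's Tru and TruLlama wrappers, an OpenAI feedback provider, and an existing query_engine; the exact class names and arguments have changed across TruLens versions, so treat it as a shape rather than a recipe.


from trulens_eval import Tru, TruLlama, Feedback
from trulens_eval.feedback.provider.openai import OpenAI as OpenAIProvider

tru = Tru()  # local session that stores traces and feedback results

provider = OpenAIProvider()
f_relevance = Feedback(provider.relevance).on_input_output()  # LLM-as-a-judge relevance check

# Wrap an existing LlamaIndex query engine so calls are traced and scored.
tru_app = TruLlama(query_engine, app_id="rag_v1", feedbacks=[f_relevance])

with tru_app as recording:
    query_engine.query("What does the report say about Q3 revenue?")

tru.get_leaderboard(app_ids=["rag_v1"])  # compare app versions on cost, latency, and feedback scores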

references:

https://www.trulens.org/


What is GuardRail AI?

 GuardRail AI, often referred to as Guardrails, is an open-source Python framework designed to make AI applications—especially those using large language models (LLMs)—more reliable, safe, and structured. Here’s a breakdown:


1. Input/Output Validation

It inserts “guardrails” around LLMs, intercepting both user input and model outputs to detect and prevent risks like:

Toxic language

Hallucinations (incorrect or misleading content)

Personal data leaks

Prompt injections or jailbreaking attempts


2. Structured Data Generation

Beyond safety, it also enables LLMs to generate guaranteed structured outputs—like JSON—with built-in schema validation.


3. Customizable Warning Library (“Guardrails Hub”)

It includes a community-driven library of validators (e.g., for PII, toxic content, regex patterns). You can mix and match these to build tailored guards.



You install it via pip install guardrails-ai, configure it, then define guards like:


from guardrails import Guard, OnFailAction

from guardrails.hub import RegexMatch


guard = Guard().use(

    RegexMatch, regex="\\d{10}", on_fail=OnFailAction.EXCEPTION

)

guard.validate("1234567890")  # passes

guard.validate("ABC")         # throws validation error



Why It Matters

Risk Reduction: Automatically prevents problematic content before it’s returned to users.

Compliance & Safety: Helps ensure outputs meet legal, ethical, and brand guidelines.

Developer Convenience: Plug-and-play validation rules make LLMs easier to govern in production.


Ecosystem & Benchmarks

Guardrails Hub: Central place to install and manage validators.

Guardrails Index: A benchmark evaluating guard performance across risks like PII, hallucinations, and jailbreaks.


In short, 


GuardRail AI is a powerful toolkit for developers building LLM-based systems that need trustworthiness, structure, and safety. Through simple Python APIs, you can enforce a wide range of custom validation rules around both inputs and outputs, dramatically reducing risks in real-world AI deployments.


What is PromptLayer

Version, test, and monitor every prompt and agent with robust evals, tracing, and regression sets. Empower domain experts to collaborate in the visual editor

Prompt management

Visually edit, A/B test, and deploy prompts. Compare usage and latency. Avoid waiting for eng redeploys.

Collaboration with experts

Open up prompt iteration to non-technical stakeholders. Our LLM observability allows you to read logs, find edge-cases, and improve prompts.

Evaluation

Evaluate prompts against usage history. Compare models.

Monitor usage

Understand how your LLM application is being used, by whom, and how often. No need to jump back and forth to Mixpanel or Datadog.


 

Tuesday, June 10, 2025

Top 10 reasons to use Graph Database

Understand Complex Relationships:

Graph databases excel at representing and querying relationships between data points. This structured representation of connections enables AI agents to grasp intricate relationships within the data, leading to more accurate and meaningful insights. 


Enable Richer Reasoning and Decision-Making:

By leveraging the interconnected nature of graph data, AI agents can perform multi-step reasoning and make more informed decisions. They can traverse relationships to infer new information and identify patterns, leading to more intelligent and dynamic responses. 


Improve Data Retrieval and Accuracy:

Graph databases, when combined with AI language models, enhance data retrieval through natural language understanding and complex relationship mapping. This results in more accurate and relevant answers, especially for complex queries. 


Facilitate Knowledge Graphs:

Graph databases serve as the foundation for knowledge graphs, allowing AI systems to explore and connect various data points, enhancing the depth and accuracy of answers. 


Enrich Responses with Context:

By connecting related facts across data, graph databases allow AI agents to provide more accurate and contextualized responses. 


Accelerate AI and Agent Development:

Graph databases seamlessly integrate with AI frameworks, facilitating the development of intelligent agents and multi-agent systems. 


In essence, graph databases provide AI agents with the tools to handle rich, interconnected knowledge, leading to more intelligent and responsive systems. They are particularly valuable for applications where understanding relationships between data points is crucial, such as in knowledge graphs, fraud detection, and social network analysis.