feat: Add Official Microsoft & Gemini Skills (845+ Total)
## 🚀 Impact

Significantly expands the capabilities of **Antigravity Awesome Skills** by integrating official skill collections from **Microsoft** and **Google Gemini**. This update increases the total skill count to **845+**, making the library even more comprehensive for AI coding assistants.

## ✨ Key Changes

### 1. New Official Skills

- **Microsoft Skills**: Added a massive collection of official skills from [microsoft/skills](https://github.com/microsoft/skills).
  - Includes Azure, .NET, Python, TypeScript, and Semantic Kernel skills.
  - Preserves the original directory structure under `skills/official/microsoft/`.
  - Includes plugin skills from the `.github/plugins` directory.
- **Gemini Skills**: Added official Gemini API development skills under `skills/gemini-api-dev/`.

### 2. New Scripts & Tooling

- **`scripts/sync_microsoft_skills.py`**: A robust synchronization script that:
  - Clones the official Microsoft repository.
  - Preserves the original directory hierarchy.
  - Handles symlinks and plugin locations.
  - Generates attribution metadata.
- **`scripts/tests/inspect_microsoft_repo.py`**: Debug tool to inspect the remote repository structure.
- **`scripts/tests/test_comprehensive_coverage.py`**: Verification script to ensure 100% of skills are captured during sync.

### 3. Core Improvements

- **`scripts/generate_index.py`**: Enhanced frontmatter parsing to safely handle unquoted values containing `@` symbols and commas (fixing issues with some Microsoft skill descriptions).
- **`package.json`**: Added `sync:microsoft` and `sync:all-official` scripts for easy maintenance.

### 4. Documentation

- Updated `README.md` to reflect the new skill count (845+) and added Microsoft/Gemini to the provider list.
- Updated `CATALOG.md` and `skills_index.json` with the new skills.

## 🧪 Verification

- Ran `scripts/tests/test_comprehensive_coverage.py` to verify all Microsoft skills are detected.
- Validated the `generate_index.py` fixes by successfully indexing the new skills.
skills/official/microsoft/java/foundry/voicelive/SKILL.md
---
name: azure-ai-voicelive-java
description: |
  Azure AI VoiceLive SDK for Java. Real-time bidirectional voice conversations with AI assistants using WebSocket.
  Triggers: "VoiceLiveClient java", "voice assistant java", "real-time voice java", "audio streaming java", "voice activity detection java".
package: com.azure:azure-ai-voicelive
---

# Azure AI VoiceLive SDK for Java
Real-time, bidirectional voice conversations with AI assistants using WebSocket technology.

## Installation

```xml
<dependency>
    <groupId>com.azure</groupId>
    <artifactId>azure-ai-voicelive</artifactId>
    <version>1.0.0-beta.2</version>
</dependency>
```
## Environment Variables

```bash
AZURE_VOICELIVE_ENDPOINT=https://<resource>.openai.azure.com/
AZURE_VOICELIVE_API_KEY=<your-api-key>
```
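Since a missing variable otherwise surfaces only as a confusing failure at connect time, it can help to fail fast before building the client. A minimal stdlib-only sketch (the `requireEnv` helper is illustrative, not part of the SDK):

```java
import java.util.List;

public class EnvCheck {
    /** Returns the value of a required environment variable, or throws with a clear message. */
    static String requireEnv(String name) {
        String value = System.getenv(name);
        if (value == null || value.isBlank()) {
            throw new IllegalStateException("Missing required environment variable: " + name);
        }
        return value;
    }

    public static void main(String[] args) {
        for (String name : List.of("AZURE_VOICELIVE_ENDPOINT", "AZURE_VOICELIVE_API_KEY")) {
            System.out.println(name + " present: " + (System.getenv(name) != null));
        }
    }
}
```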
## Authentication

### API Key

```java
import com.azure.ai.voicelive.VoiceLiveAsyncClient;
import com.azure.ai.voicelive.VoiceLiveClientBuilder;
import com.azure.core.credential.AzureKeyCredential;

VoiceLiveAsyncClient client = new VoiceLiveClientBuilder()
    .endpoint(System.getenv("AZURE_VOICELIVE_ENDPOINT"))
    .credential(new AzureKeyCredential(System.getenv("AZURE_VOICELIVE_API_KEY")))
    .buildAsyncClient();
```
### DefaultAzureCredential (Recommended)

```java
import com.azure.identity.DefaultAzureCredentialBuilder;

VoiceLiveAsyncClient client = new VoiceLiveClientBuilder()
    .endpoint(System.getenv("AZURE_VOICELIVE_ENDPOINT"))
    .credential(new DefaultAzureCredentialBuilder().build())
    .buildAsyncClient();
```
## Key Concepts

| Concept | Description |
|---------|-------------|
| `VoiceLiveAsyncClient` | Main entry point for voice sessions |
| `VoiceLiveSessionAsyncClient` | Active WebSocket connection for streaming |
| `VoiceLiveSessionOptions` | Configuration for session behavior |
### Audio Requirements

- **Sample Rate**: 24kHz (24000 Hz)
- **Bit Depth**: 16-bit PCM
- **Channels**: Mono (1 channel)
- **Format**: Signed PCM, little-endian
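These requirements map directly onto a `javax.sound.sampled.AudioFormat` on the capture side. A minimal sketch (the class and constant names are illustrative, not part of the VoiceLive SDK):

```java
import javax.sound.sampled.AudioFormat;

public class VoiceLiveAudio {
    // 24 kHz, 16-bit, mono, signed PCM, little-endian (bigEndian = false)
    static final AudioFormat FORMAT = new AudioFormat(24000f, 16, 1, true, false);

    // Bytes per second of audio: sampleRate * channels * (bits / 8)
    static int bytesPerSecond(AudioFormat f) {
        return (int) f.getSampleRate() * f.getChannels() * (f.getSampleSizeInBits() / 8);
    }

    public static void main(String[] args) {
        System.out.println(bytesPerSecond(FORMAT)); // 48000 bytes per second at these settings
    }
}
```

At these settings one second of audio is 48,000 bytes, which is useful for sizing capture buffers.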
## Core Workflow

### 1. Start Session

```java
import reactor.core.publisher.Mono;

client.startSession("gpt-4o-realtime-preview")
    .flatMap(session -> {
        System.out.println("Session started");

        // Subscribe to events
        session.receiveEvents()
            .subscribe(
                event -> System.out.println("Event: " + event.getType()),
                error -> System.err.println("Error: " + error.getMessage())
            );

        return Mono.just(session);
    })
    .block();
```
### 2. Configure Session Options

```java
import com.azure.ai.voicelive.models.*;
import com.azure.core.util.BinaryData;
import java.util.Arrays;

ServerVadTurnDetection turnDetection = new ServerVadTurnDetection()
    .setThreshold(0.5)           // Sensitivity (0.0-1.0)
    .setPrefixPaddingMs(300)     // Audio before speech
    .setSilenceDurationMs(500)   // Silence to end turn
    .setInterruptResponse(true)  // Allow interruptions
    .setAutoTruncate(true)
    .setCreateResponse(true);

AudioInputTranscriptionOptions transcription = new AudioInputTranscriptionOptions(
    AudioInputTranscriptionOptionsModel.WHISPER_1);

VoiceLiveSessionOptions options = new VoiceLiveSessionOptions()
    .setInstructions("You are a helpful AI voice assistant.")
    .setVoice(BinaryData.fromObject(new OpenAIVoice(OpenAIVoiceName.ALLOY)))
    .setModalities(Arrays.asList(InteractionModality.TEXT, InteractionModality.AUDIO))
    .setInputAudioFormat(InputAudioFormat.PCM16)
    .setOutputAudioFormat(OutputAudioFormat.PCM16)
    .setInputAudioSamplingRate(24000)
    .setInputAudioNoiseReduction(new AudioNoiseReduction(AudioNoiseReductionType.NEAR_FIELD))
    .setInputAudioEchoCancellation(new AudioEchoCancellation())
    .setInputAudioTranscription(transcription)
    .setTurnDetection(turnDetection);

// Send configuration
ClientEventSessionUpdate updateEvent = new ClientEventSessionUpdate(options);
session.sendEvent(updateEvent).subscribe();
```
### 3. Send Audio Input

```java
byte[] audioData = readAudioChunk(); // Your PCM16 audio data
session.sendInputAudio(BinaryData.fromBytes(audioData)).subscribe();
```
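Audio is typically streamed in small chunks rather than one large buffer. A stdlib-only sketch of splitting a PCM buffer into fixed-size chunks (the 3200-byte chunk size, roughly 66 ms at 24 kHz 16-bit mono, is an illustrative choice, not an SDK requirement):

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class AudioChunker {
    /** Splits PCM bytes into chunks of at most chunkSize bytes, preserving order. */
    static List<byte[]> chunk(byte[] pcm, int chunkSize) {
        List<byte[]> chunks = new ArrayList<>();
        for (int offset = 0; offset < pcm.length; offset += chunkSize) {
            int end = Math.min(offset + chunkSize, pcm.length);
            chunks.add(Arrays.copyOfRange(pcm, offset, end));
        }
        return chunks;
    }

    public static void main(String[] args) {
        byte[] oneSecond = new byte[48000]; // 1 s of 24 kHz 16-bit mono audio
        System.out.println(chunk(oneSecond, 3200).size()); // 48000 / 3200 = 15 chunks
    }
}
```

Each chunk would then be passed to `session.sendInputAudio(BinaryData.fromBytes(chunk))` as shown above.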
### 4. Handle Events

```java
session.receiveEvents().subscribe(event -> {
    ServerEventType eventType = event.getType();

    if (ServerEventType.SESSION_CREATED.equals(eventType)) {
        System.out.println("Session created");
    } else if (ServerEventType.INPUT_AUDIO_BUFFER_SPEECH_STARTED.equals(eventType)) {
        System.out.println("User started speaking");
    } else if (ServerEventType.INPUT_AUDIO_BUFFER_SPEECH_STOPPED.equals(eventType)) {
        System.out.println("User stopped speaking");
    } else if (ServerEventType.RESPONSE_AUDIO_DELTA.equals(eventType)) {
        if (event instanceof SessionUpdateResponseAudioDelta) {
            SessionUpdateResponseAudioDelta audioEvent = (SessionUpdateResponseAudioDelta) event;
            playAudioChunk(audioEvent.getDelta());
        }
    } else if (ServerEventType.RESPONSE_DONE.equals(eventType)) {
        System.out.println("Response complete");
    } else if (ServerEventType.ERROR.equals(eventType)) {
        if (event instanceof SessionUpdateError) {
            SessionUpdateError errorEvent = (SessionUpdateError) event;
            System.err.println("Error: " + errorEvent.getError().getMessage());
        }
    }
});
```
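An if/else chain like the one above grows with every event type; a common refactor is a handler map keyed by event type. An SDK-free sketch of the pattern (the string keys stand in for `ServerEventType` values):

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Consumer;

public class EventDispatcher {
    private final Map<String, Consumer<String>> handlers = new HashMap<>();
    private final StringBuilder log = new StringBuilder();

    EventDispatcher() {
        handlers.put("session.created", payload -> log.append("created;"));
        handlers.put("response.done", payload -> log.append("done;"));
    }

    void dispatch(String type, String payload) {
        // Unknown event types are ignored rather than treated as errors
        handlers.getOrDefault(type, p -> {}).accept(payload);
    }

    String log() { return log.toString(); }
}
```

Registering one handler per event type keeps each reaction small and independently testable.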
## Voice Configuration

### OpenAI Voices

```java
// Available: ALLOY, ASH, BALLAD, CORAL, ECHO, SAGE, SHIMMER, VERSE
VoiceLiveSessionOptions options = new VoiceLiveSessionOptions()
    .setVoice(BinaryData.fromObject(new OpenAIVoice(OpenAIVoiceName.ALLOY)));
```
### Azure Voices

```java
// Azure Standard Voice
options.setVoice(BinaryData.fromObject(new AzureStandardVoice("en-US-JennyNeural")));

// Azure Custom Voice
options.setVoice(BinaryData.fromObject(new AzureCustomVoice("myVoice", "endpointId")));

// Azure Personal Voice
options.setVoice(BinaryData.fromObject(
    new AzurePersonalVoice("speakerProfileId", PersonalVoiceModels.PHOENIX_LATEST_NEURAL)));
```
## Function Calling

```java
VoiceLiveFunctionDefinition weatherFunction = new VoiceLiveFunctionDefinition("get_weather")
    .setDescription("Get current weather for a location")
    .setParameters(BinaryData.fromObject(parametersSchema));

VoiceLiveSessionOptions options = new VoiceLiveSessionOptions()
    .setTools(Arrays.asList(weatherFunction))
    .setInstructions("You have access to weather information.");
```
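`parametersSchema` above is left undefined; function parameters are conventionally described as a JSON Schema object. A stdlib-only sketch of building one with nested maps (the exact shape the service expects is an assumption here; consult the SDK samples):

```java
import java.util.List;
import java.util.Map;

public class WeatherSchema {
    /** JSON-Schema-style description of the get_weather parameters (illustrative). */
    static Map<String, Object> parametersSchema() {
        return Map.of(
            "type", "object",
            "properties", Map.of(
                "location", Map.of(
                    "type", "string",
                    "description", "City and state, e.g. Seattle, WA")),
            "required", List.of("location"));
    }

    public static void main(String[] args) {
        System.out.println(parametersSchema().keySet());
    }
}
```

The resulting map can be serialized via `BinaryData.fromObject(...)` as in the snippet above.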
## Best Practices

1. **Use the async client** — VoiceLive requires reactive patterns
2. **Configure turn detection** for natural conversation flow
3. **Enable noise reduction** for better speech recognition
4. **Handle interruptions** gracefully with `setInterruptResponse(true)`
5. **Use Whisper transcription** for input audio transcription
6. **Close sessions** properly when the conversation ends
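For practice 6, the standard Java idiom is try-with-resources on anything `AutoCloseable`; whether the VoiceLive session client implements it should be checked against the SDK docs, so this sketch uses a stand-in resource to show the shape of the pattern:

```java
public class SessionClosePattern {
    /** Stand-in for a session-like resource; real code would hold the SDK session. */
    static class FakeSession implements AutoCloseable {
        boolean closed = false;
        @Override public void close() { closed = true; }
    }

    static FakeSession runConversation() {
        FakeSession session = new FakeSession();
        try (session) {
            // ... stream audio and handle events ...
        }
        return session; // closed even if the body throws
    }

    public static void main(String[] args) {
        System.out.println(runConversation().closed); // true
    }
}
```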
## Error Handling

```java
import reactor.core.publisher.Flux;

session.receiveEvents()
    .doOnError(error -> System.err.println("Connection error: " + error.getMessage()))
    .onErrorResume(error -> {
        // Attempt reconnection or cleanup
        return Flux.empty();
    })
    .subscribe();
```
## Reference Links

| Resource | URL |
|----------|-----|
| GitHub Source | https://github.com/Azure/azure-sdk-for-java/tree/main/sdk/ai/azure-ai-voicelive |
| Samples | https://github.com/Azure/azure-sdk-for-java/tree/main/sdk/ai/azure-ai-voicelive/src/samples |