<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Introduction to UVA RC GenAI | RC Learning Portal</title>
    <link>/notes/uva-rc-genai/</link>
      <atom:link href="/notes/uva-rc-genai/index.xml" rel="self" type="application/rss+xml" />
    <description>Introduction to UVA RC GenAI</description>
    <generator>Source Themes Academic (https://sourcethemes.com/academic/)</generator><language>en-us</language><copyright>©&nbsp;2026 The Rector and Visitors of the University of Virginia</copyright><lastBuildDate>Mon, 13 Apr 2026 00:00:00 +0000</lastBuildDate>
    <image>
      <url>/images/icon_hu13341279237897646923.png</url>
      <title>Introduction to UVA RC GenAI</title>
      <link>/notes/uva-rc-genai/</link>
    </image>
    
    <item>
      <title>University Policies</title>
      <link>/notes/uva-rc-genai/policies/guidelines/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/policies/guidelines/</guid>
      <description>&lt;h2 id=&#34;using-ai-at-uva&#34;&gt;Using AI at UVA&lt;/h2&gt;
&lt;p&gt;UVA RC GenAI aligns with the University’s GenAI Use Guidelines. These can be found at 
&lt;a href=&#34;https://in.virginia.edu/genai-useguidelines&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;Using AI at UVA: Guidelines &amp;amp; Terms of Use&lt;/a&gt;.&lt;/p&gt;
</description>
    </item>
    
    <item>
      <title>Research Data Usage</title>
      <link>/notes/uva-rc-genai/policies/research_usage/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/policies/research_usage/</guid>
      <description>&lt;h2 id=&#34;what-type-of-research-data-can-i-submit&#34;&gt;What Type of Research Data Can I Submit?&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Approved&lt;/strong&gt;: Any data suitable for Standard Security Zone (SSZ) (Rivanna/Afton), including sensitive University research data.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Prohibited&lt;/strong&gt;: Highly sensitive data, public health information (PHI), or other controlled access data types.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Unauthorized use of UVA RC GenAI may result in disciplinary action under the 
&lt;a href=&#34;https://in.virginia.edu/genai-useguidelines&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;University&amp;rsquo;s GenAI usage guidelines&lt;/a&gt;.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;When in doubt&lt;/strong&gt;: Contact us for 
&lt;a href=&#34;https://www.rc.virginia.edu/support&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;support&lt;/a&gt; before submitting&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;what-data-is-saved&#34;&gt;What Data is Saved?&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Prompts and responses are not saved&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Metadata may be retained&lt;/strong&gt; to better understand number of users, number of requests, demand for service, tokens generated and other metrics that will aid in ensuring a quality delivery of the service.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Logs are accessible only to admins&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
</description>
    </item>
    
    <item>
      <title>Access and Getting Started</title>
      <link>/notes/uva-rc-genai/usage/service_overview/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/service_overview/</guid>
      <description>&lt;p&gt;UVA RC Gen AI is available exclusively to users with active 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/hpc/access/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;Research Computing accounts&lt;/a&gt;. It&amp;rsquo;s accessible via both a chat-like browser interface and API within standard security zone systems (Afton/Rivanna).&lt;/p&gt;
&lt;h2 id=&#34;where-can-i-access-uva-rc-genai-from&#34;&gt;Where can I Access UVA RC GenAI from?&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On grounds&lt;/strong&gt;: Direct browser access is available. API access must run from a standard security zone HPC node. See 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/hpc/login/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;here&lt;/a&gt; for HPC access methods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Off grounds&lt;/strong&gt;: VPN is required for browser access and access to HPC nodes.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Additional details can be found in our 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/rcgenai-userguide/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;user guide&lt;/a&gt;, and support resources can be found 
&lt;a href=&#34;https://www.rc.virginia.edu/support/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;here&lt;/a&gt; on our website.&lt;/p&gt;
</description>
    </item>
    
    <item>
      <title>Connecting to UVA RC GenAI</title>
      <link>/notes/uva-rc-genai/usage/access/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/access/</guid>
      <description>&lt;h2 id=&#34;how-to-login&#34;&gt;How to Login&lt;/h2&gt;
&lt;p&gt;Below are the first steps needed for all users regardless of access method:&lt;/p&gt;
&lt;div style=&#34;background-color: #dc3545; border-left: 4px solid
  #2196F3; padding: 12px; margin: 16px 0;&#34;&gt;
  &lt;strong&gt;Note:&lt;/strong&gt; Accessing UVA RC GenAI requires an active UVA computing
  ID, research computing account and EServices password for Netbadge
  authentication. Duo two-factor authentication is required for
  browser access.
&lt;/div&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;If you don’t have an RC user account please see 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/hpc/access/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;how to get access to HPC Services&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Once you have an RC account, you are ready to use 
&lt;a href=&#34;https://open-webui.rc.virginia.edu/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;UVA RC GenAI&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ensure 
&lt;a href=&#34;https://in.virginia.edu/vpn&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;VPN connection&lt;/a&gt; if off grounds&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Login with Computing ID and EServices password&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Complete Duo 2FA&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
</description>
    </item>
    
    <item>
      <title>Terminology</title>
      <link>/notes/uva-rc-genai/usage/vocab/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/vocab/</guid>
      <description>&lt;p&gt;The following are common terms used when working with UVA RC GenAI and LLMs in general.&lt;/p&gt;
&lt;h3 id=&#34;core-units&#34;&gt;Core Units&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Token&lt;/strong&gt; - The basic unit of text LLMs process. Can be a whole word, part of a word, or a single character. Roughly 100 tokens ≈ 75 words. All usage, limits, and costs are measured in tokens.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Prompt&lt;/strong&gt; - The input text you send to the model. Prompts can be a question, instruction, or conversation history. A prompt is measured in tokens.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;System Prompt&lt;/strong&gt; - A hidden prompt that sets the model&amp;rsquo;s behavior rules for the entire conversation—such as tone, role, or safety constraints. It frames how the model interprets all subsequent user prompts.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;capacity-limits&#34;&gt;Capacity Limits&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Context Window&lt;/strong&gt; - The maximum number of tokens (input + output combined) a model can consider at one time.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Batched Token Length&lt;/strong&gt; - The total tokens across multiple requests sent together as a single batch. Batching can improve efficiency; this is the sum of all tokens in that group.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;generation-parameters-api-access-only&#34;&gt;Generation Parameters (API access only)&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Temperature&lt;/strong&gt; - Controls output randomness. Lower values (0.1-0.3) produce more focused, deterministic responses; higher values (0.8-1.0) produce more varied and creative outputs.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Seed&lt;/strong&gt; - An integer that initializes the random number generator. Using the same seed with identical prompts and settings produces reproducible outputs.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;usage-and-operations&#34;&gt;Usage and Operations&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Metadata&lt;/strong&gt; - Non-content information about an API exchange: timestamp, model version, token counts, request ID, cost.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Rate Limiting&lt;/strong&gt; - Restrictions on tokens or requests per time period to prevent overload and ensure fair access.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
</description>
    </item>
    
    <item>
      <title>Browser Access</title>
      <link>/notes/uva-rc-genai/usage/browser/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/browser/</guid>
      <description>&lt;p&gt;After signing in, you should have browser access to UVARC GenAI through the OpenWebUI interface.&lt;/p&gt;















&lt;figure &gt;


  &lt;a data-fancybox=&#34;&#34; 
    data-alt=&#34;Screenshot of OpenWebUI interface in browser with UVA RC GenAI&#34;
    href=&#34;/notes/uva-rc-genai/img/openwebui.png&#34; 
    &gt;


  &lt;img src=&#34;/notes/uva-rc-genai/img/openwebui.png&#34; alt=&#34;Screenshot of OpenWebUI interface in browser with UVA RC GenAI&#34;  &gt;
&lt;/a&gt;



&lt;/figure&gt;

&lt;p&gt;Here, you can chat through the conversational interface, adjust integrations (e.g., web search), or even upload and attach content to the chat session.&lt;/p&gt;
&lt;p&gt;Files can be loaded into the web interface – supported extensions include: pdf, docx, txt, md, csv, png, jpeg, jpg, pptx, xls, xlsx, json, sh, html, htm, xhtml, js, and py.&lt;/p&gt;
&lt;div style=&#34;background-color: #dc3545; border-left: 4px solid
  #2196F3; padding: 12px; margin: 16px 0;&#34;&gt;
  &lt;strong&gt;Note:&lt;/strong&gt; Chats are not saved. Conversation history disappears
  when you close the browser tab, sign out, or if the session expires.
&lt;/div&gt;
&lt;p&gt;More on data management will be discussed 
&lt;a href=&#34;/notes/uva-rc-genai/usage/data_management&#34;


&gt;later&lt;/a&gt;.&lt;/p&gt;
</description>
    </item>
    
    <item>
      <title>API Access</title>
      <link>/notes/uva-rc-genai/usage/api/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/api/</guid>
      <description>&lt;p&gt;API access lets you send requests to UVA RC GenAI programmatically from your code, instead of typing into a chat window. This enables automation and integration with existing workflows like bulk processing, large-scale data analysis, and connection to other tools.&lt;/p&gt;
&lt;h2 id=&#34;use-cases&#34;&gt;Use Cases&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Browser Access:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Asking a single research question or a quick explanation&lt;/li&gt;
&lt;li&gt;Brainstorming ideas for a project or paper&lt;/li&gt;
&lt;li&gt;Drafting or polishing a single email or paragraph&lt;/li&gt;
&lt;li&gt;Experimenting with prompts&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;API Access:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Processing hundreds or research papers to extract summaries&lt;/li&gt;
&lt;li&gt;Building a pipeline that automatically categorizes and tags new data as it arrives&lt;/li&gt;
&lt;li&gt;Integrating the LLM into a custom internal tool&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Overall, use the browser when you&amp;rsquo;re interacting directly, and use the API when you want the LLM to work inside your existing workflows to handle volume without you having to type each prompt.&lt;/p&gt;
&lt;h2 id=&#34;getting-your-api-key&#34;&gt;Getting your API Key&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Sign in to 
&lt;a href=&#34;https://open-webui.rc.virginia.edu/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;UVA RC GenAI&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Click profile (top right) → Settings → Account&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Select “Show” next to “API Keys”&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;















&lt;figure &gt;


  &lt;a data-fancybox=&#34;&#34; 
    data-alt=&#34;Screenshot of settings tab in browser where API key can be viewed, copied, or created&#34;
    href=&#34;/notes/uva-rc-genai/img/apikey.png&#34; 
    &gt;


  &lt;img src=&#34;/notes/uva-rc-genai/img/apikey.png&#34; alt=&#34;Screenshot of settings tab in browser where API key can be viewed, copied, or created&#34;  &gt;
&lt;/a&gt;



&lt;/figure&gt;

&lt;p&gt;You will have the option to view, copy or create a new API key.&lt;/p&gt;
&lt;h2 id=&#34;securely-storing-your-key&#34;&gt;Securely Storing Your Key&lt;/h2&gt;
&lt;p&gt;Use environment variables to safely store your key (e.g., &lt;code&gt;export UVARC_GenAI_API=&amp;quot;your-key-here&amp;quot;&lt;/code&gt;). Make sure to never commit keys to code repositories, and regenerate keys in UVA RC GenAI browser if compromised.&lt;/p&gt;
&lt;div style=&#34;background-color: #dc3545; border-left: 4px solid
  #2196F3; padding: 12px; margin: 16px 0;&#34;&gt;
  &lt;strong&gt;Note:&lt;/strong&gt; You need to be on a compute node to run your code.
&lt;/div&gt;
&lt;p&gt;HPC compute nodes can be accessed either via 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/hpc/ood/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;OpenOn Demand&lt;/a&gt; (JupyterLab, Desktop, etc) or through an 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/hpc/slurm/#submitting-an-interactive-job&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;ijob&lt;/a&gt; from the command line.&lt;/p&gt;
&lt;h2 id=&#34;code-examples&#34;&gt;Code Examples&lt;/h2&gt;
&lt;p&gt;Each of the following examples prompts &amp;ldquo;Hello&amp;rdquo; to UVA RC GenAI via API.&lt;/p&gt;
&lt;details class=&#34;spoiler &#34;  id=&#34;spoiler-1&#34;&gt;
  &lt;summary&gt;Curl Example&lt;/summary&gt;
  &lt;p&gt;&lt;p&gt;&lt;span id=&#34;code-download-0-filename&#34; class=&#34;sr-only&#34;&gt;Contents of uva-rc-genai-curl.sh&lt;/span&gt;&lt;/p&gt;
&lt;pre&gt;&lt;code class=&#34;language-bash&#34;&gt;export UVARC_GenAI_API=&amp;quot;&amp;lt;yourAPIkey&amp;gt;&amp;quot; 
curl -X POST &amp;quot;https://open-webui.rc.virginia.edu/api/chat/completions&amp;quot; -H &amp;quot;Authorization: Bearer $UVARC_GenAI_API&amp;quot; -H &amp;quot;Content-Type: application/json&amp;quot; -d &#39;{&amp;quot;model&amp;quot;: &amp;quot;Kimi K2.5&amp;quot;, &amp;quot;messages&amp;quot;: [{&amp;quot;role&amp;quot;: &amp;quot;user&amp;quot;, &amp;quot;content&amp;quot;: &amp;quot;Hello&amp;quot;}]}&#39; | grep &#39;^data: &#39; | grep -oP &#39;&amp;quot;content&amp;quot;:&amp;quot;\K[^&amp;quot;]*&#39; | tr -d &#39;\n&#39; &amp;amp;&amp;amp; echo

&lt;/code&gt;&lt;/pre&gt;
&lt;a href=&#34;/notes/uva-rc-genai/codes/uva-rc-genai-curl.sh&#34; download aria-labelledby=&#34;code-download-0-link code-download-0-filename&#34; title=&#34;Download uva-rc-genai-curl.sh file&#34;&gt;
&lt;i class=&#34;fa fa-download&#34; aria-hidden=&#34;true&#34;&gt;&lt;/i&gt;
&lt;span id=&#34;code-download-0-link&#34; class=&#34;sr-only&#34;&gt;Download uva-rc-genai-curl.sh file&lt;/span&gt;
&lt;/a&gt;
&lt;br&gt;&lt;br&gt;&lt;/p&gt;
&lt;/details&gt;
&lt;details class=&#34;spoiler &#34;  id=&#34;spoiler-2&#34;&gt;
  &lt;summary&gt;Python with OpenAI Library&lt;/summary&gt;
  &lt;p&gt;&lt;p&gt;&lt;span id=&#34;code-download-0-filename&#34; class=&#34;sr-only&#34;&gt;Contents of uva-rc-genai-oai.py&lt;/span&gt;&lt;/p&gt;
&lt;pre&gt;&lt;code class=&#34;language-python&#34;&gt;import os
import openai

client = openai.OpenAI(
    base_url=&amp;quot;https://open-webui.rc.virginia.edu/api/&amp;quot;,
    api_key=os.environ.get(&amp;quot;UVARC_GenAI_API&amp;quot;)
)

response = client.chat.completions.create(
    model=&amp;quot;Kimi K2.5&amp;quot;,
    messages=[{&amp;quot;role&amp;quot;: &amp;quot;user&amp;quot;, &amp;quot;content&amp;quot;: &amp;quot;Hello&amp;quot;}],
    stream=True
)

# Handle streaming response
full_text = &amp;quot;&amp;quot;
for chunk in response:
    if chunk.choices[0].delta.content:
        full_text += chunk.choices[0].delta.content

print(full_text)

&lt;/code&gt;&lt;/pre&gt;
&lt;a href=&#34;/notes/uva-rc-genai/codes/uva-rc-genai-oai.py&#34; download aria-labelledby=&#34;code-download-0-link code-download-0-filename&#34; title=&#34;Download uva-rc-genai-oai.py file&#34;&gt;
&lt;i class=&#34;fa fa-download&#34; aria-hidden=&#34;true&#34;&gt;&lt;/i&gt;
&lt;span id=&#34;code-download-0-link&#34; class=&#34;sr-only&#34;&gt;Download uva-rc-genai-oai.py file&lt;/span&gt;
&lt;/a&gt;
&lt;br&gt;&lt;br&gt;&lt;/p&gt;
&lt;/details&gt;
&lt;h2 id=&#34;saving-outputs&#34;&gt;Saving Outputs&lt;/h2&gt;
&lt;p&gt;Outputs can be saved programmatically by capturing the response from your HTTP client (like &lt;code&gt;requests&lt;/code&gt; or &lt;code&gt;openai&lt;/code&gt; in Python, &lt;code&gt;axios&lt;/code&gt; in Node.js, or &lt;code&gt;curl&lt;/code&gt; in shell) using standard I/O methods to save the results to your storage.&lt;/p&gt;
&lt;p&gt;For example, pipe &lt;code&gt;curl&lt;/code&gt; output directly to a file with &lt;code&gt;&amp;gt; output.txt&lt;/code&gt;, or in Python use&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;with open(&amp;quot;output.txt&amp;quot;, &amp;quot;w&amp;quot;) as f:
    f.write(response_text)
&lt;/code&gt;&lt;/pre&gt;
&lt;h2 id=&#34;token-tracking&#34;&gt;Token Tracking&lt;/h2&gt;
&lt;p&gt;It&amp;rsquo;s important to monitor token usage to stay within limits and ensure requests don&amp;rsquo;t exceed the context window. Token counts are included in every API response. For example, in the OpenAI library:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;print(response.usage.prompt_tokens)      # Input tokens
print(response.usage.completion_tokens)  # Output tokens
print(response.usage.total_tokens)       # Combined
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Similar fields exist in the Anthropic SDK and other tools.&lt;/p&gt;
</description>
    </item>
    
    <item>
      <title>Claude Code</title>
      <link>/notes/uva-rc-genai/usage/claude_code/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/claude_code/</guid>
      <description>&lt;h2 id=&#34;claude-code-on-hpc&#34;&gt;Claude Code on HPC&lt;/h2&gt;
&lt;p&gt;Claude Code is an AI-assistant for software engineering tasks that runs directly in the terminal to write, edit and analyze code. You can link it to UVA RC GenAI.&lt;/p&gt;
&lt;h3 id=&#34;installing-claude-code&#34;&gt;Installing Claude Code&lt;/h3&gt;
&lt;p&gt;First, you&amp;rsquo;ll need to install claude code into your home account on a login node with the following:&lt;/p&gt;
&lt;p&gt;&lt;code&gt;curl -fsSL https://claude.ai/install.sh | bash&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;once the above command is finished running, run the following to add &lt;code&gt;~/.local/bin&lt;/code&gt; to your path:&lt;/p&gt;
&lt;p&gt;&lt;code&gt;echo &#39;export PATH=&amp;quot;$HOME/.local/bin:$PATH&amp;quot;&#39; &amp;gt;&amp;gt; ~/.bashrc &amp;amp;&amp;amp; source ~/.bashrc&lt;/code&gt;&lt;/p&gt;
&lt;h3 id=&#34;claude-code-with-uva-rc-genai&#34;&gt;Claude Code with UVA RC GenAI&lt;/h3&gt;
&lt;p&gt;Set these environment variables:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;export API_KEY=&amp;quot;&amp;lt;your_api_key&amp;gt;&amp;quot;
export ANTHROPIC_BASE_URL=&amp;quot;https://open-webui.rc.virginia.edu/api&amp;quot;
export ANTHROPIC_AUTH_TOKEN=$API_KEY
export ANTHROPIC_API_KEY=&amp;quot;&amp;quot;
export CLAUDE_CODE_MAX_OUTPUT_TOKENS=4096
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Then launch with:
&lt;code&gt;claude --model &#39;Kimi K2.5&#39;&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;claude&lt;/code&gt; will run in whatever directory it&amp;rsquo;s launched in.&lt;/p&gt;
</description>
    </item>
    
    <item>
      <title>Jupyter Notebooks</title>
      <link>/notes/uva-rc-genai/usage/jupyter_notebooks/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/jupyter_notebooks/</guid>
      <description>&lt;h2 id=&#34;jupyter-notebook-workflow&#34;&gt;Jupyter Notebook Workflow&lt;/h2&gt;
&lt;p&gt;UVA RC GenAI can be accessed from a Jupyter notebook using the python API client to integrate LLM capabilites for data analysis, code generation, and automated research workflows.&lt;/p&gt;
&lt;p&gt;Download this 
&lt;a href=&#34;https://www.rc.virginia.edu/data/LLM_API_Example.zip&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;ZIP file&lt;/a&gt; for a Jupyter notebook reference.&lt;/p&gt;
&lt;h2 id=&#34;research-use-cases&#34;&gt;Research Use Cases&lt;/h2&gt;
</description>
    </item>
    
    <item>
      <title>Data Management</title>
      <link>/notes/uva-rc-genai/usage/data_management/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/usage/data_management/</guid>
      <description>&lt;h2 id=&#34;methods-for-saving-chats&#34;&gt;Methods for Saving Chats&lt;/h2&gt;
&lt;p&gt;Methods for saving chats via API were discussed 
&lt;a href=&#34;/notes/uva-rc-genai/usage/api/#saving-outputs&#34;


&gt;earlier&lt;/a&gt;, but you also have the option to export or save chats through the OpenWebUI interface. Clicking the three dots next to your icon in the upper-right section of the interface displays the options to either download or copy the chat session.&lt;/p&gt;















&lt;figure &gt;


  &lt;a data-fancybox=&#34;&#34; 
    data-alt=&#34;Screenshot of OpenWebUI option to Download or Copy chat&#34;
    href=&#34;/notes/uva-rc-genai/img/browser_save.png&#34; 
    &gt;


  &lt;img src=&#34;/notes/uva-rc-genai/img/browser_save.png&#34; alt=&#34;Screenshot of OpenWebUI option to Download or Copy chat&#34;  &gt;
&lt;/a&gt;



&lt;/figure&gt;

&lt;p&gt;The chat can be downloaded in three formats (&lt;code&gt;.txt&lt;/code&gt;, &lt;code&gt;.pdf&lt;/code&gt;, &lt;code&gt;.json&lt;/code&gt;). Saving as a &lt;code&gt;.txt&lt;/code&gt; file stores only your prompts and the LLM&amp;rsquo;s responses/reasonings; however, saving as a &lt;code&gt;.json&lt;/code&gt; includes metadata (e.g., timestamps, model, version, message IDs, etc), which may be useful in certain cases.&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;Copy&amp;rdquo; option copies only prompts, responses and reasonings, similar to what&amp;rsquo;s stored in the &lt;code&gt;.txt&lt;/code&gt; file.&lt;/p&gt;
&lt;h2 id=&#34;storage-options&#34;&gt;Storage Options&lt;/h2&gt;
&lt;p&gt;Downloaded browser chats will be saved to your local workstation but can be easily copied to HPC storage. If you&amp;rsquo;re saving chats on HPC storage it&amp;rsquo;s important to remember the limitations and use cases for each of RCs storage options.
See 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/hpc/storage/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;here&lt;/a&gt; for details on our complementary storage options and 
&lt;a href=&#34;https://www.rc.virginia.edu/userinfo/storage/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;here&lt;/a&gt; for leased options.&lt;/p&gt;
&lt;h2 id=&#34;reproducibility-considerations&#34;&gt;Reproducibility Considerations&lt;/h2&gt;
&lt;p&gt;Reproducibility controls are important since LLM outputs can vary across different runs. This makes it impossible to verify results, replicate research, or trace back errors. The following are some important considerations regarding output reproducibility.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Temperature and Seed&lt;/strong&gt; - Set &lt;code&gt;temperature=0&lt;/code&gt; and a fixed &lt;code&gt;seed&lt;/code&gt; when consistency matters. Higher temperature introduces randomness.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Prompt Versioning&lt;/strong&gt; - Track prompts in versioned files since even small wording changes can affect outputs.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Saving Outputs&lt;/strong&gt; - Save raw API responses with metadata (temperature, tokens, request ID) rather than just extracted text.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Context Window&lt;/strong&gt; - Document the full context window, including system prompts, prior conversation history, and any retrieved content; exceeding the window silently drops content.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
</description>
    </item>
    
    <item>
      <title>Current Limitations</title>
      <link>/notes/uva-rc-genai/support/limitations/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/support/limitations/</guid>
      <description>&lt;h2 id=&#34;server-limitations&#34;&gt;Server Limitations&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Context window:&lt;/strong&gt; 65536 tokens&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Batched Token Length:&lt;/strong&gt; 131072 tokens&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Rate Limiting:&lt;/strong&gt; 60 requests per minute&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You may receive errors if any of these limits are reached. Errors may highlight that a rate limit has been exceeded if more than 60 requests are made within a minute, or you may receive context overflow errors if you surpass the maximum context length. Responses may cut off mid-sentence when the total token budget is used.&lt;/p&gt;
&lt;h2 id=&#34;workflow-challenges&#34;&gt;Workflow Challenges&lt;/h2&gt;
&lt;p&gt;If you&amp;rsquo;re working with large documents, you may need to break them into smaller pieces to fit within the context window. High-volume workflows like processing thousands of files or running automated batches can quickly hit rate limits, so you&amp;rsquo;ll need to pace your requests.
Long-running conversations or agentic workflows can exhaust the context window as chat history and file constraints accumulate.&lt;/p&gt;
&lt;h2 id=&#34;best-practices&#34;&gt;Best Practices&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Track token usage&lt;/strong&gt; - Monitor prompts and responses to avoid silent truncation.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Avoid large file uploads&lt;/strong&gt; - Split files into more maintainable chunks (if possible) to prevent filling context.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Saving Outputs&lt;/strong&gt; - Save full API output or OpenWebUI chat data (&lt;code&gt;.json&lt;/code&gt; format) for debugging and reproducibility.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Restart sessions when needed&lt;/strong&gt; - Restart session to clear context or use &lt;code&gt;/clear&lt;/code&gt; command in Claude Code to clear context window. You can also use the &lt;code&gt;/compact&lt;/code&gt; command in Claude Code to clear context while maintaining a summary of your chat.&lt;/li&gt;
&lt;/ul&gt;
</description>
    </item>
    
    <item>
      <title>FAQs</title>
      <link>/notes/uva-rc-genai/support/faqs/</link>
      <pubDate>Mon, 20 Apr 2026 00:00:00 +0000</pubDate>
      <guid>/notes/uva-rc-genai/support/faqs/</guid>
      <description>&lt;p&gt;Research Computing will continuously assess enhancements and emerging LLMs to provide the UVA research community with modern, evolving AI services. Improvements may include installing different models to expanded capabilities or adding additional hardware for enhanced performance.
UVA RC GenAI is an early-access deployment and support is limited. Please contact us through our 
&lt;a href=&#34;https://www.rc.virginia.edu/form/support-request/&#34;

 
    target=&#34;_blank&#34; 
    rel=&#34;noopener&#34;

&gt;support request&lt;/a&gt; form with any questions, concerns, or suggestions.&lt;/p&gt;
</description>
    </item>
    
  </channel>
</rss>
