BIBLIOGRAPHY

“Models & Pricing | Deepseek Api Docs.” n.d. Accessed March 21, 2025. https://api-docs.deepseek.com/quick_start/pricing/.

“Pricing Perplexity Sonar 가격.” n.d. Perplexity. Accessed March 21, 2025. https://perplexity.mintlify.app/guides/pricing.

관련노트

History

2025 API 서비스 가격 비교

[2025-01-27 Mon 16:18]

LLM Pricing 가격비교

나름 정리해본다면?

updated [2025-01-27 Mon 20:59]

ProviderModelInput/M tokensOutput/M tokensModalitiescontextNotesref
Anthropicclaude-haiku0.251.25Anthropic Console
claude-sonnet3.0015.00
claude-opus15.0075.00$yikes
claude-3.5-sonnet
OpenAIgpt-4-0125-preview10.0030.00Pricing
gpt-4o-mini0.150.60text,vision128K
Together.ai41B-70B chat models0.90 (in+out)Together Pricing
deepseekdeepseek-chat0.07 0.0141.1 0.2864KModels & Pricing
deepseek-reasoner0.142.1964K
perplexitysonar11127K
sonar-pro315200K

perplexity - 추론

https://perplexity.mintlify.app/guides/pricing (“Pricing Perplexity Sonar 가격” n.d.)

  • sonar-reasoning 127k Chat Completion
    • sonar-reasoning outputs CoT in its response as well
  • sonar-pro 200k Chat Completion
    • sonar-pro has a max output token limit of 8k
  • sonar 127k Chat Completion

upstage

10달러 쿠폰

deepseek

(“Models & Pricing | Deepseek Api Docs” n.d.) [2025-01-28 Tue 12:08]

openrouter

gptel model guide - 가격 정보 넣는 방법

;;;; model descriptions
 
;; updated 2025-01-27
 
;; sonar, sonar-pro
 
;; (defconst gptel--perplexity-models
;;   '((sonar
;;      :description "Google's latest model with enhanced capabilities across various tasks"
;;      :capabilities (tool-use json media)
;;      :mime-types ("image/png" "image/jpeg" "image/webp" "image/heic" "image/heif"
;;                   "application/pdf" "text/plain" "text/csv" "text/html")
;;      :context-window 2000
;;      ;; input & output price is halved for prompts of 128k tokens or less
;;      :input-cost 2.50
;;      :output-cost 10
;;      :cutoff-date "2024-05")
;;     (gemini-2.0-flash-exp
;;      :description "Next generation features, superior speed, native tool use"
;;      :capabilities (tool-use json media)
;;      :mime-types ("image/png" "image/jpeg" "image/webp" "image/heic" "image/heif"
;;                   "application/pdf" "text/plain" "text/csv" "text/html")
;;      :context-window 1000
;;      :cutoff-date "2024-12")
;;     (gemini-1.5-flash
;;      :description "A faster, more efficient version of Gemini 1.5 optimized for speed"
;;      :capabilities (tool-use json media)
;;      :mime-types ("image/png" "image/jpeg" "image/webp" "image/heic" "image/heif"
;;                   "application/pdf" "text/plain" "text/csv" "text/html")
;;      :context-window 1000
;;      ;; input & output price is halved for prompts of 128k tokens or less
;;      :input-cost 0.15
;;      :output-cost 0.60
;;      :cutoff-date "2024-05")
;;     (gemini-1.5-flash-8b
;;      :description "High volume and lower intelligence tasks"
;;      :capabilities (tool-use json media)
;;      :context-window 1000
;;      :mime-types ("image/png" "image/jpeg" "image/webp" "image/heic" "image/heif"
;;                   "application/pdf" "text/plain" "text/csv" "text/html")
;;      ;; input & output price is halved for prompts of 128k tokens or less
;;      :input-cost 0.075
;;      :output-cost 0.30
;;      :cutoff-date "2024-10")
;;     (gemini-2.0-flash-thinking-exp
;;      :description "Stronger reasoning capabilities."
;;      :capabilities (tool-use media)
;;      :context-window 32
;;      :mime-types ("image/png" "image/jpeg" "image/webp" "image/heic" "image/heif"
;;                   "text/plain" "text/csv" "text/html")
;;      :cutoff-date "2024-08")
;;     (gemini-exp-1206
;;      :description "Improved coding, reasoning and vision capabilities"
;;      :capabilities (tool-use json media)
;;      :mime-types ("image/png" "image/jpeg" "image/webp" "image/heic" "image/heif"
;;                   "application/pdf" "text/plain" "text/csv" "text/html")
;;      :cutoff-date "2024-12")
;;     (gemini-pro
;;      :description "The previous generation of Google's multimodal AI model"
;;      :capabilities (tool-use json media)
;;      :mime-types ("image/png" "image/jpeg" "image/webp" "image/heic" "image/heif"
;;                   "application/pdf" "text/plain" "text/csv" "text/html")
;;      :context-window 32
;;      :input-cost 0.50
;;      :output-cost 1.50
;;      :cutoff-date "2023-02"))
;;   "List of available Gemini models and associated properties.
;; Keys:
 
;; - `:description': a brief description of the model.
 
;; - `:capabilities': a list of capabilities supported by the model.
 
;; - `:mime-types': a list of supported MIME types for media files.
 
;; - `:context-window': the context window size, in thousands of tokens.
 
;; - `:input-cost': the input cost, in US dollars per million tokens.
 
;; - `:output-cost': the output cost, in US dollars per million tokens.
 
;; - `:cutoff-date': the knowledge cutoff date.
 
;; - `:request-params': a plist of additional request parameters to
;;   include when using this model.
 
;; Information about the Gemini models was obtained from the following
;; source:
 
;; - <https://ai.google.dev/pricing>
;; - <https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models>")