[Feature Request] 更灵活的视觉模型判别 · ChatGPTNextWeb/NextChat#5843

(4 comments) (1 reaction) (0 assignees)TypeScript (59,717 forks)batch import

enhancementgood first issuehelp wanted

Repository metrics

当前项目采用固定的关键词、排除关键词的方案进行视觉模型判别（isVisionModel），加上各模型厂商并没有采取一致的命名方案，导致模型视觉判别滞后和频繁修改，如最新的 gemini-exp-1114 也支持视觉能力了，但是当前的视觉判别不能直接适配，急需优化更灵活的视觉模型判别方法

可能的解决方案：

No response

Research direction: Investigate the current isVisionModel function in the codebase (likely in src/utils/model.ts) to understand how vision capability is determined. Then design a flexible mechanism such as an environment variable (e.g., VISION MODELS) or a frontend configuration that allows users to define which models have vision support. Consider how to parse and apply such configuration across the application. Review existing environment variable patterns in the project (e.g., CUSTOM MODELS) for consistency.
Tech stack: typescriptreactnodejs
Domain: backendfull stack
Issue type: Feature
Difficulty: 3
Estimated time: Half day
Activity status: Needs maintainer response
Clarity: Mostly clear
Prerequisites: Understand current isVisionModel implementationFamiliarity with environment variable configuration in NextChat
Newbie friendliness: 50