Introduction
When adapting scientific algorithms to high-performance computing devices, that is, parallel, shared-memory, multi-core systems, fast iteration cycles are hard to come by. The highest-priority task is therefore to set up an empirical model that predicts performance for a given hardware architecture and algorithm.
In this blog-like article, I intend to summarize, step by step, the knowledge required to build such performance models, drawing on a thesis (Anthony Joseph, 2009).
Background knowledge
This section intends to fill the knowledge gap for those who are not familiar with the technical details of the hardware. Readers who are already experts in the field can skip this section and move on to the next.
One should consult Hennessy and Patterson’s texts (e.g., *Computer Organization and Design*) for details.
Microprocessor
A microprocessor implements a Central Processing Unit (CPU) on an integrated circuit. The CPU operates on instructions defined by an Instruction Set Architecture (ISA), for example x86, ARM, or RISC-V. Its operation can be summarized as a loop through which every instruction passes (a toy sketch of this loop follows the list):
- fetch: the next instruction is fetched from main memory
- decode: the instruction is decoded and the data it requires are fetched
- execute: the CPU carries out the operation specified by the instruction
- write back: the result of the operation is stored back, and the instruction is retired
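To make the loop concrete, here is a minimal C sketch of a fetch-decode-execute interpreter for a hypothetical four-instruction toy ISA. The opcodes and memory layout are invented purely for illustration; real ISAs are far richer:

```c
#include <stdio.h>
#include <stdint.h>

/* Hypothetical toy ISA, for illustration only:
   opcode 0 = LOAD  (acc = mem[operand])
   opcode 1 = ADD   (acc += mem[operand])
   opcode 2 = STORE (mem[operand] = acc)
   opcode 3 = HALT */
typedef struct { uint8_t opcode; uint8_t operand; } Instr;

int main(void) {
    Instr program[] = { {0, 0}, {1, 1}, {2, 2}, {3, 0} };
    int mem[4] = { 2, 3, 0, 0 };
    int acc = 0, pc = 0, running = 1;

    while (running) {
        Instr ins = program[pc++];      /* fetch: read the instruction at PC */
        switch (ins.opcode) {           /* decode: select the operation */
        case 0: acc = mem[ins.operand]; break;  /* execute: load into accumulator */
        case 1: acc += mem[ins.operand]; break; /* execute: add */
        case 2: mem[ins.operand] = acc; break;  /* write back: store to memory */
        default: running = 0; break;            /* HALT */
        }
    }
    printf("mem[2] = %d\n", mem[2]);    /* prints 5 (= 2 + 3) */
    return 0;
}
```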
The von Neumann architecture means that both data and instructions are stored in main memory. Main memory is separate from the CPU, so data must be explicitly moved from main memory into the on-chip registers.
This explicit movement of data is normally the rate-determining step, i.e. performance largely depends on how fast we can move the data. To solve this problem to some extent, computer scientists proposed placing cache memory between the CPU and main memory.
In essence, cache memory exploits locality, of which there are two types: spatial and temporal. Spatial locality means that data near recently accessed data are likely to be needed soon; because memory is fetched in contiguous chunks (cache lines), sequential access needs fewer memory operations. Temporal locality means that recently used data and instructions are likely to be reused, so keeping them in the cache avoids repeated trips to main memory.
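The effect of spatial locality is easy to observe. The following minimal C sketch (array size and timing method are arbitrary choices for illustration) sums the same matrix twice: once in row-major order, which walks memory sequentially, and once in column-major order, which jumps N floats at a time. On typical hardware the row-major version is several times faster:

```c
#include <stdio.h>
#include <time.h>

#define N 4096
static float a[N][N];

/* Row-major traversal: consecutive accesses touch consecutive
   addresses, so each fetched cache line is fully used. */
static float sum_rows(void) {
    float s = 0.0f;
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            s += a[i][j];
    return s;
}

/* Column-major traversal: consecutive accesses are N floats apart,
   so nearly every access misses the cache. */
static float sum_cols(void) {
    float s = 0.0f;
    for (int j = 0; j < N; j++)
        for (int i = 0; i < N; i++)
            s += a[i][j];
    return s;
}

int main(void) {
    clock_t t0 = clock();
    volatile float s1 = sum_rows();
    clock_t t1 = clock();
    volatile float s2 = sum_cols();
    clock_t t2 = clock();
    printf("row-major: %.3fs, column-major: %.3fs\n",
           (double)(t1 - t0) / CLOCKS_PER_SEC,
           (double)(t2 - t1) / CLOCKS_PER_SEC);
    (void)s1; (void)s2;
    return 0;
}
```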
Parallelism
- Data-level parallelism (DLP): a form of parallelism in which a single operation manipulates a whole set of data at once. An example is Single Instruction, Multiple Data (SIMD); see the SIMD sketch after this list.
- Instruction-level parallelism (ILP): a form of parallelism that maximizes the utilization of hardware resources by increasing the number of instructions in flight at any given time. Instructions may be executed out of order (while typically retiring in order) to maximize ILP; an ILP sketch also follows the list. Common techniques include:
  - Superscalar execution
  - Instruction pipelining
  - Out-of-order execution
  - Branch prediction and speculative execution
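As a concrete DLP example, here is a minimal C sketch using x86 AVX intrinsics (it assumes an AVX-capable CPU and a compiler flag such as `gcc -mavx`); a single vector instruction adds eight floats at a time:

```c
#include <immintrin.h>
#include <stdio.h>

/* Add two float arrays eight elements at a time with AVX intrinsics. */
void add_avx(const float *x, const float *y, float *out, int n) {
    int i = 0;
    for (; i + 8 <= n; i += 8) {
        __m256 vx = _mm256_loadu_ps(x + i);   /* load 8 floats */
        __m256 vy = _mm256_loadu_ps(y + i);
        _mm256_storeu_ps(out + i, _mm256_add_ps(vx, vy)); /* 8 adds, 1 instruction */
    }
    for (; i < n; i++)                        /* scalar tail for leftovers */
        out[i] = x[i] + y[i];
}

int main(void) {
    float x[10], y[10], out[10];
    for (int i = 0; i < 10; i++) { x[i] = (float)i; y[i] = 2.0f * i; }
    add_avx(x, y, out, 10);
    printf("out[9] = %.1f\n", out[9]);        /* prints 27.0 (= 9 + 18) */
    return 0;
}
```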
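And as a concrete ILP illustration, the following C sketch sums an array with four independent accumulators (the factor of four is an arbitrary choice for illustration). Because the four additions in the loop body have no mutual dependencies, a pipelined, superscalar core can keep several of them in flight simultaneously, whereas a single-accumulator loop forms one serial dependency chain:

```c
#include <stdio.h>

/* Independent dependency chains expose ILP to the hardware. */
float sum_ilp(const float *x, int n) {
    float s0 = 0, s1 = 0, s2 = 0, s3 = 0;
    int i = 0;
    for (; i + 4 <= n; i += 4) {
        s0 += x[i];       /* these four adds do not depend  */
        s1 += x[i + 1];   /* on one another, so the core    */
        s2 += x[i + 2];   /* can issue them in parallel     */
        s3 += x[i + 3];
    }
    float s = s0 + s1 + s2 + s3;
    for (; i < n; i++) s += x[i];   /* scalar tail for leftovers */
    return s;
}

int main(void) {
    float x[10];
    for (int i = 0; i < 10; i++) x[i] = 1.0f;
    printf("%.1f\n", sum_ilp(x, 10));   /* prints 10.0 */
    return 0;
}
```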