Spring Boot集成Flink实现ES数据同步

【代码】Spring Boot集成Flink实现ES数据同步。

qq_40464334

228人浏览 · 2025-10-30 09:20:34

qq_40464334 · 2025-10-30 09:20:34 发布

Spring Boot集成Flink实现ES数据同步

添加依赖

在pom.xml中配置核心依赖项：

<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-java_2.12</artifactId>
    <version>1.15.0</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-elasticsearch7_2.12</artifactId>
    <version>1.15.0</version>
</dependency>
<dependency>
    <groupId>org.springframework.boot</groupId>
    <artifactId>spring-boot-starter</artifactId>
</dependency>

配置Elasticsearch连接

创建Elasticsearch配置类定义传输参数：

@Configuration
public class ElasticsearchConfig {
    @Value("${es.host}") private String host;
    @Value("${es.port}") private int port;
    
    @Bean
    public TransportClient transportClient() throws UnknownHostException {
        return new PreBuiltTransportClient(Settings.EMPTY)
            .addTransportAddress(new TransportAddress(InetAddress.getByName(host), port));
    }
}

实现数据同步逻辑

构建Flink流处理作业进行数据转换和写入：

@Service
public class FlinkESService {
    @Autowired private TransportClient esClient;

    public void startSync() throws Exception {
        final StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        
        // 模拟数据源（实际替换为Kafka/MySQL等连接器）
        DataStream<String> sourceStream = env.addSource(new SourceFunction<String>() {
            @Override public void run(SourceContext<String> ctx) {
                ctx.collect("{\"id\":1,\"name\":\"test\"}");
            }
            @Override public void cancel() {}
        });

        // 数据转换处理
        DataStream<Map<String, Object>> processedStream = sourceStream
            .map(JSON::parseObject)
            .map(obj -> {
                Map<String, Object> doc = new HashMap<>();
                doc.put("id", obj.getInteger("id"));
                doc.put("name", obj.getString("name"));
                return doc;
            });

        // 写入Elasticsearch
        List<HttpHost> httpHosts = Collections.singletonList(
            new HttpHost(esClient.settings().get("host"), 9200, "http"));
        
        processedStream.addSink(ElasticsearchSink.<Map<String,Object>>builder(
            httpHosts,
            (element, ctx, indexer) -> indexer.add(
                Requests.indexRequest()
                    .index("target_index")
                    .source(element)
            )
        ).build());

        env.execute("ES Data Sync Job");
    }
}

启动类配置

主启动类添加Flink作业触发点：

@SpringBootApplication
public class Application implements CommandLineRunner {
    @Autowired private FlinkESService flinkESService;
    
    public static void main(String[] args) {
        SpringApplication.run(Application.class, args);
    }

    @Override
    public void run(String... args) throws Exception {
        flinkESService.startSync();
    }
}

关键配置参数

application.yml需包含以下参数：

es:
  host: 127.0.0.1
  port: 9300
flink:
  parallelism: 4

注意事项

生产环境需替换模拟数据源为实际源连接器（如KafkaConsumer）
Elasticsearch版本需与flink-connector版本匹配
批量写入建议开启bulk.flush.max.actions参数
异常处理需实现ActionRequestFailureHandler接口

智能体开发者社区

中国智能体开发者社区，聚焦智能体与大模型开发，提供前沿资讯、实用工具链、开源项目及行业案例。通过技术沙龙、开发者大赛等活动，促进经验交流与协作，助力开发者快速构建创新智能应用。

更多推荐

OpenClaw 本地部署完整指南（Windows + Ollama）

本文档基于实际部署经验编写，旨在帮助你在 Windows 系统上从零开始搭建 OpenClaw，并连接本地 Ollama 模型（如 Qwen2.5 或 Qwen3），使其具备完整的智能体能力。文档包含了所有关键步骤以及常见问题的解决方案。

智能体开发者社区

OpenClaw 小白安装指南（Windows版）

（类似一个能自动执行任务的AI机器人），不是游戏。API Key只保存在你本地电脑的加密文件里，不会上传到任何地方。访问：https://github.com/miaoxworld/openclaw-manager/releases。: 一键安装脚本会自动安装Node.js 22+，如果失败，手动下载安装：https://nodejs.org/：在PowerShell中，鼠标右键就是粘贴，不需要按

智能体开发者社区

飞书 × OpenClaw 接入指南：不用服务器，用长连接把机器人跑起来

这个项目存在的意义，就是把“飞书接 OpenClaw”这件事，整理成一套的配置入口，并把官方文档没覆盖到的坑集中写成排查清单。先说清楚它的角色：OpenClaw 现在已经内置官方飞书插件 @openclaw/feishu，功能更完整、维护也更及时。，说明飞书 + AI 的接入已经走通。另外，仓库也推荐了一个新项目：把 OpenClaw 变成“多 Agent 团队”，用多个 Agent 分工，Sla