Netease Sufan's Personal Space - OSCHINA - Chinese Open Source Technology Exchange Community

NetEase Counts Sails

Focus on the dissemination of digital basic software technology

Certified official account

3.3K

Experience value

3.3K

Open-source bean

two hundred and sixty-seven

follow private letter

Distributed storage system

three hundred and twenty-five

Service Grid Components

twenty

High performance general JDBC and SQL execution engine

forty-seven

Multi tenant visual Kubernetes management platform

eighteen

Cloud native diagnosis O&M orchestration framework

six

High performance cloud native API gateway

thirteen

Cloud native log system

thirty-nine

Flow type warehouse service

twenty

Display all 8 software

Double Hit day

fork: star:

Loading

In November 2023, GF Securities became one of the first securities companies to obtain management quantification level (Level 4) through DCMM data management maturity capability maturity assessment. At present, tens of thousands of Kyuubi operations have become the core part of GF data comprehensive governance and key data system. Ben

2023/12/29 16:15

two hundred and forty-nine

This article is from Netease Hangyan big data technology expert, Apache Kyuubi PMC Member, and Apache Spark Committee You Xiduo. It focuses on Apache Spark and Native Engine, sharing what is Native Engine and why do Na

2023/12/04 16:30

four hundred and twenty-one

Yan Qing, a Netease Hangzhou Research Institute and Netease Sufan big data expert, has added a new title. This time, he is a member of Apache Incubator PMC. Recently, the Apache Incubator PMC (Apache Software Foundation Incubator Project Management Committee, referred to as IPMC) announced that it has accepted

2023/12/01 12:34

six hundred and thirty-five

Amoro is a hucang management system built on Apache Iceberg and other open data lake forms. It is open source launched by the big data team of Netease Sufan, and provides a set of pluggable data self optimization mechanism and management services, aiming to bring out of the box hucang users

2023/11/20 15:03

three hundred and sixteen

Amoro is a warehouse management system built on Apache Iceberg and other open data lake forms, which provides a set of pluggable data self optimization mechanism and management services, aiming to bring users an out of the box warehouse use experience. On November 7, 2023, Amo

amoro flink https kubernetes ams orc github data storage delete docs iceberg terminal releases

2023/11/17 17:50

one hundred and seventy-nine

Apache Kyuubi [1] is a distributed multi tenant SQL gateway. Its main function is to accept the SQL submitted by users through JDBC/REST and other protocols and route it to the SQL engine under its management for execution according to the multi tenant isolation policy. In the latest version of Kyuubi 1.8, K

Server software apache spark

2023/10/18 09:39

three hundred and forty-seven

In the era of digital intelligence, open source software has gained unprecedented attention because it has become the core support for enterprise competitiveness. But today, the storm of cost reduction and efficiency increase has swept the world. It is difficult to stick to open source, and Netease Digital Fan has always been active in open source. Recently, Envoy community announced that it is

Cloud primordial

cloud computing

2023/09/27 10:18

nine hundred and seventy-five

This article is compiled from Pan Cheng, a software engineer of Netease Digital Sail, who shared it in ASF CommunityOverCode Asia 2023 (Beijing). The main contents of this article are: 1) The benefits and challenges of Spark Cloud native; 2) How to build a unified Spark task network based on Apache Kyuubi

2023/08/25 10:49

1.1K

Background service discovery is the core of microservice governance. The traditional microservice architecture uses the Consumer/Provider model. The provider registers service information with the registry, and the consumer discovers the provider's service information through the registry. In the cloud native service grid system

istio nacos github zookeeper eureka

cloud computing

2023/04/27 09:49

2.8K

Arctic is an open architecture warehouse management system. On top of the open lceberg data lake format, it provides more optimization for flow and update scenarios, as well as a set of pluggable data self optimization mechanisms and management services. Background lake data and data warehouse are both common

apache ams benchmark big data github hudi

2023/04/23 11:49

3.2K

Elasticsearch is widely used in the production environment. This paper introduces a method based on NetEase's open source Curve file storage to achieve significant improvement in storage cost, performance, capacity and operation and maintenance of Elasticsearch. Four Benefits of ES Using CurveFS

2023/01/12 10:17

4.1K

Loggie sprouted from the actual needs of Netease's strict selection business, grew from the long-term co construction of strict selection and Dofan, and continued to develop in the close cooperation between Netease Dofan, Netease Media and ICBC. The extensive ecology enables the project to be constantly improved and mature based on business needs. Already

interceptor github apache flume

2023/01/11 14:43

2.9K

Background Yangzhou Wanfang Technology Co., Ltd. is mainly engaged in the scientific research and production of communication, computers and servers, intelligent vehicles, basic software and other products. It is a national high-tech enterprise, a small giant enterprise specializing in new technologies, and a unit undertaking the National Torch Plan. Business Introduction Shenwei Processor

2022/12/23 10:53

1.8K

Curve is the Sandbox project of the Cloud Native Computing Foundation (CNCF), which is an open source high-performance, easy to operate and maintain, cloud native distributed storage system initiated by Netease Digital Fan. In order to make it easier for everyone to use and understand Curve, we look forward to the next series of application practice articles

curve session epoll login iscsitarget fio migrate performance optimization https Cloud primordial event_handler Open Source

2022/12/02 14:08

1.5K

With the promotion of national industrial upgrading and the maturity of cloud native technology, the multipoint DMALL big data technology has also undergone structural adjustment and change from the integration of storage and computing to the separation of storage and computing. This article will describe the process of this exploration practice from the perspective of introducing Kyuubi to realize unified SQL proxy

apache spark big data hive impala sentry

2022/11/25 12:19

1.7K

Business background: Chuangyun Rongda is a high-tech enterprise focusing on the storage and management of massive data, based on the enterprise level private cloud construction capability, and providing data assets and data midrange products and solutions. In recent years, in order to optimize people's service experience of paying taxes, provinces

2022/11/24 09:42

1.4K

Curve file storage is a POSIX compatible distributed file system, which is suitable for private cloud, public cloud and hybrid cloud environments. We can easily access 10 billion level files through Curve file storage. First, give a brief introduction to the architecture of Curve file storage. File

2022/11/11 10:15

2.3K

In the actual big data business of NetEase Media, there are a lot of quasi real-time computing demand scenarios, and the business side's requirements for data effectiveness are generally minute level; In this scenario, the traditional offline data warehouse solution cannot meet the user's requirements in terms of effectiveness. Instead, the full link solution is used

apache flink hive apache spark ams watermark impala

2022/11/09 09:30

4.6K

01 Background With the rapid development of B station business in recent years, the amount of data continues to increase. The scale of offline computing cluster has grown from the initial 200 to nearly 10000, and from single room to multi room architecture. At present, we mainly use Spark, Presto, H

kyuubi engine adhoc spark yarn executor session big data scala Computing engine presto dispatcher

2022/10/27 11:34

5.1K

Background With the development of big data business, Hive based digital warehouse system is gradually unable to meet the growing business needs. On the one hand, there are a large number of users, but it is seriously lacking in real-time and functionality; On the other hand, systems like Hudi and Iceberg are transactional

kafka apache flink apache hudi github ams apache spark apache ranger big data hive

2022/10/26 09:51

1.8K

No more

Loading failed, please refresh the page

© OSCHINA(OSChina. NET)

Ministry of Industry and Information Technology

Open Source Software Promotion Alliance

Designated official community

Community norms

Copyright Shenzhen Aosi Network Technology Co., Ltd

Yue ICP Bei No. 12009483

Top