This week at Dozer #1

Weekly at Dozer Series

从本周开始,Dozer开始有开源周报啦!大家可以据此了解我们的进展。如有任何反馈或其他希望看到的内容,随时与我们联系~

这是一个重大更新,我们发布了几个正在开发中的功能。

Dozer is now available on homebrew and deb install.

# Mac
brew tap getdozer/dozer
brew install dozer

# Ubuntu
curl -sLO https://github.com/getdozer/dozer/releases/download/latest/dozer_linux_x86_64.deb \
&& sudo dpkg -i dozer_linux_x86_64.deb

Release v.0.1.11

Dozer v.0.1.11 is avaiable. Checkout the release notes here.

Additional DateTime operations

Based on feedback, we extended additional date time capabilities. Allowing to manipulate and work with dates better.

Extract date part from date and time #1178

select extract(timezone from last_update) from actor;
select extract(year from last_update) from actor;
select extract(hour from last_update) from actor;
select extract(month from last_update) from actor;

Timestamp difference #1074

Duration is extracted as i64 when timestamps are subtracted.

select (date1 - date2) from logs;

Window Capability #1175 (Alpha)

Now Dozer supports window capabilities as an alpha feature.

Hopping Window

SELECT taxi_id, completed_at, window_start, window_end
FROM HOP('taxi_trips', 'completed_at', '1 MINUTE', '2 MINUTES')

Tumbling Window

SELECT taxi_id, completed_at, window_start, window_end 
FROM TUMBLE ('taxi_trips', 'completed_at', '2 MINUTES');

Refer to this issue for further information #893.

Performance Improvements

We have been able to increase performance by an order of magnitude and have introduced several enhancements.

  • Pipeline indexes in memory #1084
  • Remove nested txns in LMDB #1084
  • Optimize cache insertion and query when schema is append only #1192

Perfomance improvements are an ongoing effort and we will be sharing more in the coming weeks.

Schema Evolution v1 (Alpha)

Dozer’s main use case is to be a very fast cache + API layer. So between restarts, data is not retained, just like Redis. We are currently testing blue/green cache functionality where API upgrades are seamless with zero downtime.

  • Blue Green Cache #1061
  • Automatic Switch based on no of records #1092

Ingest using Arrow Format

Arrow format is commonly used in data analysis and for cross language support. Now developers can easily ingest arrow format (i.e. from Polars or Pandas). This is our initial integration with Arrow. We love where arrow format is heading and stay tuned for further updates.

  • Implement arrow format for grpc ingestion #1087

Delltake Connector (Alpha)

Dozer now has a connector for Deltalake leveraging deltalake-rs RUST library.

Other Minor Updates

  • Show sources ingestion progress #1079
  • Parallelized ingestion of Postgres snapshot data #1094

What Next?

  • Robust Data Type Handing & Stability
  • Performance improvements
  • Build various samples to showcase Dozer

New Contributors

We are happy to see growing interest in Dozer. We welcome any contributions and are very thankful to help our community grow.

  • @readall made their first contribution in #1122
  • @hoangnh93 made their first contribution in #1035

Changelog

https://github.com/getdozer/dozer/compare/v0.1.10...v0.1.11

Contact us

评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值