top of page

Big data in Japan Edition



What's Hot 🍵 

Leader's Spotlight 🌸 OpenAI

Buzzword - data encryption 🗝

Anime's Big Data 🐹

Uncharted Path - Japan 🛳



Monday, June 24, 2024


Note from the editor


Hi everyone and happy Monday. I am delighted to introduce the revamped Dose of Data Clarity newsletter!


With this new approach, our emphasis will shift from corporate data and tech news to delving deeper into the ongoing discussions within the data realm. Our goal is to dissect these conversations to offer our readers additional context, equilibrium, and clarity.


Your feedback is always valued as I strive to enhance this newsletter. Please share your thoughts on how things are progressing!


Warm regards,

Dave


Oracle to Invest More Than $8 Billion in Cloud Computing and AI in Japan

“We are dedicated to meeting our customers and partners where they are in their cloud journey,” said Toshimitsu Misawa, member of the board, corporate executive officer and president, Oracle Corporation Japan. “By growing our cloud footprint and providing a team to support sovereign operations in Japan, we are giving our customers and partners the opportunity to innovate with AI and other cloud services while supporting their regulatory and sovereignty requirements.” (Source:oracle.com)


Fujitsu and Oracle Collaborate to Deliver Sovereign Cloud and AI Capabilities in Japan

Kazushi Koga, SEVP, Fujitsu Ltd., said, “Fujitsu has been working with partners who have strengths in their respective fields to solve customers’ challenges as part of its Fujitsu Uvance Hybrid IT offerings. Our collaboration with Oracle positions us to deliver a sovereign cloud offering that enables hyperscale functionality and digital sovereignty capabilities while ensuring operational governance by Fujitsu.” (Source: oracle.com)


Microsoft to invest US$2.9 billion in AI and cloud infrastructure in Japan while boosting the nation’s skills, research and cybersecurity

Miki Tsusaka, President, Microsoft Japan 

“We are honored to contribute to Japan and its future with our largest investment to date, technology and knowledge. In collaboration with our partners, Microsoft Japan is fully committed to supporting the people and organizations of Japan to solve social problems and achieve more.”  (Source: news.microsoft.com)


Leader's Spotlight 🌸


Tadao Nagasaki, OpenAI

[April 16, 2024] "We want to build a track record through repeated dialogue with companies in Japan,” said Tadao Nagasaki, the newly named Japan president for OpenAI, during a news conference Monday. The Tokyo office — OpenAI’s third overseas outpost following offices in London and Dublin — will grow to about 10 to 20 workers this year, he said. (Source: japantimes.co.jp) 



Beyond the Buzzwords 🗝

Encryption is a way of scrambling data so that only authorized parties can understand the information. In technical terms, it is the process of converting human-readable plaintext to incomprehensible text, also known as ciphertext. In simpler terms, encryption takes readable data and alters it so that it appears random. (Source: Cloudfare.com)



 data encryption 



Interested in more? Take a look at The Mathematical Theory of Cartography - Case 20878 (declassified), Author: C.E. Shannon, September 1, 1945

(Source: iacr.org)



Big Data Anime 🐹


[MyAnimeList] Started externally providing big data of 18 million overseas users who like Japanese anime and manga

One of the world's largest Japanese anime and manga communities and databases, with 18 million registered members, users from 240 countries and regions around the world, and over 1.2 billion viewing/reading big data. Launched in 2004, it has become an important source of information for overseas fans, especially Generation Z in North America. (Source: prtimes.jp)



Uncharted Path 🛳


Using social media cherry blossom images and AI to track climate patterns

"Through this new approach we are able to accurately map large scale patterns across broad distances and also gain detailed perspectives on subtle flowering patterns."

The researchers collected images from Flickr that were uploaded by users tagged as "cherry blossoms." Computer vision and AI algorithms were applied to the images to filter out those that were irrelevant. (Source: physics.org)


ree

June 2023, Monash University 


Want this weekly dose of data clarity delivered to your inbox?





Our Data Assisted World


What's Hot 🔥

Google Cloud Next 2024 🌤

Buzzword - data repository 📦

Fed Gov & AI 🔬

Uncharted Paths - Roterdam 🛳



Thursday, April 11, 2024

What's Hot 🔥


Snowflake previews genAI-based SQL Copilot

Snowflake today announced the public preview of its generative artificial intelligence (genAI) powered SQL assistant — Snowflake Copilot, with limited availability in Amazon Web Services (AWS) US regions. It is designed to make complex SQL queries accessible through simple natural language prompts.


  • Data exploration: Copilot allows users to inquire about their data’s structure, refine their analysis with follow-up questions and gain deeper insights, without writing complex queries.

  • Intelligent corrections: Beyond generating queries, Copilot helps users to write cleaner and more efficient SQL queries by suggesting query optimizations, providing explanations and recommending fixes for potential issues in existing queries, streamlining the data analysis process.

  • Snowflake documentation: Copilot can answer queries about Snowflake documentation, for example, it can help look for a specific function or existing features and capabilities. (Source: SDXcentral.com)



4 Key Takeaways From CEO Andy Jassy's Letter to Amazon Shareholders

Amazon Chief Executive Officer (CEO) Andy Jassy released his annual letter to Amazon (AMZN) shareholders Thursday, covering a range of topics across its business from the emergence of artificial intelligence (AI) to the benefits of regionalizing its warehouse network. (Source: investopedia.com)





Informatica Expands Partnership with Google Cloud, Launches Extension for Trusted Customer Data Analytics and Enterprise Gen AI Applications


"By collaborating with Google Cloud, a trailblazer in AI innovation, we're empowering our joint customers with a trusted data foundation for their generative AI applications and Customer Data Platform on Google Cloud.


The MDM Extension for BigQuery makes it fast and easy for customers to enhance their enterprise data foundation in BigQuery with Informatica's industry-leading Master Data Management, said Rik Tamm-Daniels, Group Vice President of Strategic Ecosystems and Technology at Informatica.


We're excited to see how our customers will leverage these new capabilities to transform their businesses, making real-time, data-driven decisions easier than ever before." (Source: investing.com)


Beyond the Buzzwords 📦

Data repositories are platforms that hold data, organize it in a logical way, and can make it available for reuse. They are used by research communities to share and discover data. (Source: utoronto.ca)



data repository



Different data repositories offer different services and functions. Some things to consider before selecting a repository include:


  • Funder or journal requirements

  • Disciplinary research data

  • Persistent identifiers (PID)

  • Access restrictions 

  • Data licensing

  • Cost 

  • Retention and preservation

  • Ability to update dataset

  • Curation services




Federal government use of AI in hundreds of initiatives revealed by new research database 🔬

Joanna Redden, an associate professor at Western University in London, Ont., pieced together the database using news reports, documents tabled in Parliament and access-to-information requests.


Of the 303 automated tools in the register as of Wednesday, 95 per cent were used by federal government agencies. "There needs to be far more public debate about what kinds of systems should be in use, and there needs to be more public information available about how these systems are being used," Redden said in an interview. (Source: cbc.ca)


Uncharted Path - Roterdam NL 🛳


ree

Roterdam's smart shipping has several project in the works that are creating a digital twin of the port to enable better data analytics insights. 

Here's a sample:




  • HaMIS: The Port Management Information System (Haven Management Informatie Systeem, HaMIS) is the central system for the administrative settlement, guidance and inspection of port calls. 

  • PortMaps: PortMaps is the digital version of Rotterdam’s geographical chart of the port area. 

  • Smart Infra: Fitting the port infrastructure with sensors and data communications devices (Internet of Things) has yielded a mass of useful information about, for example, the duration and intensity of a specific object’s use. 

  • Floating Lab: The Port Authority has fitted one of the Harbour Master Division’s former harbour patrol boats (RPA3) with an array of cameras, measuring devices and sensors for the development and testing of new applications in the field of smart shipping and autonomous shipping. 

  • (Source: portofroterdam.com)





Want this weekly dose of data clarity delivered to your inbox?






Great Visuals with Data


What's Hot 🍜

Great Visuals w/ Data 🎞

Buzzword - Data Interoperability 📲

Uncharted Path 🛳


Thursday, April 4th


What's Hot 🍜


AXA and AWS Developing the First Global B2B Risk Management and Prevention Platform


Scott Gunter, CEO, AXA XL

"Globally clients are grappling with extreme weather, cyber-attacks and other shocks and disruptions. We believe that building resilience is more essential than ever. That is why we are excited to be working with AWS, combining their tech capabilities with AXA’s expertise in commercial insurance to develop the next generation of risk management insights and services to help clients unlock their full potential".


Kathrin Renz, VP of AWS Industries

“AXA is a customer-focused business, and we are excited to collaborate with them to develop new business models, using AWS capabilities, that will help companies around the world operate with greater confidence. AWS will help AXA support its business clients, creating secure, compliant new services and capabilities fueled by advanced technologies like generative AI. Amazon’s experience in building, operating, and scaling marketplaces will also help AXA tap into innovation from companies beyond the financial services industry, offering more ways to plan for uncertainty.”  (Source: axa.com)



Great Visuals w/ Data 🎞


Amazon Web Services Introduces Pay As You Go Cloud Rendering Service For Deadline-Sensitive VFX and CG Work

Antony Passemard, general manager of creative tools at AWS, notes that VFX rendering might take “a few hours per frame on large complex frames for a movie, for example.” A full VFX-heavy scene, therefore, “could be many days of rendering, or many, many, many machines are working in parallel. Even when you’re not rendering, you’re still paying somewhat of an underlying infrastructure cost. With Deadline Cloud you only pay for when you render. When you have downtime in your production, it costs you zero.”

(Source: variety.com)


IBM watsonx Brings New Generative AI Capabilities to Masters Tournament Digital Platforms

  • Data-driven recaps of how each hole has played daily and throughout the 2024 Tournament (e.g., "The 14th hole has played difficult today, with 25% of shots resulting in bogies.").

  • Projections of how each hole might play, based on past and current performance data (e.g., "The 9th hole is projected to be the third most difficult hole today.").

  • Historical insights into how each hole has played, based on eight years of Tournament data – including more than 170,000 shots – and ball position on course (e.g., "Shots historically hit in this location have an 82% chance of resulting in a birdie."). (Source: newsroom.ibm.com)


Scientists Use NASA Data to Predict Solar Corona Before Eclipse

"We developed a software pipeline that took in the magnetic field maps, picked out all of the areas that should be energized, and then fine-tuned the amount of energy to add to those areas,” Emily Mason, a research scientist at Predictive Science.

(Source: science.nasa.gov)


Beyond the Buzzwords 📲

This week's buzzword was inspired by the following article, The Next Thing to Look For in AI Vendors: Interoperation


 data interoperability


Data interoperability refers to the ways in which data is formatted that allow diverse datasets to be merged or aggregated in meaningful ways. It is a key aspect of the FAIR Data Principles, constituting the “I” in FAIR. (Source: nnlm.gov)


  • Findable.

  • Accessible.

  • Interoperable.

  • Reusable.


Uncharted Path 🛳

Exploring Location Data Using a Hexagon Grid


A comprehensive guide on how to use Uber’s H3 hexagon grid in data analysis

In this article, we will use Helsinki city bike data to demonstrate how one can utilise H3 hexagons to analyse spatial data. First, we provide an introduction to the H3 hexagon grid and its resolutions. Next, we delve into the main functionalities of the H3 library. Following that, we illustrate how a hexagon grid can enhance data analysis. Finally, we address some issues associated with hexagonal grids. All the notebooks used in this analysis can be found on this GitHub repository. All images in this article, unless otherwise noted, are by the author. (Source: towardsdatascience.com)



Want this weekly dose of data clarity delivered to your inbox?





fuse data logo
bottom of page