《SODA Foundation:2022年数据和存储趋势报告(英文版)(43页).pdf》由会员分享,可在线阅读,更多相关《SODA Foundation:2022年数据和存储趋势报告(英文版)(43页).pdf(43页珍藏版)》请在三个皮匠报告上搜索。
1、Data and Storage Trends 2022Data and Storage Strategies in the Era of the Data-Driven EnterpriseDecember 2022Foreword by Rakesh Jain TOC Co-Chair,SODA Foundation,and Senior Technical Staff Member,IBM ResearchIn partnership withContentsInfographic:Top 12 trends in data and storage.3Foreword.4Introduc
2、tion.5Current data and storage requirements.6Data analytics and databases lead production workloads.6Cloud container services significantly lead in production infrastructure,followed by cloud VMs.7Storage technologies are in transition from traditional to cloud-based.8Types of storage interconnects
3、in use.9Projected data growth increases significantly for organizations with 1PB or more.1093%of organizations use open source in production.12End-user organizations gravitate to heterogeneous storage vendor relationships.13Cost,performance,reliability and quality lead top storage vendor attributes.
4、14Containerization plans.15Organizations are quickly adopting containers in production.15NAS file storage is the most common storage type for container workload deployment.16Oracle,SQL Server,and SAP are the top three traditional workflows to containerize.16Security and heterogeneous environments ar
5、e the main challenges for adopting containers in production.17Cost,performance,and management drive container rollbacks.18Approaches to cloud data and storage.1991%of organizations deploy workloads in public cloud.1965%of organizations use 1 or more clouds for their data storage.19Use cases for clou
6、d storage services.20Data security and privacy drive use of private cloud.20Flexibility is the top reason for multi-cloud deployments.22Data security and protection are top multi-cloud challenges.23Vendors and service providers preferred for cloud transitions.24Future perspectives on data and storag
7、e.25Future data storage investment areas.25AI-and metadata-driven capabilities lead the future of data management and analytics.25Future metadata management priorities.27Observability-based solutions(remote or local)are much in demand.28Top five challenges for storage observability.29The impact of o
8、pen source on data and storage.31OSS can help improve quality,reliability,security,costs,and promote collaboration.31Organizations plan to use SODA OSS projects to support multi-cloud environments,interoperability,monitoring,and containers.31Non-SODA data and storage project use.33Conclusions.36End-
9、user organizations continue their journey to the data-driven enterprise.36End users embrace a hybrid and multi-cloud future.36Private clouds excel at information security and data privacy.36Open source software is uniquely positioned to address data and storage requirements of hybrid and multi-cloud
10、 needs.37Methodology.37Demographics.38About the author.41Acknowledgments.41Disclaimer.42Copyright 2022 The Linux Foundation|December 2022.This report is licensed under the Creative Commons Attribution-NoDerivatives 4.0 International Public LicenseInfographic:Top 12 trends in data and storageIn 2022,
11、the growth in data was 3 times higher than in 2021.Data security is the greatest challenge facing container deployments.56%of end users deploy open source multi-cloud management in their production environments.43%of end users demand the freedom to leverage multiple storage vendors.Public clouds run
12、 more than 40%of end-user organization workloads.Primary data storage,complete data protection,and disaster recovery represent the top 3 use cases for cloud storage services.Information security and data privacy are the leading reasons to use a private cloud solution.The biggest challenge facing mul
13、ti-cloud solutions is the security and protection of data.Cloud technologies represent the most significant area of data and storage technology investment over the next three years.AI-driven hybrid data management is considered the most critical area for data management and analytics over the next 2
14、-4 years.Data quality,governance,and security are top priorities when selecting metadata management solutions.Cloud storage monitoring is the greatest challenge facing data and storage observability.4DATA AND STORAGE TRENDS 2022ForewordEnterprises today want real-time,consistent,con-nected,and trust
15、ed data to support their critical business operations and insights.Any delay in the availability of data can have a negative impact on businesses.Ever-expanding data volumes,new gov-ernance requirements,data silos across clouds and on-premises,etc.,can cause enterprises to slow down on their data st
16、rategy and create business challenges.Therefore,data storage and protection systems remain critical to managing IT infrastructure.Cloud native technology brings new challenges and opportunities to the storage world.Data movement from on-premises to the cloud or between the clouds,immutable snapshot
17、requirements due to ransom-ware attacks,edge computing,machine learning,AI,and 5G need to connect and collect everything.Data governance laws bring interesting use cases from a storage perspective and demand changes in storage platforms and operational models.SODA Foundations objective is to bring a
18、ll open source data and storage efforts under one umbrella.SODA Foundation has many goals,including building solutions for end users,integrating with other open source projects,standardizing data management,and obtaining deeper sector insights to keep our projects aligned with the industry trends.We
19、 have seen more companies joining the SODA Foundation,either as full members or in supporting roles,end users,or part of the ecosystem.With the move to the cloud continuing,application modernization,and related challenges including hybrid and multi-cloud adoption and regulatory com-pliance requireme
20、nts,we want to ensure we address the right priorities in the near term.To accomplish these goals,every year,we conduct a comprehensive survey of the current data and storage landscape and the role open source plays in it.This report is the culmination of the 2022 Data and Storage Trends survey condu
21、cted in partner-ship with the Linux Foundation Research team.To expand our reach across different domains,we invited other open source communities,such as Cloud Native Computing Foundation(CNCF),Storage Networking Industry Association(SNIA),Open Infrastructure Foundation,Storage Performance Council,
22、Japan Data Storage Forum,China Opensource Cloud League,Mulan Opensource Community,and others,to participate in the dis-tribution of the survey.Without their support,this report would not be possible.In terms of innovation,every technology goes through a hype cycle.Currently,computational storage,imm
23、utable data vault,container backup,and container-native storage are at the peak of inflated expectations,while hybrid and multi-cloud storage are gaining expectations.This survey shows that data analytics is the leading production workload.While some findings of this report,such as the rise of cloud
24、 native and hybrid cloud,align with the visible trends,we see that the organizations plan to use open source software,most notably for multi and hybrid cloud data management.We also wanted to re-evaluate how businesses are dealing with open source software.Open source code is prevalent in software p
25、ackages,from business applications to network and server pro-cesses.According to a recent study(2022 Synopsys Open Source Security and Risk Analysis),open source code running in software is at an all-time high.Often,enterprises are unaware of the use of open source code in their software because it
26、is deeply embedded,and they dont have the inventory of the open source code in it.This causes problems related to policies,licenses,vulnerabilities,and versions.The recent Log4j vulnerability is an interesting example of that.Despite these issues,we see from the survey results that the top reason co
27、mpanies adopt open source software is quality,reliability,and security.We hope this work will help guide the technology and business leaders in their decision-making and stra-tegic approaches.We would like to thank the Linux Foundation Research team for assisting in this crucial research,our survey
28、partners,and all SODA founda-tion members who helped develop and participate in the survey and other aspects of this report.Rakesh Jain TOC Co-chair,SODA Foundation Senior Technical Staff Member,IBM Research5DATA AND STORAGE TRENDS 2022IntroductionThe SODA Foundation is an open source project under
29、the Linux Foundation that fosters an ecosystem of open source data management and storage software for data autonomy.SODA offers a neutral forum for cross-project collaboration and integration and provides end users with quality end-to-end solutions.In July 2022,the SODA Foundation,in partnership wi
30、th Linux Foundation Research,launched a worldwide survey to understand evolving data and storage trends.The SODA Foundation conducted the survey in English,Chinese,and Japanese-speaking markets to identify current data and storage strategies,reliance on cloud services,container adoption,workloads,ch
31、allenges,and data and storage strategies going forward in the era of the data-driven enterprise,cloud native technologies,Edge,IoT,AI,and 5G.This survey data intends to guide end users and vendors on critical issues,equip them to make decisions,improve their products,and assist the SODA Foundation i
32、n establishing new technical directions.The data in this report is an analysis of the 2022 SODA data and storage trends survey.For information about the survey methodology and survey demographics,please refer to those sections toward the end of this report.The analysis in this report generally focus
33、es on end-user findings.Figures 1-21 and 23-26 in this report show just end-user data.Figures 22 and 27-31 show both end-user data and IT vendor and service provider data.6DATA AND STORAGE TRENDS 2022Current data and storage requirementsData collection,persistent data storage,and data consumption ar
34、e core activities of every organization or company.They are intrinsic to how these organizations demonstrate and increase the value they provide.This chapter examines workloads,storage activities,data growth,and how organizations choose storage vendors.Data analytics and databases lead production wo
35、rkloadsUnderstanding production workloads is a crucial step to under-standing how organizations need to approach data and storage.FIGURE 1 shows the leading production workloads in place today.Note that this is a multiple-response question that asks respon-dents to select their top three workloads.T
36、he recommended way to interpret this question is to focus on the leading workloads because all of these are common organizational workloads.Data analytics at 52%and database(data management)at 46%were the top two workloads.The importance of data analytics is testimony to the importance of becoming a
37、 data-driven enter-prise.This is because data analytics is about analyzing data to make informed decisions that drive astute and calculated actions.The subject of data analytics also casts a wide net that can include data analytics,database,big data,BI,AI,metadata,and remote FIGURE 1LEADING PRODUCTI
38、ON WORKLOADSWhat are the top 3 workloads in your production environment?(select between one and three responses)52%46%29%26%24%23%21%15%13%11%2%2%1%3%Dont know or not sureOther(please specify)HPCAIOpsRemote monitoringMetadata processingTest and development toolsAI&MLCloud-native appsBusiness intelli
39、gence&applicationsBig dataWeb applicationsDatabasesData analytics2022 SODA DATA&STORAGE TRENDS,Q15,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=4837DATA AND STORAGE TRENDS 2022processing,so respondents may be equating data analytics to a higher-order construct that includes elements of other workl
40、oads presented.Data analytics demonstrates the importance of being data-driven and using mathematical techniques to address descrip-tive,diagnostic,predictive,and prescriptive analytics.Database continues to be an essential workload because of its focus on enabling systems of record,data lakes,and u
41、nparalleled support for transaction processing.Consequently,database management systems will always be a top production workload.Data analytics and data management collectively provide a way to manage transac-tions,create systems of record,and understand how best to extract insight from data.This is
42、 a hallmark of a data-driven enterprise.The second tranche of workloads that are closely clustered together includes web applications at 29%,big data at 26%,business intelligence at 24%,cloud native applications at 23%,and AI/ML at 21%.While these technology areas overlap with database and data anal
43、ytics,the focus on these workloads indicates an emphasis on web-based and cloud native application development.Web application development continues to be top of mind given the acceleration of digital transformation activities in 2020 due to the COVID-19 pandemic,the v2.0 draft release of web assemb
44、ly(WASM)in 2022,and the persistent investment in AI/ML tools in areas like NLP.These factors will improve how the development and engineer-ing of web applications respond to user needs more intelligently.Cloud container services significantly lead in production infrastructure,followed by cloud VMsFI
45、GURE 2 looks at various infrastructural deployment strategies that assess the importance of cloud computing,containers,and Dont know or not sureOther(please specify)Edge(Akamai Edge Platform,EdgeX Foundry,KubeEdge,k3s,SAP Edge,StarlingX,Edge Service from Cloud Vendors)Multi-cloud(use multiple cloud
46、service providers)Container deployment on-premises that are Kubernetes-based(OpenShift,Tanzu,Rancher,PKS,Anthos GKE etc.)Virtualization on premise(VMware,OpenStack,Hyper-V etc.)Hybrid multi-cloud(on-premises and use of multiple cloud service providers)Cloud VMs(AWS EC2,Azure VM,Google Compute Engine
47、,OpenStack)Cloud container services-Kubernetes svcs from cloud providers(Amazon EKS,Google GKE,Azure AKS,Amazon ECS,Google cloud run,Azure ACI etc.)69%42%39%37%37%28%23%1%3%FIGURE 2LEADING PRODUCTION INFRASTRUCTUREWhat are the different infrastructure deployments in your development or production en
48、vironment?(select all that apply)2022 SODA DATA&STORAGE TRENDS,Q13,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=501edge computing.Containerization benefits include the efficient use of resources,faster instance creation and destruction,improved scal-ability,and operational simplicity.The unmistaka
49、ble finding in FIGURE 2 is the widespread use of cloud container services in productionan approach used by 69%of end-user organizations.There is also a trickle-down effect that shows 37%of end-user organizations are also doing container deployments on-premises.This demonstrates the strong value prop
50、osition of containers.The ongoing migration to the cloud is also evident in FIGURE2.Hybrid multi-cloud use at 39%is strong and more appealing than deploying a multi-cloud strategy(28%);however,remember that 46%of the sample are end-user organizations,and 60%of end-user organizations are large or ent
51、erprise-level organizations.Larger organizations are more likely to have hybrid needs due to their on-premises roots.Larger orga-nizations are also more interested in multi-cloud solutions to avoid lock-in and mitigate risk.Storage technologies are in transition from traditional to cloud-basedFIGURE
52、 3 provides several views into how organizations use storage technology.Key data storage attributes include forms of data storage,types of data storage,scalability,redundancy,performance,and cost.59%of end-user organizations use public cloud storage.This is a tes-timony to the importance and success
53、 of general-purpose storage services,such as Amazons S3.The importance and widespread use of cloud storage accompanies strong support for technologies such as software-defined storage(SDS),where data storage is decoupled from the underlying hardware platform for greater flexibility and scal-ability.
54、Thirty-eight percent of end-user organizations identified SDS in FIGURE 3.The hardware independence of SDS provides flexibility and reduces cost by eliminating proprietary hardware and software elements of traditional NAS and SAN solutions.SDS is an on-ramp for hyper-converged storage.FIGURE 3 shows
55、 that just 11%of end-user 8DATA AND STORAGE TRENDS 202259%59%38%26%23%21%18%18%11%9%6%1%3%Dont know or not sureOther(please specify)TapeOptical mediaHyper-converged storageFlash technologyPersistent memory(Intel Optane,NVDIMM etc.)Block storageStorage as a Service(STaaS)Object storageSoftware define
56、d storage(SDS)File storagePublic cloud storage(AWS,Azure,Google Cloud etc.)2022 SODA DATA&STORAGE TRENDS,Q16,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=527FIGURE 3TYPES OF STORAGE IN USE What are the types of storage in your infrastructure?(select all that apply)9DATA AND STORAGE TRENDS 2022orga
57、nizations use it.While hyper-converged storage provides even greater flexibility and potential for cost savings on storage,it adds complexity and expense back through increased organizational planning.Storage as a service(STaaS)at 23%presents another interesting take on cloud storage by adding an em
58、phasis on delivering storage as a managed service both in the cloud and on-premises.The advantage of STaaS is its ability to blend cloud-based and on-prem-ises solutions while enabling an organization to sidestep a signifi-cant capital expense and better manage risk more seamlessly in a domain where
59、 technology and price points are changing rapidly.File storage will always be important because of its key role in both orchestrating and leveraging operating system activities.FIGURE3 communicates this by the 59%of end-user organizations who identified file storage as part of their storage infrastr
60、ucture.File system data is immensely important and is therefore also expe-riencing a transition to being cloud-based where it can be better managed and secured.Types of storage interconnects in useFIGURE 4 shows the types of storage interconnects used by enter-prises.At 42%,Fiber Channel tops the li
61、st,followed by iSCSI.This is the traditional approach,which continues to lead even today.But what is interesting is that the FC-NVMe has taken the third spot.NVMe over Fabric is an industry standard that enables low latency access to NVMe-based shared storage arrays providing access across a switche
62、d fabric to high-performance NVMe-based storage with the same latencies as local NVMe-based solid-state disks across a switched fabric.NVMe over Fabric has been available 42%36%29%28%22%21%19%18%17%12%1%11%Dont know or not sureOther(please specify)SMB over X(X=TCP,RDMA.)Custom clients(like Lustre,GP
63、FS,etc)NFS over X(X=UDP,TCP,RDMA.)S3 or SWIFTRDMA over Converged Ethernet(RoCE)NVMe over X(X=Fabric,TCP.)InfiniBand over Ethernet(IBoE)Non-volatile memory express over Fibre Channel(FC-NVMe)Internet small computer system interface(iSCSI)Fibre Channel protocol related(FCP,FCIP.)2022 SODA DATA&STORAGE
64、 TRENDS,Q17,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=459FIGURE 4TYPES OF STORAGE INTERCONNECTS IN USEWhich storage interconnect types do you use or plan to use in your infrastructure?(select all that apply)10DATA AND STORAGE TRENDS 2022across Fiber Channel,Ethernet,and InfiniBand transports.Th
65、e adoption of FC-NVMe is due to what is available in the market today and in the existing infrastructure of the enterprises.However,NVMe over Ethernet provides the same performance advantages with a solution that is easier and less expensive to deploy;therefore,the industry will see strong growth in
66、 NVMe over Ethernet in the coming years,as it has already reached 22%.FIGURE 4 also shows that legacy interconnect types,such as InfiniBand,RDMA,and NFS,continue to play important roles as methods to support data transfer.Projected data growth increases significantly for organizations with 1PB or mo
67、reAs we saw in FIGURE 1,the top two production workloads included data analytics and database.These workloads are data-centric and confirm explosive growth in data that continues to occur across organizations.This year,a“shift right”phenomenon is evident.FIGURE 5 shows the approximate amount of data
68、 growth by category and compares the findings for 2021 and 2022.In 2022,the distribution peaks in the 100TB-1PB category and shows 200+%growth in both 1-10PB and 10+PB categories compared to 2021.Correspondingly,2022 data growth in the 1-10TB and 10-100TB categories is down by more than 50%compared
69、to 2021.This is a significant change,and this change has happened in just one year.FIGURE 5 also shows that just 33%of end-user organizations are thinking of data growth in the tens of TBs compared to 59%of organizations who are forecasting data growth in the hundreds to thousands of TBs.This sugges
70、ts that most end-user organizations are expecting exponential growth in data storage requirements.Modeling the annual data growth in 2021 and 2022 using the mid-points of each range category enables us to understand the overall difference in data growth between the two years.The primary challenge in
71、 modeling data growth is that the top category is“More than 10PB.”Selecting a range category midpoint requires an upper limit.The left panel in FIGURE 6 uses an upper limit of 20PB for the 10+PB category to estimate data growth.We have not shown all the categories from FIGURE 5 because the categorie
72、s for less than 10TB simply do not generate material data volume,and graph-ically,it is impossible to easily see.However,we have included these smaller categories in our computation of actual data growth.The vertical scale of the left and right panels is identical,allowing for consistent visual exam
73、ination.Dont knowor not sureMore than10 PB1 PB to10 PB100 TB to1,000 TB10 TB to100 TB1 TB to10 TBLess than1 TB4%5%32%15%30%13%18%28%4%14%5%17%7%8%End-user organizations 2021End-user organizations 20222022 SODA DATA&STORAGE TRENDS,2022 Q12,SAMPLE SIZE=193.2021 Q7,SAMPLE SIZE=97.FIGURE 5AMOUNT OF DATA
74、 GROWTH BY CATEGORY IN 2021 AND 2022How much is the approximate data growth per year for your organization?(comparison of 2021 and 2022 results)11DATA AND STORAGE TRENDS 2022The right panel in FIGURE 6 shows two scenarios for how much data growth could occur for an average end-user organization.On t
75、he right panel,the left scenario presumes an upper limit of 20PB and shows annual data growth of 566 TB for an average end-user organization in 2021 and 1,746 TB in 2022.The right scenario presumes an upper limit of 25PB and shows annual data growth of 700 TB for an average end-user organization in
76、2021 and 2,208 TB in 2022.For either scenario,the data growth in 2022 is just over three times the increase in 2021.This is explosive data growth by any measure,and end-user organizations should be preparing for data growth in the PBs.FIGURE 6ANNUAL DATA GROWTH IN 2021 AND 2022Average annual data gr
77、owth in terabytes(by category)Average annual data growth in terabytes(comparison of 2021 and 2022 results)ESTIMATED ANNUAL DATA GROWTH VOLUME IN TBS BASED ON2022 Q12 AND 2021 Q7.ESTIMATED ANNUAL DATA GROWTH VOLUME IN TBS BY CATEGORYBASED ON 2022 Q12 AND 2021 Q7.Data growth 2021Data growth 2022157871
78、36269924More than 10 PB(10PB-20PB)1 PB to 10 PB100 TB to 1,000 TB10 TB to 100 TB19467910PB-25PB upper limit10PB-20PB upper limitData growth 2021Data growth 20225661,7467002,20812DATA AND STORAGE TRENDS 202293%of organizations use open source in productionOpen source software is widely used across en
79、d-user organiza-tions,IT vendors,and service providers.Linux Foundation Research consistently shows open source use by 90 to 98%of organizations.The SODA Foundation,open source vendors,service providers,and independent developers are continuously contributing to products and services for open source
80、 data and storage.The design of the SODA Foundations open data framework projects connects application platforms and solutions to backend storage services either on-premise or in the cloud through a unified API layer.Key characteristics of this framework include being appli-cation platform agnostic,
81、providing a unified and scalable API for data and storage management,and having architecture that is microservice-based and vendor agnostic to storage backends.FIGURE 7 identifies where open source solutions can add value in production environments.At 56%,multi-cloud data management is the leading o
82、pen source use case among end-user organiza-tions.This use case is likely to become more common because no end-user organization will want to be beholden to just one cloud service provider.For example,open source multi-cloud software(such as Strato)abstracts the cloud service backends,making it easi
83、er to adopt multiple cloud service providers.Multi-cloud data management is closely followed by hybrid-cloud data management at 44%.Hybrid-cloud data management spans private(on-premises)and public cloud environments.Support for hybrid-cloud data management environments can be challeng-ing depending
84、 on the configuration of the private environment;however,large and very large end-user organizations are more apt to have private cloud environments and are likely to be vested in a hybrid solution,regardless of the complexity.FIGURE 7WHERE OPEN SOURCE SOLUTIONS ADD VALUE IN PRODUCTION ENVIRONMENTSW
85、here do you think you will deploy open source solutions in your production environment?(select all that apply)2022 SODA DATA&STORAGE TRENDS,Q38,SAMPLE SIZE=179,VALID CASES=179,TOTAL MENTIONS=405Dont know or not sureOther(please specify)AIOps platform for ITOMEdge data managementContainer data manage
86、ment(Kubernetes baseddata protection,data security,monitoring)Hybrid infrastructureobservability/remote monitoringHybrid-cloud data managementMulti-cloud data management56%44%36%30%28%25%1%6%13DATA AND STORAGE TRENDS 2022We also find that 75%of end-user organizations are pursuing multi-cloud or hybr
87、id-cloud data management environments as a focus of where they will deploy open source solutions.In domains including edge data management or AIOps,where rapid product development and growth are occurring,40%of end-user organi-zations will pursue open source solutions.End-user organizations gravitat
88、e to heterogeneous storage vendor relationshipsEnd-user organizations have a strong affinity for leveraging multiple storage vendors.FIGURE 8 shows that 75%of end-user organizations already use multiple storage vendors.The most com-pelling strategy across end-user organizations(with the exception of
89、 micro-organizations)is to use multiple storage vendors with one primary vendor and plan to add more vendors.End-user pref-erence for the strategy suggests a data management and storage domain is already highly heterogeneous and fragmented.Faced with a highly complex data management environment that
90、 will only become more complicated,end-user organizations require complex solutions that are up to todays and tomorrows data storage and management tasks.Because most end-user organizations are now in alignment with the most complex vendor selection strategy,is there an underlying maturity model tha
91、t explains the journey that end-user organiza-tions are on?The answer is yes.Fifty percent of micro-organi-zations(1 to 99 employees)use only one storage vendor,which contrasts with just 12%of very large organizations(10,000+employ-ees)that rely on just one storage vendor.For all but the smallest en
92、d-user organizations,flexibility and choice are paramount,and most end-user organizations focus on using multiple storage vendors.This is indicative of a market trend toward the preference by end-user organizations for storage vendor-agnostic solutions.End-user organizations are demanding“storage fr
93、eedom,”which is also a key objective of SODA Foundation activities.FIGURE 8STRATEGIES FOR CHOOSING STORAGE VENDORSWhich best describes your choice of storage vendors?(select one,end-user organizations only,segmented by company size categories)Other(please specify)9%23%59%5%6%9%14%27%14%14%3%24%27%18
94、%24%26%43%10%46%44%8%10%2%9%15%2%3%2%1%3%Total end-user organizationsMicro(1-99 emps)SME(100-1K emps)Large(1K-10K emps)Very Large(10K+emps)Multiple storage vendors with no primary vendorMultiple storage vendor with one primary vendor and planning to add more vendorsMultiple storage vendors with one
95、primary vendor and not planning to changeOnly one storage vendor and not planning to add other vendorsOnly one storage vendor and not planning to change2022 SODA DATA&STORAGE TRENDS,Q14(END-USER ORGANIZATIONS ONLY)BY Q8,SAMPLE SIZE=17814DATA AND STORAGE TRENDS 2022Cost,performance,reliability and qu
96、ality lead top storage vendor attributesWhen end-user organizations must decide on selecting a storage vendor,FIGURE 9 shows us that cost,performance,reliability,and quality are the leading requirement for 56%of end-user organiza-tions when selecting a storage vendor or service provider.These criter
97、ia repeatedly surfaced in this study as reasons to implement as well as reasons to roll back IT changes.Most end-user organiza-tions focused on obtaining the highest performance,reliability,and quality for a particular price point(cost).Since performance and cost are highly correlated,the decision o
98、ften comes down to who can provide the lowest price for a particular level of performance.Forty-five percent of end-user organizations see how a vendor or service provider addresses data security and compliance as a leading consideration.Security features,such as data encryption,identity,and access
99、management(IAM),where geographically the data is stored(GDPR)and how the data is stored(data redun-dancy),are all leading concerns to end-user organizations.End-user organizations are often willing to pay for innovation and are open to the adoption of new technologies in vendor solu-tions(and road m
100、aps).FIGURE 9 also shows that 41%of end-user organizations are open to adopting new storage technologies by vendors and service providers.This is especially true at the inter-section of hardware and software,where a variety of SODA and SDX projects focus on integration and interoperability.The reput
101、ation of the vendor(36%),vendor support and consult-ing(29%),and the size/composition of the vendor ecosystem(24%)influence end-user organization storage vendor decision-making.This is where open source standards and solutions will also overlap and influence business decision-making going forward.FI
102、GURE 9LEADING ATTRIBUTES WHEN SELECTING A STORAGE VENDORWhat are the top three attributes you consider in selecting a storage vendor?(select between one and three responses)Dont know or not sureGreen storage/environment friendlyInteroperability&open sourceVendor ecosystemQuick support and customizat
103、ionReputation and product familyAdoption of new technologiesSecurity complianceCost,performance,reliability and quality56%45%41%36%29%24%20%11%4%2022 SODA DATA&STORAGE TRENDS,Q18,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=47915DATA AND STORAGE TRENDS 2022Containerization plansAs mentioned earlie
104、r,containerization has become a standard practice in the IT industry.Containers are more portable,resource-efficient,and scalable than virtual machines(VMs),and many companies are moving toward this technology.A single lightweight package that can run consistently across plat-forms encapsulates all
105、the components an application needs to run in a container,such as binaries,dependencies,and configu-ration files.Organizations are quickly adopting containers in productionAs shown in FIGURE 10,32%of the end-user organizations are already using container-based production deployments,while 54%are sti
106、ll in the planning phase to use containers.This 54%includes 26%who plan to use containers in 2022 and 28%who plan to use containers in 2023 or 2024.This means that 86%of end-user orga-nizations have committed to using containers.As mentioned earlier in this report,modern containers came of age in 20
107、12 when Linux containers became an operating system virtualization technology.In Figure 2,we saw immense interest in cloud container services;however,when we asked more spe-cifically about the deployment of containers to production envi-ronments,in FIGURE 10,a high degree of experimentation with con
108、tainers still appears to be occurring.This is due to the impor-tance of right-sizing containers and having an engine such as Kubernetes to scale container images up and down based on demand.This raises the bar on complexity,which may account for the significant adoption of containers that is just ar
109、ound the corner in FIGURE 10.FIGURE 10PRODUCTION USE OF CONTAINERSWhat is your plan for container-based deployments in production?(select one)Other(please specify)5%26%28%32%9%We have no plans to use containers in production in the next 3 yearsWe are planning to begin using container-based productio
110、n deployments this yearWe are planning to begin using container-based production deployments in 2023 or 2024We are already using container-based production deployments2022 SODA DATA&STORAGE TRENDS,Q21,SAMPLE SIZE=18016DATA AND STORAGE TRENDS 2022NAS file storage is the most common storage type for c
111、ontainer workload deploymentOrganizations need to make crucial decisions regarding where and how to store information.As observed in FIGURE 11,many respon-dents(42%)prefer files in network-attached storage(NAS)for con-tainer workload deployment.The respondents preferred NAS for containerized applica
112、tion deployment,as it helps the transition from traditional to containerized deployment.NAS,because of its net-work-attached orientation,provides an inherent level of flexibility in supporting both containerized and noncontainerized clusters.Other respondents prefer object(17%)and block(15%)storage.
113、In object storage,a specific repository on a distributed system keeps a discrete unit of data(an object).It is accessible through a unique identifier.In block storage,the data is stored as separate pieces across the infrastructure.When we request the data,the under-lying storage software reassembles
114、 the blocks of data.These approaches allow for more flexibility and scalability.For 19%of the respondents,any storage type is good.Oracle,SQL Server,and SAP are the top three traditional workflows to containerizeFIGURE 12 shows the workloads that end-user organizations are inter-ested in containeriz
115、ing.Oracle,(Oracle),Microsoft(SQL Server),and IBM(Db2)are among the leading database and analytics vendors,but there is overlap with ERP.Oracle(Fusion),Microsoft(Dynamics),and SAP(S4/HANA)provide ERP products and services.While DBMS products are inherently data heavy compared to ERP,ERP has sig-nifi
116、cant data management and analytic dimensions;therefore,the workload shown in FIGURE 12 is consistent with the leading data man-agement and analytic workloads shown in FIGURE 1.Because data management and analytic workloads are so critical to end-user organizations,it is no surprise that they are loo
117、king to FIGURE 12TRADITIONAL WORKLOADS THAT THE VENDOR WILL CONTAINERIZEWhich traditional workloads have you or are looking to containerize?(select all that apply)2022 SODA DATA&STORAGE TRENDS,Q23,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=343Dontknow ornot sureOther(pleasespecify)Db2ExchangeSAP
118、SQLServerOracle7%2%14%28%41%43%56%FIGURE 11STORAGE TYPES PREFERRED FOR CONTAINERIZED WORKLOADSWhich storage type do you use or prefer to use for container workload deployments?(select one)2022 SODA DATA&STORAGE TRENDS,Q25,SAMPLE SIZE=180Other(pleasespecify)BlockstorageObjectstorageAny storagetype is
119、 goodFilestorage7%15%17%19%42%17DATA AND STORAGE TRENDS 2022containerize these workloads.Doing so would help with cost,per-formance,reliability,and qualitythe leading criteria for select-ing a data storage vendor,as shown in Figure 9.But none of this changes the fact that data management and analyti
120、cs are mis-sion-critical workloads,and they are going to be containerized.Security and heterogeneous environments are the main challenges for adopting containers in productionAs displayed in FIGURE 13,53%of end-user organizations perceive security as the leading challenge for deploying containers in
121、 production.Container security requires a multilayer approach,starting with the container image;how the container,oper-ating system,and other containers interact;and the runtime environment,including infrastructure.Security is an important requirement to address when containerizing workloads.Importa
122、nt adjacent security topics in FIGURE 13 include data protection and disaster recovery(29%)and compliance(26%).Looking across data security,data protection and disaster recovery,and compliance,the containerization of at least one of these topics are a concern for 68%of end-user organizations.FIGURE
123、13 also shows that multi-cloud deployments(36%)and on-premises(hybrid-cloud)deployments(28%)also complicate container adoption.When we evaluate these factors together,we find that 50%of end-user organizations are concerned about at least one of these issues.A similar situation exists when examining
124、the migration of current noncontainerized deployments(32%)and the cross working of non-container and container-based deployments(31%).When assessing these issues together,we find that at least one of these issues are a concern for 50%of end-user organizations.This collection of containerization chal
125、lenges suggests that end-user organizations need to develop a comprehensive plan for containerizing their current environment,including future state needs,and then develop an implementation plan consistent with FIGURE 13CHALLENGES IN DEPLOYING CONTAINERIZED WORKLOADSWhat are your challenges in conta
126、iner deployments for production?(select all that apply)7%15%26%28%29%31%32%36%53%Dont know or not sureLack of production grade solutions and supportComplianceOn-premises and cloud integration(hybrid deployments)Data protection and disaster recoveryCross working of non-container and container based d
127、eploymentsMigration of current non-container deploymentsMulti-cloud deploymentsData security2022 SODA DATA&STORAGE TRENDS,Q22,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=46118DATA AND STORAGE TRENDS 2022addressing high-priority needs while showing progress and value.Security,compliance,and audits
128、 must be taken seriously since many containers are ephemeral,which could make understanding threats and exploits more complex.Cost,performance,and management drive container rollbacksContainers are often praised for the same reasons end-user orga-nizations roll back or decontainerize applications:co
129、st,perfor-mance,and ease of management.When asking respondents about container rollbacks,we did not first ask a question about whether this had happened in their organization;however,nearly all end-user organizations answered this question,regardless of whether their responses were based on experien
130、ce or expectations.FIGURE 14 shows that cost(55%),performance(52%),and ease of management(52%)are all reasons why end-user organiza-tions decontainerize applications.Unreliability(14%)does not seem to be a strong issue driving rollbacks.When we evaluate cost,performance,and ease of management collec
131、tively,85%of end-user organizations suffer from at least one of these issues.This indicates that containerization solutions must provide better cost-efficient and performing solutions,which can provide simple management,especially unified ones(like SODA)that can help to sustain the containerized dep
132、loyments in production.Cost and performance can go sideways with containers if work-loads are not implemented as microservices.Consequently,if a monolithic application deploys in a container,the lightweight and rightsizing advantages of the container evaporate.Also,there is a lack of subsidy for inc
133、remental investment in container infrastruc-ture(such as Kubernetes)because of the poor utilization of scal-ability capabilities.Ease of container management is an issue for end-user organi-zations new to containers.An incorrect configuration of contain-ers can put container and data security at ris
134、k.Because containers share the underlying operating system running on the server(unlike VMs that use guest operating systems),a vulnerable container has the potential to also impact the integrity of adjacent containers.Finally,FIGURE 14 also revisits a persistent IT problem39%of end-user organizatio
135、ns report skill set shortages.When organiza-tions cannot find full-or part-time employees,they typically resort to one of the following approaches:wait until they can find the right people,use outside professionals(consultants or SIs)tempo-rarily while they continue to search,or retrain existing sta
136、ff.FIGURE 14REASONS FOR ROLLING BACK CONTAINERIZED WORKLOADSHave you non-containerized any workloads(you previously containerized)because of the following reasons?(select all that apply)Dont know ornot sureOther(please specify)UnreliableSkills setshortageEase of managementPerformanceCost11%1%14%39%5
137、2%52%55%2022 SODA DATA&STORAGE TRENDS,Q24,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=40319DATA AND STORAGE TRENDS 2022Approaches to cloud data and storageThe use of cloud environments by end-user organizations is at an all-time high.This chapter looks at cloud deployments,strategies for managing
138、 data across cloud environments,reasons for using a private cloud,challenges of using a multi-cloud environment,and who to partner with for addressing multiclient transitions.91%of organizations deploy workloads in public cloudFIGURE 15 provides an interesting view of how end-user organi-zations lev
139、erage the public cloud.A plurality of respondents(39%)reported that 30 to 50%of their overall work was running in the public cloud.Bordering this finding on either side,22%of end-user organizations were running 50%or more of their workload in the public cloud,and another 22%of end-user organizations
140、 were running 10 to 30%of their workload in the public cloud.Only 7%of end-user enterprises had less than 10%of their workload in the public cloud,and a meager 3%had no workload in the public cloud.Given the fact that workload categories to either side of the 30 to 50%band are asymmetric(the upper b
141、and is 50 to 100%and the lower band is 10 to 30%),it is likely that public clouds run more than 40%of end-user organization workloads.What is clear based on the data is that 61%of end-user organizations run more than 30%of their workload in the public cloud.These findings support the growing emphasi
142、s that organizations are using public cloud resources and because of the strong focus on multi-cloud and hybrid cloud deployments in Figure 7,there will be a significant demand for multi-cloud management and migra-tion tools.65%of organizations use 1 or more clouds for their data storageEnd-user org
143、anizations have a variety of choices in how they can manage data storage.It generally follows that the best perfor-mance and the least latency are accomplished by collocating data and computing resources.FIGURE 16 provides an array of choices.The arrangement of these choices is in a largely prescrip
144、tive order ranging from all data managed on-premises(34%)to all data dis-tributed across multiple public clouds(43%).These two responses FIGURE 15SHARE OF WORKLOADS IN PUBLIC CLOUD ENVIRONMENTSWhat is the share of your public cloud deployments in terms of overall workloads?(select one)Dont know or n
145、ot sureWe do not have public cloud-based deploymentsLess than 10%10%to 30%30%to 50%50%or greater6%3%7%22%39%22%2022 SODA DATA&STORAGE TRENDS,Q26,SAMPLE SIZE=180,AVERAGE ACROSS CATEGORIES IS 39.6%20DATA AND STORAGE TRENDS 2022bookend a series of intermediate responses that cover managed data on a pri
146、vate cloud(28%),managed data across on-premises and a cloud presumed to be public(24%),and all data on a single public cloud(22%).Based on total mentions,the average number of valid responses(after factoring out dont know or not sure responses)to this question were 2.0 per respondent.Also,in the dat
147、a,some responses were prefixed with the word“All,”such as“All data on a single public cloud.”This was found to not be the case,and most respondents selected more than one response,even if one of the responses was worded to indicate all data managed using one use case to the exclu-sion of other use c
148、ases.Therefore,the best approach to interpreting this data is to ignore the“All”prefix.What we infer when doing this is that managing data either on-premises(34%)and managing data in a distributed way across public clouds(43%)are popular compo-nents of a data management solution but that most end-us
149、er orga-nizations have adopted a hybrid approach to managing their data.This indicates that end-user organizations need data management solutions that are hybrid and multi-cloud.Use cases for cloud storage servicesA key finding in the previous figure(FIGURE 16)was that 43%of end-user organizations d
150、istribute their data across multiple public clouds.This is consistent with FIGURE 17,where 49%of end-user organizations report that the cloud is their primary data store.Consistent with this is that 42%of end-user organizations report that data processing and analysis is an accompanying use case.Use
151、 cases that support and revolve around the cloud as a primary data store include complete data protection and disaster recovery(49%),data life cycle management(34%),and archiving/long-term data retention(31%).Demand for these capabilities will continue to grow since the growth in data and the outloo
152、k for cloud-based data management is for continued growth.Data security and privacy drive use of private cloudPrivate clouds are single-tenant environments.There are many reasons for using a private cloud.In most cases,these reasons involve performance,security,control,and regulatory compliance.FIGU
153、RE 18 shows that 57%of end-user organizations reported that the leading reason for using a private cloud was better infor-mation security and data privacy.While encryption,identity,and access management go a long way to securing data assets in the FIGURE 16HOW DATA STORAGE IS MANAGED ACROSS ON-PREMI
154、SES AND MULTI-CLOUD DEPLOYMENTSHow do you manage your data storage across your cloud deployments?(select all that apply)Dont know or not sureNon critical data on cloudDifferent cloud vendorsfor different countriesAll data distributed acrossmultiple public cloudsAll data with singlepublic cloudData d
155、istributed across onpremises and cloudData on distributedprivate cloudsAll data on premises3%14%24%43%22%24%28%34%2022 SODA DATA&STORAGE TRENDS,Q27,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=34821DATA AND STORAGE TRENDS 2022FIGURE 17LEADING USE CASES FOR CLOUD STORAGE SERVICESWhat are your key u
156、se cases for cloud storage services?(select all that apply)Dont know or not sureOther(please specify)No cloud storageOnline backupDistributed data management(regions,services etc)Archiving/long term retention onlyApplication storageData lifecycle management(primary,secondary,archive)Data processing
157、and analysisComplete data protection&disaster recoveryCloud is our primary data store2%1%3%15%23%31%31%34%42%49%49%2022 SODA DATA&STORAGE TRENDS,Q28,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=504FIGURE 18LEADING REASONS FOR PRIVATE CLOUD SOLUTIONSWhat are the top two reasons for you to choose pr
158、ivate cloud solutions?(select exactly two responses)Other(please specify)Specific hardware and infrastructure consideration(not available with cloud providers)Customer demand for private cloud solutionsOverall budget control(reuse the infrastructure etc)Performance considerationGreater accountabilit
159、yBetter control,flexibility and customizationBetter information security and data privacy0%9%15%19%26%26%48%57%2022 SODA DATA&STORAGE TRENDS,Q29,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=36022DATA AND STORAGE TRENDS 2022cloud,users cannot control for vulnerabilities introduced by other end user
160、s running on multitenant hardware.The obvious solution is a single tenancy on compute and storage resources and VPN-based networking.Data privacy is also becoming a global concern.All regions around the world are enacting data privacy and data sovereignty requirements.Better control,flexibility,and
161、customization capabilities is the other leading reason identified by 48%of end-user organizations for using a private cloud.All of this stems from being a single-tenant environ-ment.Performance considerations(26%)and greater accountability(26%)are a derivative of the ability to have greater control,
162、flexibil-ity,and customization capabilities.Private cloud resources allow end-user organizations to reprioritize workloads on demand and benefit from predictable increases in performance and throughput.This provides a level of control,flexibility,and customization that is far easier to achieve in a
163、public cloud environment.As solution providers think about data management solutions,addressing private cloud needs in the areas of security,data privacy,and environmental control,keep in mind that FIGURE 16 demonstrated that 28%of end-user organizations are managing data on distributed private clou
164、ds.Flexibility is the top reason for multi-cloud deploymentsHybrid and multi-cloud environments are common across end-user organizations,and there are some clear advantages to using a multi-cloud environment.A multi-cloud environment is where an enterprise uses more than one cloud platform from two
165、or more CSPs.The primary objective in implementing a multi-cloud environment is flexibility.FIGURE 19 shows that 65%of end-user organizations identify flex-ibility as the leading reason to choose multi-cloud environments.Flexibility comes up often when identifying reasons to use public clouds and pr
166、ivate cloudsfor somewhat different reasons.While private cloud flexibility(as discussed in FIGURE 18)focuses on the flexibility that stems from single-tenant environments,FIGURE 19REASONS FOR SELECTING MULTI-CLOUD VENDORS FOR PRODUCTION WORKLOADSWhat are the top reasons for you to choose multiple cl
167、oud vendors for your production deployments?(select all that apply)Dont know or not sureLess overhead on legal and complianceVendor agnosticEasy data lifecycle managementEasy to provide new services in a short timeAgilityScalabilityCompetitive costing(service,infrastructure and maintenance)Risk mana
168、gementFlexibility3%16%23%29%32%36%37%43%47%65%2022 SODA DATA&STORAGE TRENDS,Q30,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=59523DATA AND STORAGE TRENDS 2022multi-cloud flexibility derives from risk management(47%),com-petitive costing(43%),and agility(36%)that comes from optimiz-ing workload pla
169、cement because of more degrees of freedom.Risk management services and cloud operations include avail-ability,reliability,scalability,manageability,security,and contrac-tual relationships.Risk management is another leading driver for multi-cloud environments,as reported by 47%of end-user organi-zati
170、ons,and supports the leading driver,which is flexibility.Data security and protection are top multi-cloud challengesManaging multi-cloud environments is not without its challenges.All end-user organizations struggle to implement industry best security practices across the software supply chain and e
171、stablish clear policies and actions supporting governance,risk,and compliance.The challenge regarding data security is addressing security when data is at rest and in motion.FIGURE 20 shows that 52%of end-user organizations identify data security and protection as the leading multi-cloud challenge.T
172、here is effectively a three-way tie for the secondary challenge in using multi-cloud solutions.These include data governance and compliance(43%),managing multiple services across clouds(43%),and cost management(41%).Data governance and compliance are familiar issues that are endemic to all cloud env
173、ironments,be they public,private,single,or multi-cloud.Data governance is managing data across its life cycle with respect to security,privacy,accuracy,availability,and usability.As IT moves in the direction of web3,data governance and new regulatory requirements spurred by CCPA(in the state of Cali
174、fornia)and GDPR(and other similar international standards)will bring a sharper focus on data governance and compliance.Managing multiple services across clouds will always be challenging until third-party vendors or FIGURE 20LEADING CHALLENGES IN USING MULTI-CLOUD SOLUTIONSWhat are the top 3 challen
175、ges you face in using multi-cloud solutions?(select between one and three responses)Dont know or not sureOther(please specify)Vendor lock-inUnpredictable performanceData movement between cloudsMigration costsLack of control on infrastructureCost managementManaging multiple services across different
176、cloudsData governance and complianceData security and protection2%1%10%14%15%17%22%41%43%43%52%2022 SODA DATA&STORAGE TRENDS,Q31,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=46924DATA AND STORAGE TRENDS 2022communities bring abstraction layers to support cloud brokering.Multi-cloud cost management
177、 is always challenging but mathemati-cal optimization will eventually address this issue.Challenges cited by end-user organizations always pique the interest of software vendors and service providers because they represent opportunities to provide or improve products and services.Vendors and service
178、 providers preferred for cloud transitionsPlanning,implementing,and transitioning to a multi-cloud envi-ronment can be overwhelming for an end-user organization.End-user organizations new to either cloud or multi-cloud environ-ments will not necessarily know what roles to hire to support a cloud tra
179、nsition.They may also be concerned that if this transition is short-lived,they may be left with skilled resources that they no longer need.The solution to these problems is enlisting vendors or service providers specializing in cloud implementation and/or migration.FIGURE 21 shows that 59%of end-use
180、r organizations would look to cloud service providers(CSPs)to help address their multi-cloud needs.The advantage of using a CSP is their unparalleled knowledge of their own cloud environment.The disadvantage is that their knowl-edge of other cloud environments is not comprehensive,and their competit
181、ive nature may cloud their objectivity.End-user organiza-tions that can compartmentalize their multi-cloud services,leverage various CSPs,and address interoperability and integration require-ments through a neutral third party may succeed with this approach.Fifty-one percent of end-user organization
182、s were in favor of using cloud software solution companies to support their multi-cloud transitions.These companies can include cloud brokers,larger ISVs that are cloud neutral,and ISVs specializing in a particular aspect of multi-cloud integration.Cloud software solution companies have the advantag
183、e of being more neutral,and the extent they special-ize in multi-cloud integration can be an ideal choice for end-user organizations.The disadvantage is that if this multi-cloud integra-tion software is proprietary,it can create an additional measure of vendor lock-in and potential for another singl
184、e point of failure.System integrators were also a logical choice for multi-cloud transi-tions,and 40%of end-user organizations identified them.System integrators,such as Accenture or IBM Consulting,will likely have deep experience addressing multi-cloud transitions.This can be ideal,especially if th
185、ere is a requirement for significant customiza-tion with or between multi-cloud environments.FIGURE 21PREFERRED PARTNERS FOR MULTI-CLOUD TRANSITIONSWhat types of partners do you consider most suited to helping your company with its multi-cloud transition?(select all that apply)Dont know or not sureO
186、ther(please specify)Partner with suitable opensource or industry ecosystemNetwork equipmentSystem integratorsCloud softwaresolution companiesCloud service providers7%1%30%37%40%51%59%2022 SODA DATA&STORAGE TRENDS,Q32,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=40525DATA AND STORAGE TRENDS 2022Fut
187、ure perspectives on data and storageMetadata management is an active area of interest to end-user organizations but is primarily rooted in ensuring the treatment of data quality needs along with governance,risk,and compliance.The need to support hybrid and multi-cloud environments is shaping future
188、perspectives on data and storage management.Observability continues to be an important topic within and across environments,but challenges exist regarding how to manage the mushrooming volumes of information and support multi-envi-ronment decision-making in a scalable and automated way.Future data s
189、torage investment areasThe rapid increases in data storage growth are relevant to end-user organizations and IT vendors/service providers alike.End-user organizations are seeing explosive data growth and face important decisions about where and how to manage this data.Likewise,IT vendors and service
190、 providers must make supply-side capabilities to meet end user demands.FIGURE 22 includes a view of end user and IT vendor perspectives on data storage investment areas.We include both views to see the extent to which IT vendors are in step with the needs of end users.What we see appears to be a ver
191、y good alignment between end-user demand and vendor supply.Although there are some minor differences in how end users and vendors align,these dif-ferences are all within the margin of error for the survey.Perhaps most striking in FIGURE 22 is that most data and storage technology investments are clo
192、ud-based.FIGURE 22 shows that 70%of end-user organizations and 67%of vendors and IT service providers agree with this finding.This provides a wide array of opportunities for cloud solutions vendors in the data and storage technology markets.This also suggests a significant opportunity for the SODA F
193、oundation to provide vendor-neutral solutions to a variety of complex multi-cloud data and storage problems.Data management(36%),which historically has accounted for sig-nificant spending by end-user organization budgets,continues to be a leading investment area.Data analytics(33%)and data and stora
194、ge optimization(29%)closely accompany it.Container tech-nologies are also a significant investment area identified by 24%of end-user organizations.This data aligns well with current workloads(Figure 1),data growth(Figures 5 and 6),and the hybrid and multi-cloud focus of end-user organizations(FIGURE
195、 16).What this means is that there will be strong demand for multi-cloud data management solutions that continue to support data analytic and data management use cases.AI-and metadata-driven capabilities lead the future of data management and analyticsAs we saw in FIGURE 22 and throughout the report
196、,data man-agement and analytics are consistently the most important concerns of end-user organizations.Significantly increasing data volumes feed a desire to extract more insight from this data,which in turn requires improved,scalable approaches to data man-agement and analytics.FIGURE 23 shows that
197、 49%of end-user organizations believe that AI-driven data management is an effective solution to addressing their data management and analytics needs.While AI/ML tech-nology is still in its infancy,it is evolving rapidly and could poten-tially support augmented data management,automated database mai
198、ntenance,and augmented analytics.Another capability that 47%of end-user organizations feel has potential over the next two to four years is IT operational analytics(ITOA).While similar to AIOps,ITOA relies more on big data,optimi-zation,and predictive analytics,whereas AIOps focuses more on AI/ML.Th
199、is makes ITOA more accessible,but it still requires oversight by people skilled in modeling,statistics,and mathematical optimization.26DATA AND STORAGE TRENDS 2022FIGURE 22LEADING DATA STORAGE INVESTMENT AREAS OVER THE NEXT THREE YEARSWhat are your organizations top 3 data and storage technology inv
200、estment or deployment areas for the next 3 years?(segmented by end user and vendor/IT service provider)Dont know or not sureOther(please specify)HCI(hyper-converged infrastructure)AIOpsAll flashCommodity storageInteroperability across legacy systems and modern systemsGreen storageEdge data&storage(a
201、ll edge,IoT related)Storage performance and observabilityData&storage automationContainer technology(Kubernetes,managed Kubernetes)Data and storage optimizationData analytics(AI/ML)Data managementCloud technology(multi-cloud,hybrid cloud,private cloud,public cloud)2%1%5%6%6%7%9%9%12%13%16%24%29%33%3
202、6%70%0%1%7%8%11%7%5%8%14%13%10%27%35%27%29%67%End-user organizationsIT Vendor or Service Providers2022 SODA DATA&STORAGE TRENDS,Q11 BY Q10,SAMPLE SIZE=392,VALID CASES=392,TOTAL MENTIONS=1,07027DATA AND STORAGE TRENDS 2022Augmented analytics uses ML,NLP,advanced analytics,and process automation to su
203、percharge data analytics with the objec-tive of improved and faster decision-making.FIGURE 23 shows that 37%of end-user organizations agree that augmented analyt-ics is not just a natural progression of data analytics but also rep-resents a key investment area over the next several years.DataOps is
204、yet another field that interests 36%of end-user orga-nizations.DataOps is about applying DevOps principles to data analytics with a focus on arriving at higher quality data and faster cycle times for extracting better insights from your data.This is yet another evolutionary approach to improving dat
205、a analytics.Future metadata management prioritiesMetadata management is a cornerstone of building a data-driven business.Metadata is,of course,data about data.Understanding metadata enables the development of strategies to analyze data and drive insights.End-user organizations recognize that expandi
206、ng their data collection activities can give them better insight into their customers.This enables them to transition away from a“one-size-fits-all”go-to-market strategy and use metadata and data to address individual customer needs better.While metadata management is relevant to improving the enter
207、prises business,it also supports a more cost-effective way to address IT operations through ITOA and AIOps.FIGURE 23KEY DATA MANAGEMENT AND ANALYTIC AREAS OVER THE NEXT TWO TO FOUR YEARSWhich are the key capabilities you think are critical in the next 2-4 years for data management and data analytics
208、?(select all that apply)Dont know or not sureOther(please specify)Federated data managementReal-time unstructured data processingMetadata-centric data fabricDataOpsAugmented analyticsMetadata-driven IT operational analytics(AIOps and more)AI-driven hybrid data management6%1%25%31%34%36%37%47%49%2022
209、 SODA DATA&STORAGE TRENDS,Q20,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=47728DATA AND STORAGE TRENDS 2022Conversations around data management always seem to include metadata management these days.When mastering data man-agement was a hot topic,metadata was a key ingredient to the solution.Today
210、s efforts around data management and data ana-lytics also seem to revolve around metadata management.The priorities shown in FIGURE 24 reflect a pragmatic approach to data management and analytics by end-user organizations.Data quality(64%)and governance and security(60%)were the leading priorities
211、regarding metadata management.There is an import-ant reason for this.Because analytic insights are only as good as the availability and quality of the data,data quality becomes a top priority.This has historically been true and continues to be true.Data governance and security are almost equally imp
212、ortant because of growing privacy concerns and increased regulations,such as GDPR and CCPA.Just as important as strong policy around governance is the ability to secure data at rest and data in use.The second tranche of priorities,including integration and pro-visioning(41%),metadata stores(39%),cer
213、tain collaboration(31%),discovery and extraction(30%),and classification and lineage(26%),were all indicative of tactical activities that the use of metadata management can improve;however,these activities appear to rank as a secondary priority because most organizations still have a long distance t
214、o travel regarding data quality,gover-nance,and security.What this all means is that unified and distributed metadata man-agement can improve data quality and security.Observability-based solutions(remote or local)are much in demandObservability helps DevOps staff understand the operation of complex
215、 systems.Observability helps developers and operators understand where problems exist and the necessary improve-ments and may point to solutions for these issues.Observability extracts the value of data with actionable insights and facilitates intelligent automation.FIGURE 25 shows that the leading
216、observability use case by 47%of end-user organizations is full-fledged AIOps across DCs(AIOps observe,engage,and act).This response uniquely stands out relative to the other responses,despite being the only“full stack”response in the list.The preference for this response is also because end-user org
217、anizations are looking for solutions,FIGURE 24LEADING PRIORITIES WHEN SELECTING METADATA MANAGEMENT SOLUTIONSWhat are your top priorities when selecting metadata management solutions?(select all that apply)Dont know or not sureNot applicableClassification and lineageDiscovery and extractionSearch an
218、d collaborationMetadata storeIntegration and provisioningGovernance and securityData quality5%1%26%30%31%39%41%60%64%2022 SODA DATA&STORAGE TRENDS,Q19,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=53629DATA AND STORAGE TRENDS 2022not just information.Observability is the backbone for end-to-end AIO
219、ps.Observability is also about surfacing and coordinat-ing data from across operations and making this data foundation for decision-making to remediate or improve operational issues.Another reason behind this choice is that end-user organizations have fixated on multi-cloud solutions throughout this
220、 report.This response specifically identifies itself as one that cuts across data centers.Much like the prior question,virtually all of the other responses to this storage observability question identify more narrowly focused and tactical activities that lack the breadth and scope of an AIOps-based
221、solution.Top five challenges for storage observabilityObservability and storage observability are not without their challenges.Observability tools,especially monitoring tools,can be effective at surfacing data about activities and performance but often lack the predictive capabilities to anticipate
222、problems before they occur.Virtually all CSPs offer cloud storage monitoring,but finding a unified approach across multi-cloud environments remains challenging,especially if it needs to address problem reme-diation.Third-party tools are available that work across multi-cloud environments.Once again,
223、the focus is primarily on monitoring.In FIGURE 26,48%of end-user organizations identified cloud storage monitoring as a top challenge.Todays emphasis on FIGURE 25HOW STORAGE OBSERVABILITY SOLUTIONS ARE BEING USEDHow are you using storage observability solutions currently for your infrastructure mana
224、gement?(select all that apply)Dont know or not sureOther(please specify)No observability solutionsCross DC and remote monitoringIndividual DC monitoringIntegrated AI/ML analysisIndividual DC observabilityFull-fledged AIOps across DCs(AIOps-observe,engage and act)8%1%19%24%27%28%31%47%2022 SODA DATA&
225、STORAGE TRENDS,Q34,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=33330DATA AND STORAGE TRENDS 2022FIGURE 26LEADING CHALLENGES WHEN DEPLOYING DATA AND STORAGE OBSERVABILITY TOOLSWhat are the top 3 challenges you face in deploying observability tools for data and storage?(select between one and three
226、 responses)Dont know or not sureOther(please specify)Lack of tools and frameworkHeterogeneous storageCross-DC and hybrid(cloud,on premises,edge)observabilityRemote monitoringContainer storage monitoringUnified visualizationReal-time monitoringCloud storage monitoring4%1%18%20%21%28%33%37%38%48%2022
227、SODA DATA&STORAGE TRENDS,Q33,SAMPLE SIZE=180,VALID CASES=180,TOTAL MENTIONS=448multi-cloud environments and the rapidly increasing scope of data storage activities in these environments means that monitoring is simply table stakes.Storage observability and cloud storage mon-itoring need ways to avoi
228、d or remediate problems effectively in a scalable and highly automated way.FIGURE 26 also shows that real-time monitoring(38%)and unified visualization(37%)also qualify as leading storage observabil-ity challenges.Real-time monitoring reduces information latency but is really only a first step in ad
229、dressing a storage observabil-ity solution.Likewise,unified visualization can effectively bring together observability data across a multi-cloud environment,and prioritizing data and information is important.But once again,where is the support for evaluating a high-priority situation and informed de
230、cision-making with high quotient scalability and automation?Observability solutions,which can provide hybrid data monitoring with unified interfaces,would be immensely helpful to end-user organizations addressing their hybrid and multi-cloud container application deployments.31DATA AND STORAGE TREND
231、S 2022The impact of open source on data and storageOpen source solutions can provide benefits to cloud computing in a world of increasing data production and storage.This chapter looks at reasons for using open source,their level of involvement in SODA projects,and how they drive organizational bene
232、fit.OSS can help improve quality,reliability,security,costs,and promote collaborationUnderstanding why organizations adopt open source projects is relevant for guiding project developers and inspiring companies to plan or explore open source solutions.FIGURE 27 shows that the top two reasons that en
233、d-user organizations and IT vendors/service providers use open source is to improve quality,reliability,and security(45%overall)and because it is cost effective(43%overall).The characteristics of quality,reliability,security,and cost resonate with end-user organizations who believe that cost,per-for
234、mance,reliability,and quality are the leading attributes when selecting a storage vendor(Figure 9).Another common reason for OSS adoption is cost-effectiveness(47%of end-user organizations and 40%of vendors and service providers selected this).OSS products reduce costs since their IP is free.This ha
235、s always been a strong“selling”point for OSS products;however,there can be functionality differences between OSS products and proprietary competitors.Almost one-third of our respondents(30%of end-user organi-zations and 33%of IT vendors and service providers)adopt OSS because of the open ecosystem t
236、hat can accompany an OSS project.An open ecosystem of partners from cloud vendors,OSS providers/foundations(such as SODA)can bring more trust to users in these ecosystems.Organizations can join a digital value creation network in which companies share software and data and offer services to each oth
237、er.These ecosystems can reach value and innovation levels that no single company could create alone.End-user organizations and IT vendors/service providers have realized that they cannot do everything alone,and OSS provides the perfect foundation for large-scale collaboration.Another advantage of th
238、ese ecosystems is creating a support network that can be composed of end users,vendors,community members,and consultants.This goes against the misconception of a lack of support for OSS products.Open source can bring global trust and openness to the framework solutions to provide unified software so
239、lutions for multi-cloud,metadata,container,and observability demands.A significant segment of our respondents(27%of end-user orga-nizations and 33%of IT vendors and service providers)adopt OSS specifically because OSS products have a vibrant support network in place.Another advantage of the open eco
240、system is that organizations can participate in and influence the development of the products.Indeed,30%of end-user organizations and 33%of IT vendors and service providers reported that transparent and collaborative development was a reason to adopt OSS.Organizations can also develop custom feature
241、s if needed,provid-ing great flexibility and overcoming restrictions imposed by pro-prietary vendors(34%of end users and 26%of IT vendors selected this reason).Organizations plan to use SODA OSS projects to support multi-cloud environments,interoperability,monitoring,and containersThe SODA Foundatio
242、n is an open source project under the Linux Foundation that aims to foster an ecosystem of open source data management and storage software for data autonomy.The SODA Foundation offers a neutral forum for cross-project collaboration 32DATA AND STORAGE TRENDS 2022FIGURE 27LEADING REASONS FOR ADOPTING
243、 OPEN SOURCE PROJECTSWhat are the top 3 reasons for the adoption of open source projects in your organization?(select between one and three responses)Dont know or not sureNot confident yet to use open source over commercial products for storageOpportunity to understand more use cases from the commun
244、ityMore exposure to new technologiesLarge community supportTransparent and collaborative developmentEasy to develop custom featuresIncreasing support for open source productsOpen ecosystemCost effectiveImproving quality,reliability and security2%4%12%22%18%30%34%27%30%47%47%2%4%15%15%20%27%26%33%33%
245、40%43%End-user organizationsIT Vendor or service providers2022 SODA DATA&STORAGE TRENDS,Q35 BY Q10,SAMPLE SIZE=389,VALID CASES=389,TOTAL MENTIONS=1,03033DATA AND STORAGE TRENDS 2022and integration and provides end users with quality end-to-end solutions.The SODA Open Data Framework aims to provide a
246、 unified data and storage management framework,seamlessly connecting the application platforms and solutions to the backend storage through a unified API layer.This enables the application platforms to leverage the open ecosystem around the OSS products and focus on building more valuable use cases
247、rather than worrying about managing the underlying storage backends and data management.As observed in FIGURE 28,the most popular SODA projects for our respondents address multi-cloud data management.Strato,which 45%of the end-user organizations and 32%of the IT vendors selected,provides a cloud ven
248、dor agnostic data manage-ment capability for hybrid cloud.The goal is to provide a unified interface to support file,block,and object services across multiple cloud vendors.Como,selected by 31%of end-user organizations and 24%of IT vendors,is a multi-cloud virtual data lake providing a central-ized
249、repository with a single common interface for data stored in public or private clouds that will start in early 2023.This project allows users to connect with a single interface and obtain a unified view of data from multiple sources with minimal data transfer,enhanced security and governance,faster
250、integration and deployment,and better performance,versatility,and scalabil-ity.Interoperability also presented itself as an important feature,mainly for IT organizations.Terra,selected by 20%of the end-user organizations and 28%of IT vendors,provides a standardized API,a controller for metadata,and
251、a dock for drivers to provide seamless data management across various storage vendors,connecting different platforms,such as Kubernetes,Open Stack,and VMware,through plugins.After integration and interoperability,monitoring was also top ranked among our respondents.Delfin,selected by 18%of end-user
252、organizations and 26%of IT vendors,provides unified performance monitoring and alerting across heterogeneous storage.This project is extensible to add more storage vendors and data processing and visualization capabilities.Managing and augmenting containers is also in great demand.Kahu,selected by 2
253、1%of end-user organizations and 17%of IT vendors,augments Kubernetes by offering enhanced data man-agement with data protection,observability,and mobility.LinStor,preferred by circa 20%of the respondents,manages repli-cated volumes across a group of machines.With native integration to Kubernetes,Lin
254、Stor facilitates building,running,and controlling block storage in large Linux server clusters.Other projects with less than 20%of preference include OpenEBS(container attached storage),Cortx(mass capacity object storage),DAOS(NVM object storage),KubeEdge(edge computing man-agement),Zenko(multi-clou
255、d data controller),YIG(massive object storage),CubeFS(cloud native file and object storage),SBK(storage benchmarking),and Karmada(multi-cluster k8s control-ler).Only 6%of our respondents have no plans to adopt or partici-pate in SODA projects,and approximately 10%do not know or are not sure.Non-SODA
256、 data and storage project useOther non-SODA open source data and storage projects com-plement the technological landscape of our respondents.Swift,selected by 46%of end-user organizations and 38%of IT vendors,is OpenStacks object store project.Access is through a REST-based API,and there is much to
257、like about Swift due to its highly available,distributed,and eventually consistent object/blob store.Swift was one of the first OpenStack projects,and this project is very mature.Gluster,which Red Hat acquired in 2011,is also popular among respondents,especially IT vendors,at 32%,compared with just
258、26%for end-user organizations.Gluster allows organizations to 34DATA AND STORAGE TRENDS 2022FIGURE 28PROJECTED SODA OPEN FRAMEWORK PROJECT ADOPTIONWhich SODA open framework or eco projects are you most likely to adopt or participate in their development?(select all that apply)Dont know or not sureWe
259、 currently have no plans to participate or adopt SODA open framework or SODA eco projectsKarmada-multicluster k8s controllerSBK-storage benchmarkingWe are participating or plan to participate in other open framework or eco projects going forwardCubeFS-cloud-native file and object storageYIG-massive
260、object storageZenko-multi-cloud data controllerKubeEdge-edge computing managementDAOS-NVM object storageCortx-mass capacity object storageOpenEBS-container attached storageLinStor-container storage managementKahu-container data protectionDelfin-heterogeneous storage monitoringTerra-SDS controllerCom
261、o-multi-cloud data lakeStrato-multi-cloud data management9%6%7%10%10%9%12%14%12%15%13%16%17%17%26%28%24%32%11%6%5%9%11%15%12%12%15%11%16%14%20%21%18%20%31%45%End-user organizationsIT Vendor or service providers2022 SODA DATA&STORAGE TRENDS,Q36 BY Q10,SAMPLE SIZE=389,VALID CASES=389,TOTAL MENTIONS=1,
262、10635DATA AND STORAGE TRENDS 2022create large,distributed storage solutions using common off-the-shelf hardware for media streaming,data analysis,and other data-and bandwidth-intensive tasks.As we observed with SODA projects,solutions that augment Kubernetes are popular among our respondents.Rook,pr
263、eferred by approximately 24%of the respondents,automates various administration tasks for cloud native storage in Kubernetes,including deployment,bootstrapping,configuration,provisioning,scaling,upgrading,migration,disaster recovery,monitoring,and resource management.More focused on big data tasks,1
264、8%of end-user organizations and 21%of IT vendors selected the in-memory immutable data manager named Vineyard.Similarly,Longhorn,preferred by 22%of the respondents,provides cloud native distributed block storage for Kubernetes,and MinIO,preferred by about 20%of the respondents,provides S3-compatible
265、 multi-cloud object storage to Kubernetes.In a het-erogeneous and complex environment such as data storage,it is unsurprising that Veleroa solution for data recovery,migration,and protectionwas especially popular among end-user organiza-tions(21%)but just 16%of IT vendors.FIGURE 29NON-SODA DATA AND
266、STORAGE PROJECT USEWhich non-SODA open source data and storage projects are you using or considering for your development and production environments?(select all that apply)Dont know ornot sureTritonCephVeleroVineyardMinIOLonghornRookGlusterSwift18%17%16%21%18%18%22%25%26%46%13%10%17%16%21%22%22%23%
267、32%38%End-user organizationsIT Vendor or service providers2022 SODA DATA&STORAGE TRENDS,Q37 BY Q10,SAMPLE SIZE=389,VALID CASES=389,TOTAL MENTIONS=85636DATA AND STORAGE TRENDS 2022ConclusionsThe 2022 Data and Storage Trends survey provides a compre-hensive look at the intersection of cloud computing,
268、data and storage management,the configuration of environments that end-user organizations are gravitating to,and tests for the impor-tance of selected capabilities over the next several years.Looking at the results of the survey and the findings in this report,we come away with the following conclus
269、ions:End-user organizations continue their journey to the data-driven enterpriseFigure 1 shows data analytics and database management as top end-user organization workloads,and the importance of analytics and data management surfaces repeatedly across this survey.Combined with the 3X increase in dat
270、a growth between 2021 and 2022(Figure 6),end-user organizations have an opportunity to become more data-driven.The appeal of being data-driven is that from a business standpoint,the organization can better cater to customer needs in a much more fine-grained and automated way.The holy grail of this a
271、pproach is moving from a”one-size-fits-all”marketing strategy to“markets of one”the ability to custom tailor solutions to individual customers needs.Being data-driven means being more focused on data collection,data quality,metadata management,and effective techniques for managing data across hybrid
272、 and multi-cloud environments.Being data-driven is also a business imperative that escalates its impor-tance relative to IT objectives,which focus on leveraging technol-ogy to better address business needs.End users embrace a hybrid and multi-cloud futureThe survey also reflects the focus on cloud a
273、nd multi-cloud envi-ronments by end-user organizations.End-user organizations are often already involved in hybrid operations.Their expectations are that there will be a multi-cloud environment in their future if it is not already present.FIGURE 15,which asks about the share of public cloud workload
274、s,enables us to estimate that,on average,40%of end-user organi-zation workloads run in the public cloud.The leading response to FIGURE 16 by end-user organizations was multiple public clouds distribute all data.FIGURE 17,which asked about key cloud storage use cases,showed that 49%of end-user organi
275、zations identify the cloud as their primary data store.When asked about which characteristics best describe their choice of storage vendors in Figure 8,most end-user organizations reported that they would be using multiple storage vendors with one primary vendor and were planning to add even more ve
276、ndors.This data and storage strategy aligns well with the heterogeneous approach that end-user organizations are taking to cloud adoption.Private clouds excel at information security and data privacyDespite an exceedingly strong focus on public clouds,end-user organizations know that private clouds
277、provide some unique values.FIGURE 18 clearly shows that end-user organizations believe that private clouds have better information security and data privacy,as well as better flexibility control and capabilities for customization.While noisy neighbors can be an annoyance in the public cloud,the real
278、 benefit of a private cloud is that an end-user organization does not need to be concerned about the quality or security of other tenants applications.In this age of con-tainers where multiple tenants in a public cloud share an operating system image,vulnerabilities and exposures introduced by other
279、 tenants applications can also impact adjacent tenants.Private clouds simply eliminate this problem.37DATA AND STORAGE TRENDS 2022Open source software is uniquely positioned to address data and storage requirements of hybrid and multi-cloud needsHybrid and multi-cloud environments create significant
280、 chal-lenges for end-user organizations.From a business perspective,most CSPs want to provide their customers with the best expe-rience possible.But this does not extend to how their environment interoperates or integrates with competing CSPs.While the exis-tence of multiple leading CSPs in the mark
281、et is desirable,and few end-user organizations are interested in using only one CSP,IT vendors and service providers have only selectively(by function)developed an abstraction layer that integrates multi-cloud environments.This is exactly where a neutral third-party without a profit-seeking or propr
282、ietary agenda ideally positions itself to address integra-tion needs.The SODA Foundations open data framework provides a compelling unified capability for data and storage management through a unified API layer.Forty-five percent of end-user orga-nizations in our survey(FIGURE 28)embraced the Strato
283、 project,which focuses on multi-cloud data management.Because we expect the demand for multi-cloud environments and multi-cloud data management to continue growing,the SODA Foundation is well-positioned to address innovation and industry needs beyond the scope of the for-profit IT vendor community.M
284、ethodologyDuring July and August 2022,the SODA Foundation and Linux Foundation Research fielded a worldwide survey of individuals at organizations on a range of questions related to trends and concerns about their data and storage environments.They surveyed small,medium,and large enterprises,includi
285、ng a cross-section of end-user enterprises,vendors,and IT service providers.Survey participants included employees in various roles,such as CxOs,developers,data&analytics professionals,enterprise archi-tects,and R&D and product development.The data from the 2021 study and this 2022 survey is openly
286、available on data.world.Like last year,this 2022 survey focuses on end-user organizations.End-user organizations exist in every industry and are primary consumers of IT products and services.Vendors and IT service providers,who are primarily producers of IT products and services,also participated in
287、 the survey.Comparisons between the 2021 and 2002 questions were per-formed where possible.The promotion of the survey occurred via social media,the Linux Foundation and L websites,the Linux Foundation Newsletter,and our survey partners(see Acknowledgements).Percentage values in charts may not add u
288、p to 100%due to rounding and multi-response answers.38DATA AND STORAGE TRENDS 2022DemographicsThe sample size analyzed for the 2022 survey was 392.This sample size reflects those respondents who passed various screening and filtering criteria,which included the following:The respondent had to self-i
289、dentify as a real person.Respondents had to be familiar,very familiar,or extremely familiar with how their organizations are addressing their data and storage needs.Respondents could be in any industry except for those focused on education,hospitality,or other industries not listed on the questionna
290、ire.Respondents had to answer the first content question after the screening and demographic questions.This years sample included data collected by the SODA Foundation and its partners(17%of the sample)and data collected by a third-party panel provider(83%of the sample).FIGURE 30 provides selected d
291、emographics that profile the sample.Overall,the left panel reports on organization size.Micro and small and medium-sized enterprises comprise 42%of the sample,and large or very large(enterprise)organizations account for 58%of the sample(dont know or not sure responses excluded).FIGURE 30SELECTED SUR
292、VEY DEMOGRAPHICSOrganization size Respondents by regionRespondents by roleVery large(10,000+)Large(1,000-9,999)SME(100-999)Micro(1-99)23%35%31%12%19%39%25%17%EMEANorth AmericaAsia Pacific21%35%44%23%25%51%OtherConsultant or SIR&D/product devCxODeveloperProduct mgmt/mktEnterprise archData and analyti
293、cs14%8%7%16%8%12%12%23%13%11%9%8%11%14%16%17%SAMPLE SIZE=392End-user organizationsIT Vendor or service providers39DATA AND STORAGE TRENDS 2022The middle panel shows the region where respondents live in a split 50/50 between Asia Pacific and the West(meaning North America and Western Europe).The pane
294、l on the right provides a window into the respondents role.Overall,IT roles account for about 85%of the sample,and non-IT roles account for 15%of the sample.FIGURE 31 shows the sample segment(N=428)between end users and vendors.The left panel shows that vendors account for 55%of the sample(N=235),le
295、aving the remaining 45%to end-user organizations(N=193).The margin of error for the overall sample(N=428)is+/-4.0 at 90%confidence.The margin of error for the end-user segment(N=193)is+/-5.8%at 90%confidence.The middle panel shows the distribution of end-user organiza-tions after filtering out those
296、 respondents who did not know,were not familiar,or slightly familiar with their employers approach to data and storage needs.While we usually expect responses to this question to trail off as the familiarity choices become more demanding,the survey results show the opposite.This was FIGURE 31DEMOGRA
297、PHIC SEGMENTATIONEnd user&vendor segmentationData&storage familiarityRespondents by industryIT Vendor orsevice providersEnd-userorganizations54%46%0.00.10.20.30.40.5FamiliarVery familiarExtremely familiar19%24%57%22%35%43%Other namedindustriesTransportationHC/Life sciencesRetail/WholesaleGovernmentE
298、ngineeringManufacturing/CPGFinancia servicesIT15%4%5%6%6%9%11%12%32%5%2%3%6%0%4%7%15%58%SAMPLE SIZE=392End-user organizationsIT Vendor or service providers40DATA AND STORAGE TRENDS 2022because a significant number of survey completes came from a third-party panel provider who pre-screened using this
299、 question.So,most of these third-party panel respondents were familiar with their employers data and storage needs.The third panel in FIGURE 31 shows the distribution of industries for end-user organizations.While the strong showing of IT was surprising,the focus on financial services,manufacturing,
300、and engineering is consistent with what we usually see in surveys.Our segmentation between end users and vendors or service provid-ers was based on Q10,and its wording was very clear;therefore,we believe those respondents who answered“IT”were likely to work for an IT organization inside an end-user
301、company.Other named industries,totaling 14%,include automotive,media,oil and gas,utilities,and agriculture.The analysis in this report generally focuses on end-user findings.Figures 1-21 and 23-26 in this report show just end-user data.Figures 22 and 27-31 show both end-user data and IT vendor and s
302、ervice provider data.41DATA AND STORAGE TRENDS 2022About the authorStephen Hendrick is the vice president of research at the Linux Foundation,where he is the principal investigator on a variety of research projects core to the Linux Foundations understanding of how open source software is an engine
303、of innovation for producers and con-sumers of information technology.Steve specializes in primary research techniques developed over 30 years as a software industry analyst and is a subject matter expert in application development and deployment topics,including DevOps,application management,and dec
304、ision analytics.Steve brings experience in a variety of quantitative and qualitative research techniques that enable deep insight into market dynamics and has pioneered research across many application development and deployment domains.He has authored over 1,000 publications and provided market gui
305、dance through syndicated research and custom consulting to the worlds leading software vendors and high-profile start-ups.AcknowledgmentsThe support and collaboration of the following individuals helped author this document:Hilary Carter(Linux Foundation),Michael Dolan(Linux Foundation),Lawrence Hec
306、ht(Linux Foundation),Anna Hermansen(Linux Foundation),Rakesh Jain(IBM),Larry Karr(SODA Foundation),Sanil Kumar(SODA Foundation),Christina Oliviero(Linux Foundation),Jason Perlow(Linux Foundation),Melissa Schmidt(Linux Foundation),and Steven Tan(SODA Foundation).This report would not be possible with
307、out the support of the following partners:China Electronics Standardization Institute(CESI)China Open Source Cloud League(COSCL)Chinese Software Developer Network(CSDN)Cloud Computing Innovation Council of India(CCICI)Cloud Native Computing Foundation(CNCF)Electronics For You(EFY)IEEE Bangalore Sect
308、ion Japan Data Storage Forum(JDSF)Mulan Project Open Infra Foundation(OIF)Storage Networking Industry Association(SNIA)42DATA AND STORAGE TRENDS 2022DisclaimerThis report is provided“as is.”The Linux Foundation and its authors,contributors,and sponsors expressly disclaim any warranties(express,impli
309、ed,or otherwise),including implied warranties of merchantability,non-infringement,fitness for a particular purpose,or title related to this report.In no event will the Linux Foundation and its authors,contributors,and sponsors be liable to any other party for lost profits or any form of indirect,spe
310、cial,incidental,or consequential damages of any character from any causes of action of any kind with respect to this report,whether based on breach of contract,tort(including negligence),or otherwise and whether they have been advised of the possibility of such damage.Sponsorship of the creation of
311、this report does not constitute an endorsement of its findings by any of its sponsors.Copyright 2022 The Linux FoundationThis report is licensed under the Creative Commons Attribution-NoDerivatives 4.0 International Public License.To reference the work,please cite as follows:Stephen Hendrick,“Data a
312、nd Storage Trends 2022,”foreword by Rakesh Jain,The Linux Foundation,December SODA(Storage Open Data Autonomy)Foundation is an open source project under the Linux Foundation that fosters an ecosystem of open source data management and storage software for data autonomy.SODA offers a neutral forum fo
313、r cross-project collaboration and integration and provides end users with quality end-to-end in 2021,Linux Foundation Research explores the growing scale of open source collaboration,providing insight into emerging technology trends,best practices,and the global impact of open source projects.Through leveraging project databases and networks,and a commitment to best practices in quantitative and qualitative methodol-ogies,Linux Foundation Research is creating the go-to library for open source insights for the benefit of organizations the world over.