종관기상관측 자료 기반의 태양광 발전량 예측 모델 구현 :Implementation of solar power generation prediction model based on synoptic weather observation data

김준용

추천

검색

자료유형: 학위논문

저자정보: 김준용 (서울시립대학교, 서울시립대학교 일반대학원)

지도교수: 김규식

발행연도: 2023

저작권: 서울시립대학교 논문은 저작권에 의해 보호받습니다.

이용수24

이 논문의 연구 히스토리 (2)

2023

종관기상관측 자료 기반 태양광발전량 예측에 관한 연구

김준용 , 정재원 전자공학회논문지 2023.04 학술저널

종관기상관측 자료 기반의 태양광 발전량 예측 모델 구현

김준용 전자전기컴퓨터공학과 2023.01 학위논문

이 논문의 후속연구가 궁금하신가요?
연관 학술논문 또는 학술발표를 통해 보다 발전된 연구결과를 확인하실 수 있습니다.
이 논문의 연구 히스토리 확인하기

초록· 키워드

오류제보하기

세계적으로 탄소배출 및 온실가스 배출을 줄이기 위해 온실가스 발생원인 중 하나인 화석연료의 사용량을 감축을 위한 방안으로 신재생 에너지가 대두되고 있다. 하지만 신재생에너지는 일사량, 풍량, 풍속 등 자연에서 발생되는 조건들에 의존하여 전력을 생산하는 발전특성을 가지고 있어 출력전력에 대한 불확실성을 가지고 있다. 간헐적인 신재생에너지의 발전특성은 예측한 발전량이 전체 발전량과 실제 생산된 발전량의 큰 차이를 발생시킬 수 있어 발전량의 수요예측의 실패로 인한 전력 공급량의 부족과 발전량의 과소예측으로 인한 전력생산 과잉 문제를 발생시킬 우려가 발생된다.
이러한 문제점 때문에 정부는 신재생에너지를 안정적으로 공급하기 위해 신재생에너지의 발전특성을 보완하고 소규모로 운영 중인 발전소를 통합하여 운용할 수 있도록 재생에너지 발전량 예측제도를 도입하여 출력 변동이 심한 재생에너지의 특성을 완화할 수 있으며 신재생에너지 발전소들이 갑작스럽게 추가로 발전기를 기동하거나 정지함으로써 발생되는 비용을 절감할 수 있어 전력계통을 운영하는데 있어 효율성을 향상시킬 수 있도록 하였다. 따라서 안정적인 전력계통을 운영하기 위해서는 태양광 발전량을 정확하게 예측할 수 있는 기술이 필요하기 때문에 본 논문에서는 태양광 발전량 예측을 위한 태양광 발전량 예측모델을 구현하고자 한다.
기존의 태양광 발전량 예측모델에 대한 연구사례에서는 다양한 데이터와 예측모델을 사용하여 태양광 발전량을 예측하는 연구가 진행되었다. 미래의 태양광 발전량을 예측하기 위해서는 기반이 되어야 하는 데이터, 그리고 이를 분석하기 위한 분석 모델이 필요하다. 각 연구들에서 사용한 데이터는 과거의 발전량이나 기상센서 데이터, 기상예보 데이터 그리고 종관기상관측 데이터가 있었으며 이를 장, 단기 메모리, 기계학습, 딥러닝 등 다양한 모델링을 이용해 태양광 발전량 예측 모델을 구현하였다.
하지만 유의미한 종관기상관측변수 일부만을 사용해 다중 선형 회귀분석을 통한 태양광발전량예측모델을 구현한 예측사례는 없었으며, 이를 구현하면 예측 모델에 대한 오차율을 낮출 수 있으며 기계학습이나 딥러닝 보다 상대적으로 더 적은 연산량을 통해서도 발전량예측 모델을 구현할 수 있다. 따라서 본 논문에서는 종관기상관측데이터를 기반으로 다중 선형 회귀분석을 활용한 태양광 발전량 예측 모델을 구현하였다. 이를 위해 태양광 발전소의 발전량 데이터와 종관기상관측데이터를 수집하여 데이터로 사용하였다. 수집된 기상데이터는 일출 시간대 (07:00)부터 일몰시간(19:00)까지의 자료를 사용하였다. 태양광 발전량 데이터역시 동일한 시간대의 계측된 데이터를 사용하였다.
이후 수집된 발전량 데이터와 기상데이터간의 관계의 유무를 판단하기 위하여 발전량과 기상데이터간의 상관분석을 진행하였다. 수집된 각 기상 계측 데이터 변수들의 대한 평균과 표준편차를 계산한 이후에는 각 기상데이터 변수들과 발전량 데이터간의 관련성이 높은 변수를 선별하기 위해 통계학에서 데이터 간의 패턴을 분석할 때 사용하는 지표 중 하나인 스피어만 상관계수를 계산하여 유의미한 변수를 선별하였다. 이때 일사량, 지면온도, 일조량, 상대습도, 기온, 전운량, 풍향, 풍속 순으로 발전량과 선형적인 상관관계가 높은 것으로 나타났으며 산점도 에서도 전운량, 풍향, 풍속의 경우 발전량과의 연관성이 다소 낮아 세 데이터를 제외하여 다중회귀분석을 진행하였다.
다중회귀분석결과는 각 독립변수들의 유의확률을 통해 회귀분석모델의 유의성을 판단하였으며 조정된 결정계수 R2를 통해, 선별된 독립변수가 종속 변수를 얼마나 잘 설명하는가를 확인하였다. 그 결과 조정된 R2의 값이 0.80에 근접하기 때문에 높은 설명력을 가지고 있었으며 회귀분석 모델 자체의 유의확률 값은 0.001 보다 작은 수치로 나타나 0.05보다 매우 작기 때문에 통계적으로 유의한 것을 알 수 있었다.
이후 독립변수들의 데이터 오차율을 낮추고 회귀분석모형의 타당성을 높일 수 있도록 유의성이 높다고 판단되는 데이터를 선별하기 위해 변수 선택법을 사용하여 사용하고자 하는 변수를 선별하여 다중회귀분석을 다시 진행하였다. 회귀분석에서 사용하는 변수 선택법 중 본 논문에서는 중요하지 않은 변수를 선별하여 이를 제거하고 유의미한 독립변수만을 사용하여 태양광 발전량 예측 모델을 구현하고자 한다. 따라서 다중회귀분석에 이용할 기상데이터를 선별하기 위해 단계적 선택법을 이용하여 각 변수에 대한 중요성을 점검하여 중요하지 않은 변수를 제거하거나, 다시 다른 변수를 추가하는 방식으로 사용하고자 하는 기상데이터를 선별하였다. 단계적 선택법을 통해 유의미한 변수를 선별한 결과 일사량과 습도 데이터를 사용한 모델의 결정계수가 0.792로 0.8에 가까운 높은 수치를 기록하였으며 두 가지 독립변수의 유의확률이 모두 0.01 미만으로 매우 작은 수치로 나타나 통계적으로 유의한 결과를 나타내었다. 따라서 구현된 태양광발전량 예측모델을 사용하여 13시간 동안의 실제 발전량 데이터와 비교하는 방식으로 시험을 진행하였다.
실제 태양광 발전량 데이터와 구현된 태양광발전량 예측모델을 통해 예측한 발전량을 비교한 결과 07시부터 19시까지 시간대별 발전량의 평균 오차율은 7.7%를 기록하여 태양광발전량예측에 있어 적합한 모델로 판단되었다. 예측 발전량 그래프는 실제 발전량 그래프와 비교하여 매우 유사한 모양을 나타내었으며 전체 시간대 중 08시부터 09시, 11시부터 14시, 그리고 18시부터 19시에는 7% 이상의 예측 오차율을 기록하였다. 그 외의 시간대에는 평균 4%의 오차율을 기록하였다.
예측모델의 성능을 확인하기 위해 동일한 시험방법으로 경주 발전소 3곳의 태양광 발전량을 예측한 결과 RMSE 및 MAE 값은 경주 A 발전소의 경우 각각 4.36kWh, 3.55kWh. 경주 B 발전소의 경우 각각 3.76kWh, 3.29kWh, 그리고 경주 C 발전소의 경우 각각 3.74kWh, 3.06kWh로 나타났다. 이는 안산 태양광 발전소의 태양광 발전량 예측모델의 RMSE, MAE가 각각 4.96kWh, 3.22kWh인 것을 감안했을 때 경주 태양광 발전소 3곳에 대한 태양광 발전량 예측모델은 대한 태양광 발전량 예측모델은 상대적으로 작은 RMSE 및 MAE 값이 나타났기 때문에 오차가 적은 발전량 예측 모델이 구현되었다는 것을 알 수 있다. 경주 발전소들의 평균 오차율은 각각 7.4%, 6.9%, 6.8%로 안산 발전소가 기록한 7.7%보다 낮았으며, 이때 3곳의 경주 발전소 들은 07시부터 19시까지 중 12시부터 17시까지 평균 오차율이 5%미만으로 높은 예측률을 나타냈지만, 일몰 직전, 직후 시간대인 17시부터 19시 사이의 오차율이 10%이상으로 다소 높게 기록되었다.
오차가 발생된 이유는 실제 현장에서 발생될 수 있는 전운량에 따른 흐림 유무, 강우 여부, 건물이나 주변 환경으로 인해 발생된 응달 혹은 그림자로 인해 태양광 패널에 직접적인 영향을 주어 전체 시간대 중 일부 시간대에 오차를 발생시킨 점, 그리고 발전량 데이터의 자릿수 표기로 인한 오차값이 존재하는 점이 원인으로 사료된다.

Renewable energy is emerging worldwide as a way to reduce the use of fossil fuels, one of the causes of greenhouse gas generation, to reduce carbon emissions and greenhouse gas emissions. However, renewable energy has power generation characteristics that generate power depending on natural conditions such as solar radiation, air volume, and wind speed, so there is uncertainty about output power. The power generation characteristics of intermittent new and renewable energy can cause a large difference between the total power generation and the actual power generation, raising concerns about a lack of power supply due to a failure to predict demand for power generation.
Because of this problem, the government introduced a renewable energy generation prediction system to supplement renewable energy generation characteristics and integrate and operate small-scale power plants to stably supply renewable energy. Therefore, in order to operate a stable power system, a technology capable of accurately predicting solar power generation is required, so this paper intends to implement a solar power generation prediction model for predicting solar power generation.
In the existing research cases on solar power generation prediction models, studies have been conducted to predict solar power generation using various data and prediction models. In order to predict future solar power generation, data that needs to be based and an analysis model to analyze it are needed. The data used in each study included past power generation, weather sensor data, weather forecast data, and long-term weather observation data, and solar power generation prediction models were implemented using various modeling such as long-term memory, machine learning, and deep learning.
However, there have been no predictive cases of implementing a solar power prediction model through multiple linear regression analysis using only some of the significant longitudinal weather observations, which can lower the error rate of the predictive model and implement a power generation prediction model through relatively less computation than machine learning or deep learning. Therefore, in this paper, a solar power generation prediction model using multiple linear regression analysis was implemented based on longitudinal weather observation data. To this end, power generation data and longitudinal meteorological observation data of solar power plants were collected and used as data. The collected weather data used data from sunrise time zone (07:00) to sunset time (19:00). Solar power generation data also used measured data in the same time zone.
After that, a correlation analysis between the power generation and the weather data was conducted to determine whether there was a relationship between the collected power generation data and the weather data. After calculating the average and standard deviation of each collected meteorological data variable, meaningful variables were selected by calculating the Spearman correlation coefficient, one of the indicators used to analyze patterns between statistics to select variables that are highly related to each meteorological data variable. At this time, there was a high linear correlation with power generation in the order of solar radiation, ground temperature, sunlight, relative humidity, temperature, total cloud, wind direction, and wind speed, and multiple regression analysis was conducted excluding three data.
The significance of the regression analysis model was determined through the significance probability of each independent variable, and the adjusted coefficient R2 confirmed how well the selected independent variable explains the dependent variable. As a result, the adjusted R2 value was close to 0.80, so it had high explanatory power, and the significance probability value of the regression analysis model itself was less than 0.001, which was statistically significant because it was much less than 0.05.
After that, multiple regression analysis was conducted again by selecting variables to be used using the variable selection method to select data judged to be highly significant to lower the data error rate of independent variables and increase the validity of the regression analysis model. Among the variable selection methods used in regression analysis, this paper selects and removes non-important variables and implements a solar power generation prediction model using only significant independent variables. Therefore, in order to select weather data to be used for multiple regression analysis, the importance of each variable was checked using a step-by-step selection method to remove non-important variables or to add other variables again. As a result of selecting significant variables through the stepwise selection method, the determination coefficient of the model using solar radiation and humidity data was 0.792, which was close to 0.8, and the significance probability of both independent variables was very small, less than 0.01, showing statistically significant results. Therefore, the test was conducted by comparing it with the actual power generation data for 13 hours using the implemented solar power generation prediction model.

As a result of comparing the actual solar power generation data with the predicted solar power generation model implemented, the average error rate of power generation by time from 07:00 to 19:00 was 7.7%, making it a suitable model for predicting solar power generation. The predicted power generation graph showed a very similar shape compared to the actual power generation graph, and the predicted error rate was more than 7% from 08:00 to 09:00, 11:00 to 14:00, and 18:00 to 19:00. In other times, an average error rate of 4% was recorded.
To confirm the performance of the predictive model, RMSE and MAE values were 4.36kWh and 3.55kWh for Gyeongju A power plant, respectively, as a result of predicting solar power at three Gyeongju power plants using the same test method. It was 3.76kWh and 3.29kWh for Gyeongju B power plant, respectively, and 3.74kWh and 3.06kWh for Gyeongju C power plant, respectively. Considering that the RMSE and MAE of the Ansan solar power plant''s solar power prediction model are 4.96kWh and 3.22kWh, respectively, the solar power prediction model for three Gyeongju solar power plants has relatively small RMSE and MAE values. The average error rate of Gyeongju power plants was 7.4%, 6.9%, and 6.8%, respectively, lower than the 7.7% recorded by Ansan power plants. The average error rate of the three Gyeongju power plants was less than 5% from 07:00 to 17:00, but the error rate between 17:00 and 19:00, just before and after sunset.
The reason for the error is the presence or absence of cloudiness due to the actual amount of traffic that may occur at the site, rainfall, condensation or shadow caused by buildings or surrounding environments, causing errors in some time zones.

#태양광 발전량 예측 #회귀분석 #다중 선형 회귀분석 #단계선택법 #신재생에너지

1. 서론 1
2. 예측 기법 선행 연구사례 8
3. 선형 회귀모형 16
3.1 선형 회귀모형 16
3.2 다중 선형 회귀모형 21
4. 발전량 예측을 위한 다중 선형 회귀모델 구현 30
4.1 기상 데이터 수집 30
4.2 기상 데이터 특성 분석 45
5. 다중 선형 회귀모델 구축 55
5.1 기상 데이터 변수에 의한 다중 선형 회귀분석 59
5.2 선택한 기상 데이터 변수에 의한 다중 선형 회귀분석 61
5.3 태양광 발전량 예측모델의 성능평가 70
6. 구현된 태양광 발전량 예측 모델의 적용 77
7. 결론 93
참고문헌 98
Abstract 103

최근 본 자료

전체보기

구분	그룹	데이터 항목
AI 학습용 데이터	원문	원문 PDF 파일
AI 학습용 데이터	원문 + 메타 (기본/상세)	원문 PDF 파일 및 서지정보 CSV
대량 구매용 데이터	B2B 구독 방식	특정 자료 한정으로 원문 접근 권한 부여
대량 구매용 데이터	URL 전달 방식	바로 PDF 뷰어를 열람할 수 있는 URL 제공

구분	그룹	데이터 항목
AI 학습용 데이터	기본 메타	발행기관명, 간행물명, 권호명, 권(vol), 호(issue), 통권, 발행연도, 발행월, 논문명, 저자명, 시작페이지, 종료페이지, 전체페이지, 상세페이지URL
상세 메타 데이터	발행기관 메타	발행기관 이명, 영문명, 창립연도, 홈페이지URL, 발행기관 소개
	간행물 메타	부제목, 간행물 유형, ISSN, ISBN, 최초발행연도, 폐간연도, 간행빈도, 발행주기, 등재사항, 이용수, 피인용수, 권호수, 논문수, 표지이미지
	논문 메타	작성 언어, 부제목, 대등제목, 목차, 키워드, 초록, 이미지, 참고문헌, 이용수, 피인용수, 논문활용도, DBpia통합주제분류, KDC분류, DDC분류, 한국연구재단분류, UCI, DOI
	저자 메타	소속기관, 소속부서, 직급, 연구분야, 연구키워드, 이용수, 피인용수, 저자 논문활용도

구분	그룹	데이터 항목
※ 결합형/맞춤형 메타 데이터는 신청 내용에 따라 다양하게 제공 가능
이용순위 정보	주제분야별 많이 이용된 논문	“인문학”에서 많이 이용된 논문 TOP100
	이용기관별 많이 이용된 논문	“중고등학교”에서 많이 이용된 논문 TOP100
	세부기관별 많이 이용된 논문	“서울대학교”에서 많이 이용된 논문 TOP100
	키워드별 많이 이용된 논문	“Chat GPT”에서 많이 이용된 논문 TOP100
키워드 정보	많이 이용된 키워드	특정기간/분야/저널 내 많이 이용된 키워드
	많이 발행된 키워드	특정기간/분야/저널 내 많이 발행된 키워드
	많이 검색된 키워드	특정기간/분야/저널 내 많이 검색된 키워드
	연구 트렌드 키워드	특정 키워드 연관 연구동향 분석 데이터 키워드

논문 기본 정보

이 논문의 연구 히스토리 (2)

초록· 키워드

목차

최근 본 자료

댓글(0)