๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
728x90
๋ฐ˜์‘ํ˜•

๐Ÿค–๋จธ์‹ ๋Ÿฌ๋‹8

sklearn - ์ŠคํŒธ ๋ฉ”์„ธ์ง€ ๋ถ„๋ฅ˜(spam-text-message-classification) notebook Spam Text ๋ฐ์ดํ„ฐ์…‹ https://www.kaggle.com/datasets/team-ai/spam-text-message-classification Spam Text Message Classification Let's battle with annoying spammer with data science. www.kaggle.com Write-up ๋ฐ์ดํ„ฐ ๊ด€๋ฆฌ์— ํ•„์š”ํ•œ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋“ค import import numpy as np # linear algebra import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv) SPAM Text ๋ฐ์ดํ„ฐ์…‹ ๋ถˆ๋Ÿฌ์˜ค๊ธฐ ๋ฐ ๋ฐ์ดํ„ฐ ์š”์•ฝ df = pd.read_csv("/kaggle/in.. 2024. 4. 8.
OCR - ํ…Œ์„œ๋ž™ํŠธ ํ•œ๊ตญ์–ด(ํ•œ๊ธ€) ์ธ์‹ํ•˜๊ธฐ ์•„๋ž˜ ๊นƒํ—ˆ๋ธŒ์—์„œ ํ•œ๊ตญ์–ด ํŠธ๋ ˆ์ธ ๋ฐ์ดํ„ฐ ํŒŒ์ผ์„ ๋‹ค์šด๋กœ๋“œ ํ•˜๊ณ  Tesseract-OCR/tessdata(C:\Program Files\Tesseract-OCR\tessdata)์— ๋„ฃ์–ด์ค€๋‹ค. https://github.com/tesseract-ocr/tessdata_best/blob/main/kor.traineddata GitHub - tesseract-ocr/tessdata_best: Best (most accurate) trained LSTM models. Best (most accurate) trained LSTM models. Contribute to tesseract-ocr/tessdata_best development by creating an account on GitHub. github.com .. 2023. 4. 12.
๋จธ์‹ ๋Ÿฌ๋‹ - ๋จธ์‹ ๋Ÿฌ๋‹ ์‰ฝ๊ฒŒ ๋ฐฐ์šฐ๊ธฐ ์˜์ƒ: https://youtu.be/432p379XXMw ์›๋ฌธ: https://medium.com/@calebkaiser/dont-learn-machine-learning-8af3cf946214 Don’t learn machine learning Learn how to build software with ML models medium.com ์š”์•ฝ: ๋จธ์‹ ๋Ÿฌ๋‹์„ ๋ฐฐ์šฐ๊ธฐ ์œ„ํ•ด ๋ฐ‘๋ฐ˜์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๊ตฌ์กฐ๋ถ€ํ„ฐ ๊ณต๋ถ€ํ•˜๋Š” ๊ฒƒ์€ ๋งˆ์น˜ ์‘์šฉ ํ”„๋กœ๊ทธ๋žจ ๊ฐœ๋ฐœ์ž๊ฐ€ ๋กœ์šฐ ๋žญ๊ท€์ง€์ธ ์–ด์…ˆ๋ธ”๋ฆฌ ์–ธ์–ด๋ฅผ ๋ฐฐ์šฐ๋Š” ๊ฒƒ๊ณผ ๋น„์Šทํ•˜๋‹ค. ์†Œํ”„ํŠธ์›จ์–ด ๊ฐœ๋ฐœ์„ ์œ„ํ•œ ๋จธ์‹ ๋Ÿฌ๋‹์„ ๋ฐฐ์šฐ๊ธฐ ์œ„ํ•ด์„œ๋Š” ํƒ‘-๋‹ค์šด ๋ฐฉ์‹๊ณผ ์‹คํ–‰์„ ํ†ตํ•œ ํ•™์Šต ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜๋ผ chatGPT, YOLO ๊ฐ™์€ ํ”„๋กœ์ ํŠธ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ๋„ ์ข‹์€ ์ˆ˜๋‹จ์ด๋‹ค. 2023. 3. 13.
YOLO - ๊ฐ„๋‹จํ•œ ์‚ฌ๋ฌผ ์ธ์‹ ์˜ˆ์ œ(YOLOv5, Colab) YOLO(You Only Look Once)๋Š” ๋”ฅ๋Ÿฌ๋‹์„ ์ด์šฉํ•œ ์‚ฌ๋ฌผ ์ธ์‹ ํ”„๋ ˆ์ž„์›Œํฌ๋‹ค. ๋งŽ์€ ์ธ๊ธฐ ํƒ“์— ๋‹ค์–‘ํ•œ ๋ฒ„์ „๋“ค(v3, v4, v5...)์ด ์ƒ๊ฒจ๋‚˜๊ณ  ์žˆ๋‹ค. ๋‚ด๊ฐ€ ์‚ฌ์šฉํ•  ์˜ˆ์ œ์˜ ๋ฒ„์ „์€ YOLOv5์ด๋‹ค. ๊นƒํ—ˆ๋ธŒ https://github.com/ultralytics/yolov5/wiki/Train-Custom-Data Train Custom Data YOLOv5 ๐Ÿš€ in PyTorch > ONNX > CoreML > TFLite. Contribute to ultralytics/yolov5 development by creating an account on GitHub. github.com ์œ„ ๊นƒํ—ˆ๋ธŒ ํŽ˜์ด์ง€๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋”ฐ๋ผ ํ•˜์˜€๋‹ค. Roboflow - ์ปค์Šคํ…€ ๋ฐ์ดํ„ฐ์…‹ ๋งŒ๋“ค๊ธฐ https://app.rob.. 2023. 3. 12.
OCR - ํ…Œ์„œ๋ž™ํŠธ ๊ธฐ๋ณธ ๋ช…๋ น์–ด tesseract ./captcha.png stdout -l eng --oem 3 --psm 10 2022. 6. 27.
ํ…์„œํ”Œ๋กœ์šฐ - ๊ฐ„๋‹จํ•œ ๊ธฐ๊ณ„ํ•™์Šต ์‹œํ‚ค๊ธฐ https://teachablemachine.withgoogle.com/ Teachable Machine Train a computer to recognize your own images, sounds, & poses. A fast, easy way to create machine learning models for your sites, apps, and more – no expertise or coding required. teachablemachine.withgoogle.com ์—ฌ๋Ÿฌ๊ฐ€์ง€ ์ข…๋ฅ˜์˜ ํ”„๋กœ์ ํŠธ๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์Œ [์ด๋ฏธ์ง€ ํ”„๋กœ์ ํŠธ ๋กœ์ปฌ์—์„œ ์‹คํ–‰์‹œํ‚ค๊ธฐ] import os os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2' import tensorflow as tf from .. 2022. 5. 19.
OCR - ํ…Œ์„œ๋ž™ํŠธ(Tesseract) ์œˆ๋„์šฐ ๋ฒ„์ „ ์„ค์น˜ https://github.com/UB-Mannheim/tesseract/wiki GitHub - UB-Mannheim/tesseract: Tesseract Open Source OCR Engine (main repository) Tesseract Open Source OCR Engine (main repository) - GitHub - UB-Mannheim/tesseract: Tesseract Open Source OCR Engine (main repository) github.com 2022. 5. 17.
ํ…์„œํ”Œ๋กœ์šฐ - ๊ฝƒ ์ด๋ฏธ์ง€ ๋งž์ถ”๊ธฐ from shutil import ExecError import matplotlib.pyplot as plt import numpy as np import os import PIL import tensorflow as tf from tensorflow import keras from tensorflow.keras import layers from tensorflow.keras.models import Sequential # ๋ฐ์ดํ„ฐ์„ธํŠธ ๋‹ค์šด๋กœ๋“œ ๋ฐ ํƒ์ƒ‰ํ•˜๊ธฐ import pathlib dataset_url = "https://storage.googleapis.com/download.tensorflow.org/example_images/flower_photos.tgz" data_dir = tf.keras.ut.. 2022. 4. 29.
728x90
๋ฐ˜์‘ํ˜•