Archive Issue – Vol.4, Issue.3 (Jul-Sep 2024)

Archive Issue – Vol.4, Issue.3 (July-September 2024)


Text-to-Image Conversion: Advancements, Challenges, and Future Directions

Arun Pratap Singh, Sanjay Kumar Sharma

Research Paper | Journal Paper

Vol.4, Issue.3, pp.1-04, Sep-2024

Abstract

Text-to-image conversion, an emerging domain in artificial intelligence (AI), focuses on synthesizing realistic images from natural language descriptions. This research paper explores the fundamental principles, recent advancements, and challenges associated with text-to-image conversion. Leveraging deep learning models, particularly generative adversarial networks (GANs), variational autoencoders (VAEs), and diffusion models, researchers have made significant progress in bridging the gap between textual semantics and visual representation. Despite these advancements, challenges such as maintaining semantic accuracy, generating high-resolution images, and addressing biases persist. This paper presents a comprehensive study of related works, problem statements, proposed methodologies, results, and potential future directions in the field.

Key-Words / Index Term: Text-to-Image Generation, Deep Learning, Generative Adversarial Networks (GANs), Diffusion Models, Natural Language Processing (NLP), Computer Vision.

References

      1. Mirza, M., & Osindero, S. (2014). Conditional Generative Adversarial Nets. arXiv preprint arXiv:1411.1784.
      2. Zhang, H., Xu, T., Li, H., Zhang, S., Wang, X., Huang, X., & Metaxas, D. N. (2017). StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks. Proceedings of the IEEE International Conference on Computer Vision (ICCV).
      3. Xu, T., Zhang, P., Huang, Q., Zhang, H., Gan, Z., Huang, X., & He, X. (2018). AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
      4. Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning Transferable Visual Models From Natural Language Supervision. Proceedings of the International Conference on Machine Learning (ICML).
      5. Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., & Chen, M. (2022). Hierarchical Text-Conditional Image Generation with CLIP Latents. arXiv preprint arXiv:2204.06125.
      6. Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022). High-Resolution Image Synthesis with Latent Diffusion Models. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Citation

Arun Pratap Singh, Sanjay Kumar Sharma, "Text-to-Image Conversion: Advancements, Challenges, and Future Directions" International Journal of Scientific Research in Technology & Management, Vol.4, Issue.3, pp.1-04, 2024.

A Review on Unusual Activity Detection for Prior Appraisal against Crime

Swati Bhargava, Arun Jhapate

Research Paper | Journal Paper

Vol.4, Issue.3, pp.05-08, Sep-2024

Abstract

Unusual activities on public areas and personal safety are seriously endangered. Millions of video surveillance systems are used in public areas, such as roads, prisons, holy sites, airports, and supermarkets. Video surveillance cameras are not intelligent enough to detect unusual activities even in real time. It is necessary to investigate the detection and recognition of the contents of suspicious activities from surveillance video. It is necessary to identify crook status in real time from video surveillance for quick and immediate management. The purpose of this paper is to review various implemented systems and their drawbacks. Most systems use Gaussian filters to classify objects according to gestures identified from a video or frame. Some systems are based on background or foreground subtraction that work with two layers; The first one is the background and the second is the foreground. But these systems are good enough for simple backgrounds or less crowded areas. These systems are recognizing many more basic things such as walking, sitting, running, waving hands, clapping, and classifying unusual activities. A system is required that must be intelligent enough to recognize unusual activities from a crowded area in real time.

Key-Words / Index Term: Unusual Activity Recognition, Gaussian Filter, Foreground, Video Surveillance.

References

      1. Massive Software, “World War Z Crowd Simulation,” Available: http://www.massivesoftware.com/wwz.html.
      2. Z. Hammal, W.-S. Chu, J. F. Cohn, C. Heike, and M. L. Speltz, “Automatic Action Unit Detection in Infants Using Convolutional Neural Network,” in Proc. 2017 Seventh Int. Conf. on Affective Computing and Intelligent Interaction (ACII), IEEE, 2017.
      3. H. Xu, C. Yuan, P. Li, and Y. Wang, “Design and Implementation of Action Recognition System Based on RFID Sensor,” in Proc. 2017 13th Int. Conf. on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), IEEE, 2017.
      4. V. S. Nanaware, M. H. Nerkar, and C. M. Patil, “A Review of the Detection Methodologies of Multiple Human Tracking & Action Recognition in a Real Time Video Surveillance,” in Proc. IEEE Int. Conf. on Power, Control, Signals and Instrumentation Engineering (ICPCSI), 2017.
      5. J. Li, H. Wu, and X. Zhou, “PeMapNet: Action Recognition from Depth Videos Using Pyramid Energy Maps on Neural Networks,” in Proc. 2017 Int. Conf. on Tools with Artificial Intelligence, IEEE, 2017.
      6. N. E. D. Elmadany, Y. He, and L. Guan, “Information Fusion for Human Action Recognition via Biset/Multiset Globality Locality Preserving Canonical Correlation Analysis,” IEEE Trans. Image Process., 2018.
      7. K. Aouaidjia, B. Sheng, P. Yang, P. Li, R. Shen, and D. D. Feng, “Deep Convolutional Neural Networks for Human Action Recognition Using Depth Maps and Postures,” IEEE Trans. Systems, Man, and Cybernetics: Systems, 2018.
      8. S. Sen, M. Dhar, and S. Banerjee, “Implementation of Human Action Recognition Using Image Parsing Techniques,” in Proc. 2018 Emerging Trends in Electronic Devices and Computational Techniques (EDCT), IEEE, 2018.
      9. Cornell University, “Human Activities Dataset,” Available: http://pr.cs.cornell.edu/humanactivities/.
      10. SlidePlayer, “Human Activity Recognition,” Available: https://slideplayer.com/slide/6932185/.
      11. Quora, “What is the easiest way to recognise gestures in OpenCV using Python?” Available: https://www.quora.com/What-is-the-easiest-way-to-recognise-gestures-in-OpenCV-using-Python.
      12. Frontiers, “Action Recognition Image Example,” Available: https://www.frontiersin.org/.../frobt-02-00028-g008.jpg.
      13. Semantic Scholar, “Motion Influence Map for Unusual Human Activity and Crowd Behavior Analysis,” Available: Link.
      14. HVSS Events, “Intro Concealed Carry,” Available: https://hvss.eventsmart.com/events/intro-concealed-carry/hqdefault/.
      15. YouTube, “Gesture Recognition Example,” Available: https://www.youtube.com/watch?v=l1Ui4U-1dNw.
      16. A. Ghasemi and C. N. R. Kumar, “A Novel Algorithm to Predict and Detect Suspicious Behaviors of People at Public Areas for Surveillance Cameras,” in Proc. Int. Conf. on Intelligent Sustainable Systems (ICISS), IEEE, 2017. [Online]. Available: IEEE Xplore, ISBN: 978-1-5386-1959-9.

Citation

Prakash Chandr Nandanwar, Ankur Taneja, "A Review on Unusual Activity Detection for Prior Appraisal against Crime" International Journal of Scientific Research in Technology & Management, Vol.4, Issue.3, pp.05-08, 2024.

A Review on Automatic Lung Lesion Detection from Various Imaging Techniques

Ayush Bhargava, Madhu Shandilya, Vijayshri Chaurasia

Research Paper | Journal Paper

Vol.4, Issue.3, pp.09-14, Sep-2024

Abstract

Lung cancer is horrible disease that may takes human life. It is the most diagnosed cancer in the world that may considered as life threatening disease. If it can be diagnosed earlier then treatment is a solution that may saves human life. There are various imaging techniques through which this disease can be diagnosed and treated accordingly. But Computed Tomographical image commonly known as CT scan image is the better option for diagnosing disease with better level of accuracy as compare to the other imaging techniques such as X-Ray, Ultrasound and many more. If a disease can be diagnosed automatically then it saves medical professional time as well as human life. A routine checkup can become easier and it can be processed in less time. The intension of this paper is to review various previously implemented systems related to the automatic diagnosis of lung cancer. There are so many researches have been done that are based on CNN, DNN, Edge Detection and various image processing or machine learning techniques. The objective of this paper is to define the limitations and drawbacks of various systems that are lacking somewhere.

Key-Words / Index Term: Lung Cancer Detection, Lesion Classification, Imaging Techniques, CNN, DNN, X-Ray, CT Scan.

References

    1. A. M. Gindi, T. A. Al Attiatalla, and M. M. Sami, “A Comparative Study for Comparing Two Feature Extraction Methods and Two Classifiers in Classification of Earlystage Lung Cancer Diagnosis of chest x-ray images,” Journal of American Science, vol. 10, no. 6, pp. 13–22, 2014.
    2. K. Suzuki, M. Kusumoto, S. I. Watanabe, R. Tsuchiya, and H. Asamura, “Radiologic classification of small adenocarcinoma of the lung: radiologic-pathologic correlation and its prognostic impact,” The Annals of Thoracic Surgery, vol. 81, no. 2, pp. 413–419, 2006.
    3. C. C. Su et al., “Impact of low-dose computed tomography screening for primary lung cancer on subsequent risk of brain metastasis,” J. Thorac. Oncol., 2021, doi: 10.1016/j.jtho.2021.05.010.
    4. P. B. Sangamithraa and S. Govindaraju, “Lung tumour detection and classification using EK-Mean clustering,” in Proc. Int. Conf. Wireless Communications, Signal Processing and Networking (WiSPNET), pp. 2201–2206, 2016, doi: 10.1109/WiSPNET.2016.7566533.
    5. S. Uzelaltinbulat and B. Ugur, “Lung tumor segmentation algorithm,” Procedia Comput. Sci., vol. 120, pp. 140–147, 2017.
    6. G. Niranjana and M. Ponnavaikko, “A Review on Image Processing Methods in Detecting Lung Cancer Using CT Images,” in Proc. Int. Conf. Technical Advancements in Computers and Communications (ICTACC), pp. 18–25, 2017, doi: 10.1109/ICTACC.2017.16.
    7. Q. Wu and W. Zhao, “Small-Cell Lung Cancer Detection Using a Supervised Machine Learning Algorithm,” in Proc. Int. Symp. Computer Science and Intelligent Controls (ISCSIC), pp. 88–91, 2017, doi: 10.1109/ISCSIC.2017.22.
    8. S. Makaju, P. W. C. Prasad, A. Alsadoon, A. K. Singh, and A. Elchouemi, “Lung Cancer Detection using CT Scan Images,” Procedia Comput. Sci., vol. 125, pp. 107–114, 2018.
    9. S. Pang, Y. Zhang, M. Ding, X. Wang, and X. Xie, “A Deep Model for Lung Cancer Type Identification by Densely Connected Convolutional Networks and Adaptive Boosting,” IEEE Access, vol. 8, pp. 4799–4805, 2020.
    10. A. Agarwal, K. Patni, and R. Devarajan, “Lung Cancer Detection and Classification Based on Alexnet CNN,” in Proc. Int. Conf. Communication and Electronics Systems (ICCES), 2021, doi: 10.1109/ICCES51350.2021.9489033.
    11. Z. Li and L. Li, “A novel method for lung masses detection and location based on deep learning,” in Proc. IEEE Int. Conf. Bioinformatics and Biomedicine (BIBM), pp. 963–969, 2017.
    12. H. Shi, Z. Peng, and H. Wan, “Pulmonary Nodules Detection Based on CNN Multi-scale Feature Fusion,” in Proc. IEEE Int. Conf. Advanced Infocomm Technology (ICAIT), pp. 86–90, 2019.
    13. M. Vas and A. Dessai, “Lung cancer detection system using lung CT image processing,” in Proc. Int. Conf. Computing, Communication, Control and Automation (ICCUBEA), pp. 1–5, 2017, doi: 10.1109/ICCUBEA.2017.8463851.
    14. W. Abdul, “An Automatic Lung Cancer Detection and Classification (ALCDC) System Using Convolutional Neural Network,” in Proc. Int. Conf. Developments in eSystems Engineering (DeSE), pp. 443–446, 2020, doi: 10.1109/DeSE51703.2020.9450778.
    15. D. Sharma and G. Jindal, “Computer aided diagnosis system for detection of lung cancer in CT scan images,” Int. J. Comput. Elect. Eng., vol. 3, no. 5, pp. 714–718, Sep. 2011.
    16. T. Yoshiya, T. Mimae, and Y. Tsutani, “Prognostic role of subtype classification in small-sized pathologic N0 invasive lung adenocarcinoma,” Ann. Thoracic Surg., vol. 102, no. 5, pp. 1668–1673, 2016.
    17. R. K. Samala, H.-P. Chan, C. Richter, L. Hadjiiski, C. Zhou, and J. Wei, “Analysis of deep convolutional features for detection of lung nodules in computed tomography,” in Proc. SPIE Medical Imaging: Computer-Aided Diagnosis, vol. 10950, p. 109500Q, 2019.
    18. D. Kumar, A. Wong, and D. A. Clausi, “Lung nodule classification using deep features in CT images,” in Proc. Conf. Computer and Robot Vision, pp. 133–138, 2015.
    19. M. Tan, R. Deklerck, B. Jansen, M. Bister, and J. Cornelis, “A novel computer-aided lung nodule detection system for CT images,” Med. Phys., vol. 38, no. 10, pp. 5630–5645, 2011.
    20. A. A. A. Setio et al., “Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: The LUNA16 challenge,” Med. Image Anal., vol. 42, pp. 1–13, 2017.

Citation

MAyush Bhargava, Madhu Shandilya, Vijayshri Chaurasia, "A Review on Automatic Lung Lesion Detection from Various Imaging Techniques" International Journal of Scientific Research in Technology & Management, Vol.4, Issue.3, pp.09-14, 2024.

Automatic Diabetic Retinopathy Disease Diagnosis from Fundus Imaging: A Review

Mohini Rathore, Madhu Shandilya

Research Paper | Journal Paper

Vol.4, Issue.3, pp.15-21, Sep-2024

Abstract

In recent decades, there are many cases can be observed which have been affected by Diabetes. This is the most common disease that can be found in people. If a person having this disease from a long time then that person may also suffer from Diabetic Retinopathy; due to that a person may lost his vision partially or completely as per the condition of the retina or how much tissues have been damaged. Diabetic Retinopathy is a disease that cannot be cured and there is no treatment to repair the retina or vision optics. It only can be prevented by taking care of it with routine checkup from medical professionals. It blocks the blood flow towards retina, due to that; blood vessels get swell and exudates started leaking that may cause partial or complete blindness. The intension of this paper is to review various researches which have been done in the field of diabetic retinopathy. Diabetic retinopathy can be automatically diagnosed through fundus imaging and there are many approaches have been made for pertaining better level of accuracy with minimal error rate. System compares machine learning approaches, classifiers and edge detection techniques that have been used for implementing Automatic Diabetic Retinopathy Disease Diagnosis.

Key-Words / Index Term: Automatic Diabetic Retinopathy Diagnosis, Fundus Imaging, Optic Disc, Optic Cup, CNN, Retinal Image, Hemorrhages.

References

    1. National Eye Institute, ‘Diabetic retinopathy’, [Accessed: 26-March-2022], [Online]. Available: https://www.nei.nih.gov/...
    2. Grace, Annie & Mohideen, S.. (2014). An Economic System for Screening of Diabetic Retinopathy Using Fundus Images. OnLine Journal of Biological Sciences. 14. 254-260. 10.3844/ojbsci.2014.254.260.
    3. EyeRis Vision, ‘Diabetic retinopathy’, [Accessed: 26-March-2022], [Online]. Available: http://www.eyerisvision.com/...
    4. Elia J. Duh, Jennifer K. Sun, Alan W. Stitt, Diabetic retinopathy: current understanding, mechanisms, and treatment strategies, JCI Insight. 2017;2(14):e93751. https://doi.org/10.1172/...
    5. S. Ravishankar, A. Jain and A. Mittal, "Automated feature extraction for early detection of diabetic retinopathy in fundus images," 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, 2009, pp. 210-217.
    6. K. K. Palavalasa and B. Sambaturu, "Automatic Diabetic Retinopathy Detection Using Digital Image Processing," 2018 ICCSP, Chennai, 2018, pp. 0072-0076, doi: 10.1109/ICCSP.2018.8524234.
    7. A. Sopharak, Bunyarit Uyyanonvara and Sarah Barman, “Automatic exudate detection from nondilated diabetic retinopathy retinal images using fuzzy c-means clustering.” Sensors 2009, vol. 9, no. 3, pp. 2148-2161.
    8. T. Walter, J.C. Klein, P. Massin and A. Erginay, “Detection of exudates in color fundus images of the human retina,” IEEE Transactions on Medical Imaging, Vol. 21, Issue 10, 2002.
    9. A. Sopharak, Bunyarit Uyyanonvara, Sarah Barman and Thomas H.Williamson, “Automatic detection of diabetic retinopathy exudates using mathematical morphology methods,” Computerized Medical Imaging and Graphics, 2008, pp. 720–727.
    10. D. Welfer, Jacob Scharcanski and Diane Ruschel Marinho, “A coarse-to-fine strategy for automatically detecting exudates in color eye fundus images,” Computerized Medical Imaging and Graphics, 2010, pp. 228–235.
    11. A. Osareh, B. Shadgar, and R. Markham, “A computational-intelligence-based approach for detection of exudates in diabetic retinopathy images,” IEEE Trans. Inf. Technol. Biomed., vol. 13, no. 4, pp. 535–545, 2009.
    12. Gardner, G & Keating, David & Williamson, Tom & Elliott, A. (1996). Automatic detection of diabetic retinopathy using an artificial neural network: A screening tool. The British Journal of Ophthalmology. 80. 940-4. 10.1136/bjo.80.11.940.
    13. Anupriyaa Mukherjee et al., Int. Journal of Engineering Research and Applications, Vol. 5, Issue 2, Part-4, February 2015, pp. 21-24.
    14. Muhammad Waseem Khan, “Diabetic Retinopathy Detection using Image Processing: A Survey”, International Journal Of Emerging Technology & Research, Volume 1, Issue 1, Nov-Dec, 2013.
    15. M. M. Dharmana and A. M.S., "Pre-diagnosis of Diabetic Retinopathy using Blob Detection," 2020 ICIRCA, pp. 98-101, doi: 10.1109/ICIRCA48905.2020.9183241.
    16. M. Arora and M. Pandey, "Deep Neural Network for Diabetic Retinopathy Detection," 2019 COMITCon, pp. 189-193, doi: 10.1109/COMITCon.2019.8862217.
    17. Y. S. Boral and S. S. Thorat, "Classification of Diabetic Retinopathy based on Hybrid Neural Network," 2021 ICCMC, pp. 1354-1358, doi: 10.1109/ICCMC51019.2021.9418224.
    18. Kanimozhi, J., Vasuki, P. & Roomi, S.M.M. Fundus image lesion detection algorithm for diabetic retinopathy screening. J Ambient Intell Human Comput 12, 7407–7416 (2021). https://doi.org/...
    19. Salman, Ahmad & Siddiqui, Shoaib & Shafait, Faisal & Mian, Ajmal & Shortis, Mark & Khurshid, Khawar & Ulges, Adrian & Schwanecke, Ulrich. (2019). Automatic fish detection in underwater videos by a deep neural network-based hybrid motion learning system. ICES Journal of Marine Science. 77. 10.1093/icesjms/fsz025.
    20. P. Kokare, "Wavelet based automatic exudates detection in diabetic retinopathy," 2017 WiSPNET, pp. 1022-1025, doi: 10.1109/WiSPNET.2017.8299917.
    21. N. Karami and H. Rabbani, "A dictionary learning based method for detection of diabetic retinopathy in color fundus images," 2017 MVIP, pp. 119-122, doi: 10.1109/IranianMVIP.2017.8342333.
    22. Dailyhunt, ‘Diabetic retinopathy can cause vision loss’, [Accessed: 26-March-2022], [Online]. Available: https://m.dailyhunt.in/...
    23. Sisodia D. S, Nair S, Khobragade P. Diabetic Retinal Fundus Images: Preprocessing and Feature Extraction for Early Detection of Diabetic Retinopathy. Biomed Pharmacol J 2017.
    24. Klein R, Klein BE, Moss SE, Davis MD and DeMets DL, “The Wisconsin epidemiologic study of diabetic retinopathy. II Prevalence and risk of diabetic retinopathy when age at diagnosis is less than 30 years,” Arch Ophthalmology 1984, vol. 102, pp. 527–532.
    25. B. Harangi, I. Lazar and A. Hajdu, “Automatic Exudate Detection Using Active Contour Model and Region wise Classification,” IEEE EMBS 2012, pp. 5951–5954.
    26. Balazs Harangi, Balint Antal and Andras Hajdu, “Automatic Exudate Detection with Improved Naive-Bayes Classifier,” CBMS 2012, pp. 1–4.
    27. K. Zuiderveld, “Contrast Limited Adaptive Histogram Equalization,” Graphics Gems IV, Academic Press 1994, pp. 474–485.
    28. M. N. Langroudi and Hamed Sadjedi, “A New Method for Automatic Detection and Diagnosis of Retinopathy Diseases in Colour Fundus Images Based on Morphology,” International Conference on Bioinformatics and Biomedical Technology 2010, pp. 134–138.
    29. S. Chaudhauri, S. Chatterjee, N. Katz, M. Nelson and M. Goldbaum, "Detection of blood vessels in retinal images using two dimensional matched filters," IEEE Trans. Medical Imaging, vol. 8.
    30. X. Jiang and D. Mojon, “Adaptive local thresholding by verification-based multithreshold probing with application to vessel detection in retinal images,” IEEE TPAMI, vol. 25, no. 1, pp. 131–137, Jan. 2003.
    31. J. Staal, M. D. Abràmoff, M. Niemeijer, M. A. Viergever, and B. v. Ginneken, “Ridge based vessel segmentation in color images of the retina,” IEEE Trans. Med. Imag., vol. 23, no. 4, pp. 501–509, Apr. 2004.
    32. S. K. Kuri, "Automatic diabetic retinopathy detection using Gabor filter with local entropy thresholding," 2015 ReTIS, pp. 411-415, doi: 10.1109/ReTIS.2015.7232914.

Citation

Mohini Rathore, Madhu Shandilya, "Automatic Diabetic Retinopathy Disease Diagnosis from Fundus Imaging: A Review" International Journal of Scientific Research in Technology & Management, Vol.4, Issue.3, pp.15-21, 2024.