CuGenDBv2

MC03g1024 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1024
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: MC03:16694994..16702081
RNA-Seq Expression: MC03g1024
Synteny: MC03g1024
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004314 - Neprosin; IPR025521 - Neprosin activation peptide
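
The genome location field follows a `chromosome:start..end` pattern. A minimal sketch of how the locus span could be derived from that string is shown here; it assumes the coordinates are 1-based and inclusive (the page does not state the convention), and the helper name `parse_location` is hypothetical, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC03:16694994..16702081")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

The span covers the whole locus, including introns and untranslated regions, so it is expected to be considerably longer than the CDS listed further down.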


Homology
GenBank top hits: e-value, % identity, alignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]; e-value: 6.64e-268; identity: 86.6%
Query:  LLQTLLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVA
        L+ T L  + +T   H  +P  NQT FHP KEL+KLKHIRAYLRKINKP  KTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLEPPERPR N S E VA
Subjt:  LLQTLLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVA

Query:  ESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN
        ++ QLWS SGEFCPEGTIPIRRT E DI RA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGN
Subjt:  ESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN

Query:  DLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPA
        DLNTIEAGWQ S PELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY GKQFDIGLMVWKDPKHGHWWLEYGSG+LVGYWPA
Subjt:  DLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPA

Query:  FLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNV
        FLFSHLRSH SMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+NNVWGTYFYYGGPGR V
Subjt:  FLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNV

Query:  KCP
        +CP
Subjt:  KCP

XP_022137033.1 uncharacterized protein LOC111008593 [Momordica charantia]; e-value: 9.43e-292; identity: 98.7%
Query:  IPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP
        +P  NQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP
Subjt:  IPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP

Query:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD
        IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQ S PELYGD
Subjt:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD

Query:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
        SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
Subjt:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI

Query:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
Subjt:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

XP_022923706.1 uncharacterized protein LOC111431334 [Cucurbita moschata]; e-value: 8.51e-269; identity: 86.82%
Query:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE
        L+ T    +  T   H   P +   FHP+KEL +LKHIRAYLRKINKP+TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PPERPRGNNS E VAE
Subjt:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE

Query:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND
        SFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQIWVISGSFGND
Subjt:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND

Query:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF
        LNTIEAGWQ S PELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY GKQFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAF
Subjt:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF

Query:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK
        LFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+ WGTYFYYGGPGRNVK
Subjt:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK

Query:  CP
        CP
Subjt:  CP

XP_023000690.1 uncharacterized protein LOC111495054 [Cucurbita maxima]; e-value: 9.92e-268; identity: 90.36%
Query:  PFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP
        P  NQT FHP+KEL +LKHIRAYLRKINKP TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL PPERPRGNNS E VAE+FQLWS SG+FCPEGTIP
Subjt:  PFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP

Query:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD
        IRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQIWVISGSFGNDLNTIEAGWQ S PELYGD
Subjt:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD

Query:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
        +NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSYNGKQFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
Subjt:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI

Query:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        VNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+VWGTYFYYGGPGRNVKCP
Subjt:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]; e-value: 4.59e-275; identity: 93.51%
Query:  IPFSNQTF-HPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTI
        IP  NQTF HPAKEL+KLKHIR YLRKINKP  KTIRSSDGDVIDCV+SHLQPAFDHPELKGHTPLEPPERPRGNNS E VAE+FQLWS SG+FCPEGTI
Subjt:  IPFSNQTF-HPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTI

Query:  PIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYG
        PIRRT E DI RASS RRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQ S PELYG
Subjt:  PIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYG

Query:  DSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGE
        D+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY+GKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSH SMVQFGGE
Subjt:  DSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGE

Query:  IVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        IVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ NNNVWGTYFYYGGPGRNVKCP
Subjt:  IVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

TrEMBL top hits: e-value, % identity, alignment
A0A6J1C958 uncharacterized protein LOC111008593; e-value: 4.56e-292; identity: 98.7%
Query:  IPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP
        +P  NQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP
Subjt:  IPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP

Query:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD
        IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQ S PELYGD
Subjt:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD

Query:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
        SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
Subjt:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI

Query:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
Subjt:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

A0A6J1E7H5 uncharacterized protein LOC111431334; e-value: 4.12e-269; identity: 86.82%
Query:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE
        L+ T    +  T   H   P +   FHP+KEL +LKHIRAYLRKINKP+TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PPERPRGNNS E VAE
Subjt:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE

Query:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND
        SFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQIWVISGSFGND
Subjt:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND

Query:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF
        LNTIEAGWQ S PELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY GKQFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAF
Subjt:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF

Query:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK
        LFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+ WGTYFYYGGPGRNVK
Subjt:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK

Query:  CP
        CP
Subjt:  CP

A0A6J1GRA4 uncharacterized protein LOC111456810; e-value: 5.32e-267; identity: 86.35%
Query:  LLQTLLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVA
        L+ T L  + +T   H  +P  NQT FHP KEL+KLKHIRAYLRKINKP  KTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLEPPERPR N S E VA
Subjt:  LLQTLLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVA

Query:  ESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN
        ++ QLWS SGEFCPEGTIPIRRT E DI RA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGN
Subjt:  ESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN

Query:  DLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPA
        DLNTIEAGWQ S PELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTN++IAIGAAISP+SSY GKQFDIGLMVWKDPKHGHWWLEYGSG LVGYWPA
Subjt:  DLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPA

Query:  FLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNV
        FLFSHLRSH SMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+NNVWGTYFYYGGPGR V
Subjt:  FLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNV

Query:  KCP
        +CP
Subjt:  KCP

A0A6J1JZN8 uncharacterized protein LOC111490277; e-value: 3.61e-267; identity: 86.35%
Query:  LLQTLLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVA
        L+ T L  + +T   H  +P  NQT FHP KEL+KLK+IRAYLRKINKP  KTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLEPPERPR N S E VA
Subjt:  LLQTLLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVA

Query:  ESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN
        ++ QLWS SGEFCPEGTIPIRRT E DI RA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGN
Subjt:  ESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN

Query:  DLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPA
        DLNTIEAGWQ S PELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY GKQFDIGLMVWKDPKHGHWWLEYGSG+LVGYWPA
Subjt:  DLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPA

Query:  FLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNV
        FLFSHLRSH SMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+NNVWGTYFYYGGPGR V
Subjt:  FLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNV

Query:  KCP
        +CP
Subjt:  KCP

A0A6J1KJ26 uncharacterized protein LOC111495054; e-value: 4.80e-268; identity: 90.36%
Query:  PFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP
        P  NQT FHP+KEL +LKHIRAYLRKINKP TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL PPERPRGNNS E VAE+FQLWS SG+FCPEGTIP
Subjt:  PFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIP

Query:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD
        IRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQIWVISGSFGNDLNTIEAGWQ S PELYGD
Subjt:  IRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQASSPELYGD

Query:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
        +NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSYNGKQFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI
Subjt:  SNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEI

Query:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        VNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+VWGTYFYYGGPGRNVKCP
Subjt:  VNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

SwissProt top hits: e-value, % identity, alignment
No hits found
Arabidopsis top hits: e-value, % identity, alignment
AT1G23340.1 Protein of Unknown Function (DUF239); e-value: 2.9e-174; identity: 70.66%
Query:  SSHHHLLQTLLQRLGATITNHSPIPFSNQT--FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNN
        SS   L  T +  L    +  SP   +++T    P +E+QK+K IR  L+KINKP+ KTI SSDGD IDCV SH QPAFDHP L+G  P++PPE P G +
Subjt:  SSHHHLLQTLLQRLGATITNHSPIPFSNQT--FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNN

Query:  STEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVI
              E+FQLWS  GE CPEGTIPIRRT E D+LRA+SVRRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEFSLSQIW+I
Subjt:  STEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVI

Query:  SGSFGNDLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLL
        +GSF  DLNTIEAGWQ  SPELYGD+NPRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY G QFDI L++WKDPKHGHWWL++GSG L
Subjt:  SGSFGNDLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLL

Query:  VGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYG
        VGYWP  LF+HLR HG+MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR G N VWG +FYYG
Subjt:  VGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYG

Query:  GPGRNVKCP
        GPG+N KCP
Subjt:  GPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239); e-value: 2.9e-174; identity: 70.66%
Query:  SSHHHLLQTLLQRLGATITNHSPIPFSNQT--FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNN
        SS   L  T +  L    +  SP   +++T    P +E+QK+K IR  L+KINKP+ KTI SSDGD IDCV SH QPAFDHP L+G  P++PPE P G +
Subjt:  SSHHHLLQTLLQRLGATITNHSPIPFSNQT--FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNN

Query:  STEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVI
              E+FQLWS  GE CPEGTIPIRRT E D+LRA+SVRRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEFSLSQIW+I
Subjt:  STEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVI

Query:  SGSFGNDLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLL
        +GSF  DLNTIEAGWQ  SPELYGD+NPRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY G QFDI L++WKDPKHGHWWL++GSG L
Subjt:  SGSFGNDLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLL

Query:  VGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYG
        VGYWP  LF+HLR HG+MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR G N VWG +FYYG
Subjt:  VGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYG

Query:  GPGRNVKCP
        GPG+N KCP
Subjt:  GPGRNVKCP

AT1G70550.1 Protein of Unknown Function (DUF239); e-value: 4.5e-175; identity: 71.64%
Query:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE
        LL  L+    ++ T+ S    ++QT  P +ELQKL  IR  L KINKP+ KTI+SSDGD IDCV +H QPAFDHP L+G  PL+PPE P+G +  +G  E
Subjt:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE

Query:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND
        + QLWS SGE CPEGTIPIRRT E D+LRASSV+RFGRK IRRV+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIWVI+GSF +D
Subjt:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND

Query:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF
        LNTIEAGWQ  SPELYGD+ PRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG LVGYWPAF
Subjt:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF

Query:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK
        LF+HL+ HGSMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G N VWG YFYYGGPG+N +
Subjt:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK

Query:  CP
        CP
Subjt:  CP

AT1G70550.2 Protein of Unknown Function (DUF239); e-value: 4.5e-175; identity: 71.64%
Query:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE
        LL  L+    ++ T+ S    ++QT  P +ELQKL  IR  L KINKP+ KTI+SSDGD IDCV +H QPAFDHP L+G  PL+PPE P+G +  +G  E
Subjt:  LLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAE

Query:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND
        + QLWS SGE CPEGTIPIRRT E D+LRASSV+RFGRK IRRV+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIWVI+GSF +D
Subjt:  SFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND

Query:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF
        LNTIEAGWQ  SPELYGD+ PRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG LVGYWPAF
Subjt:  LNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAF

Query:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK
        LF+HL+ HGSMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G N VWG YFYYGGPG+N +
Subjt:  LFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVK

Query:  CP
        CP
Subjt:  CP

AT5G50150.1 Protein of Unknown Function (DUF239); e-value: 5.1e-187; identity: 77.69%
Query:  LLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQ
        LL  L   +   S I   NQT F P +E+QKL+ + AYL KINKPS KTI S DGDVI+CV SHLQPAFDHP+L+G  PL+ P RP   N T       Q
Subjt:  LLQRLGATITNHSPIPFSNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQ

Query:  LWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNT
        LWS SGE CP G+IPIR+T + D+LRA+SVRRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPRVTD YEFSLSQIW+ISGSFG+DLNT
Subjt:  LWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNT

Query:  IEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFS
        IEAGWQ  SPELYGD+ PRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISP SSYNG+QFDIGLM+WKDPKHGHWWLE G+GLLVGYWPAFLFS
Subjt:  IEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFS

Query:  HLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        HLRSH SMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLH+LADH  CYDIRQG NNVWGTYFYYGGPGRN +CP
Subjt:  HLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
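
In the alignment blocks above, the unlabeled middle line repeats a residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. The following is a rough sketch of how identical and positive positions could be counted for one displayed block. The function name `block_stats` is hypothetical, and the counts cover only the block passed in, so they are not expected to reproduce the % identity reported for a whole hit.

```python
def block_stats(query, midline):
    """Count identical and positive positions in one alignment block."""
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline.ljust(len(query))
    # A position is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # Positives are identities plus conservative substitutions marked '+'.
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# Final block of the first GenBank hit: Query 'KCP', midline '+CP'.
identical, positive, length = block_stats("KCP", "+CP")
print(f"{identical}/{length} identical, {positive}/{length} positive")
```

Summing these counts over every block of a hit and dividing by the total aligned length would give a per-hit figure, under the assumption that the displayed blocks cover the full alignment.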


Sequences
CDS sequence
ATGGCAAAGATAGCATCATGTGGTCTCCACATTCAGATGGATCATCAGAGCTCCCATCACCATCTTCTTCAAACACTTCTTCAAAGGCTTGGTGCCACCATTACCAATCA
CAGTCCAATACCATTCTCTAACCAAACTTTCCACCCGGCCAAAGAGCTGCAGAAACTAAAGCACATCAGAGCTTATTTACGCAAAATCAACAAGCCTTCAACCAAGACAA
TTCGGAGCTCAGATGGTGATGTCATAGACTGTGTGATTTCCCATCTCCAGCCTGCTTTTGACCATCCTGAACTCAAAGGACACACCCCATTGGAGCCGCCGGAGAGGCCA
AGAGGGAACAACTCGACGGAAGGTGTGGCAGAGAGCTTCCAATTATGGTCAGATTCTGGCGAATTCTGCCCGGAGGGAACTATTCCGATAAGAAGAACCAGAGAGACCGA
CATTCTTAGAGCAAGCTCTGTTCGCAGATTTGGAAGAAAACCCATTAGACGTGTGAGGAGAGATTCATCAGGCAATGGCCACGAGCATGCCGTGGTGTTTGTAAATGGAG
AACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCACCACGTGTAACCGACCAATACGAATTCAGCTTATCTCAAATATGGGTCATTTCAGGCTCCTTTGGCAAT
GATTTAAACACCATTGAAGCTGGATGGCAGGCAAGTAGCCCTGAGCTGTATGGCGACAGCAATCCTAGGTTCTTCACGTATTGGACGACTGATGCTTATCAAGCCACTGG
CTGTTACAATCTACTTTGCTCTGGCTTTGTTCAAACTAACAACAAGATCGCCATTGGAGCAGCAATCTCCCCCATCTCCTCTTATAATGGCAAACAATTCGATATTGGTT
TAATGGTTTGGAAGGACCCGAAGCACGGGCACTGGTGGTTGGAATACGGGTCAGGTCTGCTAGTCGGATACTGGCCGGCGTTTCTGTTCAGCCATTTAAGGAGCCATGGG
AGCATGGTGCAGTTTGGAGGGGAGATAGTGAACAGCAGATCATCAGGGTTCCACACAGCCACTCAAATGGGGAGTGGCCATTTTGCAGAAGAAGGCTTTGGAAAAGCTTC
ATATTTCAGGAACCTCCAAGTGGTTGATTGGGACAATAATTTGCTTCCTCTAACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATAAGACAAGGCAACA
ATAATGTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGGAGGAATGTAAAATGCCCATGA
mRNA sequence
CAGGAGGATGGCCAGAAAGATTTCAAATTTACTTGACTTCTCCAATGCATATTTGTTTAAAATCTTCCCATGCTAAATTTTCTAATCATACTTTTGAAAAGTCACAGAAG
TTTTTGCTTTTTCTTTGAAAGAAGGTTGCAAAAAGAGCTGCAGTTTTTGTATGGCAAAGATAGCATCATGTGGTCTCCACATTCAGATGGATCATCAGAGCTCCCATCAC
CATCTTCTTCAAACACTTCTTCAAAGGCTTGGTGCCACCATTACCAATCACAGTCCAATACCATTCTCTAACCAAACTTTCCACCCGGCCAAAGAGCTGCAGAAACTAAA
GCACATCAGAGCTTATTTACGCAAAATCAACAAGCCTTCAACCAAGACAATTCGGAGCTCAGATGGTGATGTCATAGACTGTGTGATTTCCCATCTCCAGCCTGCTTTTG
ACCATCCTGAACTCAAAGGACACACCCCATTGGAGCCGCCGGAGAGGCCAAGAGGGAACAACTCGACGGAAGGTGTGGCAGAGAGCTTCCAATTATGGTCAGATTCTGGC
GAATTCTGCCCGGAGGGAACTATTCCGATAAGAAGAACCAGAGAGACCGACATTCTTAGAGCAAGCTCTGTTCGCAGATTTGGAAGAAAACCCATTAGACGTGTGAGGAG
AGATTCATCAGGCAATGGCCACGAGCATGCCGTGGTGTTTGTAAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCACCACGTGTAACCGACCAAT
ACGAATTCAGCTTATCTCAAATATGGGTCATTTCAGGCTCCTTTGGCAATGATTTAAACACCATTGAAGCTGGATGGCAGGCAAGTAGCCCTGAGCTGTATGGCGACAGC
AATCCTAGGTTCTTCACGTATTGGACGACTGATGCTTATCAAGCCACTGGCTGTTACAATCTACTTTGCTCTGGCTTTGTTCAAACTAACAACAAGATCGCCATTGGAGC
AGCAATCTCCCCCATCTCCTCTTATAATGGCAAACAATTCGATATTGGTTTAATGGTTTGGAAGGACCCGAAGCACGGGCACTGGTGGTTGGAATACGGGTCAGGTCTGC
TAGTCGGATACTGGCCGGCGTTTCTGTTCAGCCATTTAAGGAGCCATGGGAGCATGGTGCAGTTTGGAGGGGAGATAGTGAACAGCAGATCATCAGGGTTCCACACAGCC
ACTCAAATGGGGAGTGGCCATTTTGCAGAAGAAGGCTTTGGAAAAGCTTCATATTTCAGGAACCTCCAAGTGGTTGATTGGGACAATAATTTGCTTCCTCTAACAAATCT
TCATCTCTTGGCTGACCATTCTGATTGCTATGATATAAGACAAGGCAACAATAATGTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGGAGGAATGTAAAATGCCCAT
GAAAACAAAAATTAAGTTGTAATAATATTGAAAAAAGCTTGTAGGATAGAATGTAATGTATAATTTTTTTTTTTCCTTTCTCCTGGTTTCACTTTTATGGTTTGTTTTGT
TTTTTTTTTTTTCATAATGTTAATCATAGAGGATATGGAATTTTGTAACTATTGGAGTGGAAATACAAAGAATTAGAAGGTTGTGGTCTCTTCCCT
Protein sequence
MAKIASCGLHIQMDHQSSHHHLLQTLLQRLGATITNHSPIPFSNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP
RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN
DLNTIEAGWQASSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHG
SMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
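
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code and uses only short excerpts copied from the start and end of the CDS, not the full sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 nucleotides of the CDS listed above.
cds_start = "ATGGCAAAGATAGCATCATGTGGTCTCCAC"
cds_end = "AAATGCCCATGA"
print(translate(cds_start))
print(translate(cds_end))
```

Running the same function over the full CDS, with line breaks removed, should yield the protein sequence followed by a single `*` for the terminal stop codon, provided the assumption about the genetic code holds.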