CuGenDBv2

Tan0020952 (gene) of Snake gourd v1 genome

Gene ID: Tan0020952
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF620)
Genome location: LG11:1919926..1922546
RNA-Seq Expression: Tan0020952
Synteny: Tan0020952
Gene Ontology terms: NA
InterPro domains: IPR006873 - Protein of unknown function DUF620
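
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; this page does not state the convention), the span covered by the gene could be computed as follows. The function names `parse_location` and `span_length` are hypothetical and are not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'LG11:1919926..1922546' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (not confirmed by the source page).
    return end - start + 1

scaffold, start, end = parse_location("LG11:1919926..1922546")
length = span_length(start, end)
```

Under that assumption the genomic span is longer than the mRNA listed further down, which would be consistent with the gene containing intronic sequence, although the page itself does not show the exon structure.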


Homology
GenBank top hits | e value | % identity
KAG6602636.1 hypothetical protein SDJN03_07869, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-234 | 93.08
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS NTNKAH ISWQAMK+WVK NHHDKSSH NSIASLFGGRNAEIQLLLGVVGAPLIPLPI +  HQPIT NIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGE CLNGKA KVKNGKGGGG  GGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIKAGGSRND+IFWETTMETLIQDYRTID VNIAHAGKTTVSLFRFGESAEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG  
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAMRC---AAAAGSKICSSRVAAIDVDESEGSNQSDDDEDL
        TSNGK PL MRC   AAAAGSKICSSRVAAIDVDESEGSNQSD+DE++
Subjt:  TSNGKFPLAMRC---AAAAGSKICSSRVAAIDVDESEGSNQSDDDEDL

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia] | 1.2e-228 | 91.7
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS NTNK H ISWQAMKSWVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+   DH+PITRNIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGEG LNGK +K KNGK    GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSC+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEEIWEIEEVDFNI+GLSMDFFLPPSDLKKEEEGV VI
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDE-SEG-SNQSDDDED
        TSNGKFPL MRC AAA SKICSSRVAAIDVDE SEG SNQSD+DED
Subjt:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDE-SEG-SNQSDDDED

XP_022957502.1 uncharacterized protein LOC111458877 [Cucurbita moschata] | 2.6e-236 | 94.17
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS NTNKAH ISWQAMK+WVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI +  HQPIT NIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGE CLNGKA KVKNGKGGGG  GGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG  
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAMRC--AAAAGSKICSSRVAAIDVDESEGSNQSDDDED
        TSNGK PL MRC  AAAAGSKICSSRVAAIDVDESEGSNQSD+DE+
Subjt:  TSNGKFPLAMRC--AAAAGSKICSSRVAAIDVDESEGSNQSDDDED

XP_022990734.1 uncharacterized protein LOC111487531 [Cucurbita maxima] | 2.8e-230 | 92.36
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS NTNKAH ISWQAMK+WVKSN H KSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI +  HQPIT NIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGE CLNGKA KVKNGK    GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG  
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDESEGSNQSDDDEDL
        TSNGK PL MRC AA+GSKIC SRVAAIDVDESEGSNQSD+DE++
Subjt:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDESEGSNQSDDDEDL

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida] | 4.2e-226 | 90.4
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSN-HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNI
        MRKLCPNFDRE GLDTVLEVPIPEEMFSS T K H ISWQAMKSWVKSN HHDKSSHV SI+SLFGGRNAEIQLLLGVVGAPLIPLPI+  D QPITRNI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSN-HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKIS
        KDNPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKM ASEF SGEGCLNGKAVK     G GGGGGGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGV
        EDSHLLRIK  GSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKEEEGVG+
Subjt:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGV

Query:  ITSNGKFPLAMRCAAAAGSKICSSRVAAI---DVDESEGSNQSDDDED
        ITSNGKFP+ MRC  +AGS++ SSRV AI   D DESE SNQSD+DED
Subjt:  ITSNGKFPLAMRCAAAAGSKICSSRVAAI---DVDESEGSNQSDDDED

TrEMBL top hits | e value | % identity
A0A0A0KBJ0 Uncharacterized protein | 4.9e-212 | 86.53
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITR-NI
        MRKLCPNFDRE GLDTVLEVPIPEEMFSSNT K H ISWQAMKSWVKSN  DKSSH  SI SLFGGRNAEIQLLLGVVGAPLIPLPI+    QPI R NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITR-NI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKN----GKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSG
        KDNPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKM ASEF SGEG     AVK KN    G GGGGGGGGGEMGGFVVWQKRPELWCLELML G
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKN----GKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
         KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
Subjt:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEE
        LVQLEDSHLLRIK  GSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKEEE
Subjt:  LVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEE

Query:  GVGVI-TSNGKFPLAMRCAAAAGSKICSSRVAAIDV---DESEGSNQSDDDED
        GVG+I TS GKFPL MRC+    S+  SSRVAAID    +ESEGSN+SD++++
Subjt:  GVGVI-TSNGKFPLAMRCAAAAGSKICSSRVAAIDV---DESEGSNQSDDDED

A0A6J1BVR5 uncharacterized protein LOC111006191 | 5.8e-229 | 91.7
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS NTNK H ISWQAMKSWVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+   DH+PITRNIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGEG LNGK +K KNGK    GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSC+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEEIWEIEEVDFNI+GLSMDFFLPPSDLKKEEEGV VI
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDE-SEG-SNQSDDDED
        TSNGKFPL MRC AAA SKICSSRVAAIDVDE SEG SNQSD+DED
Subjt:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDE-SEG-SNQSDDDED

A0A6J1FGT4 uncharacterized protein LOC111445268 | 3.1e-214 | 84.94
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS  T K+H ISWQAMKSWVKS++++K SH+ SIASLFGGRNAEIQLLLGVVGAPLIPLPI+   HQ ITRNIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEF+SGEGC NGK +K KNGK   GG   GEMG FV+WQKRP+LWCLE+MLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKT+N EDCFILKLEAESSVLRARSSS VEIIRHTVWGYFSQRTGLLV LE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIK GGSRNDN+FWETTME+ IQDYRTIDGVNIAHAGKTTVSL RFG+ AEGHSKTKMEEIW+IEEVDFNI+GLSM+FFLPPSDLKKEEE +G I
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAM-RCAAAAGSKICSSRVAAIDVDESEGSNQSDDDED
         S+ KFPLAM R +A AGS+I  SRVAA+D DESEGS++SD+D+D
Subjt:  TSNGKFPLAM-RCAAAAGSKICSSRVAAIDVDESEGSNQSDDDED

A0A6J1GZA9 uncharacterized protein LOC111458877 | 1.3e-236 | 94.17
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS NTNKAH ISWQAMK+WVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI +  HQPIT NIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGE CLNGKA KVKNGKGGGG  GGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG  
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAMRC--AAAAGSKICSSRVAAIDVDESEGSNQSDDDED
        TSNGK PL MRC  AAAAGSKICSSRVAAIDVDESEGSNQSD+DE+
Subjt:  TSNGKFPLAMRC--AAAAGSKICSSRVAAIDVDESEGSNQSDDDED

A0A6J1JSU8 uncharacterized protein LOC111487531 | 1.4e-230 | 92.36
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPNFDREDGLDTVLEVPIPEEMFS NTNKAH ISWQAMK+WVKSN H KSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI +  HQPIT NIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGE CLNGKA KVKNGK    GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI
        DSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG  
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVI

Query:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDESEGSNQSDDDEDL
        TSNGK PL MRC AA+GSKIC SRVAAIDVDESEGSNQSD+DE++
Subjt:  TSNGKFPLAMRCAAAAGSKICSSRVAAIDVDESEGSNQSDDDEDL

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G75160.1 Protein of unknown function (DUF620) | 1.1e-110 | 51.6
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSN------------HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIS
        MRKLCPN DREDGL+TVLEVP+PEEMF+   + A    W+ M + +K++                SS  N    L    + E   LL +VG+PLIP  + 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSN------------HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIS

Query:  LKDHQPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWC
        L+    ++R I D  IEAS AKYIVQQYVAA GG  ALN++ SMYA+G+V+M  SE  +GE    G  V++        G G  E+GGFV+WQK P LW 
Subjt:  LKDHQPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWC

Query:  LELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGY
        LEL++SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWGY
Subjt:  LELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGY

Query:  FSQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPS
        FSQRTGLLV+  D+ L+R+K+G  +ND +FWET+ME++I DY  +D VNIAH G+T  +L+R+G +   + + ++EE W IEEVDFNI GL ++ FLPPS
Subjt:  FSQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPS

Query:  DLKKE
        D+  +
Subjt:  DLKKE

AT3G19540.1 Protein of unknown function (DUF620) | 2.7e-98 | 46.48
Query:  REDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIKDNPIEASMA
        R   L  V+E P P+E                +  WVK     + S   S+A+    R  +++LLLGV+GAPL P+ +S  D  P   +IK+ PIE S A
Subjt:  REDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIKDNPIEASMA

Query:  KYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWR
        +YI+QQY AA GG+   NSI + YAMGK+KM+ SE  +       + V+ +N           E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR
Subjt:  KYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWR

Query:  QTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKA
         TPW  SH ++GP RPLRR LQGLDP++TA +F+ + CIGEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++
Subjt:  QTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKA

Query:  GGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVITSNGKFP
         G   + +FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A  H++TKMEE W IEEV FN+ GLS+D F+PP+DLK      G +T + ++P
Subjt:  GGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVITSNGKFP

AT3G55720.1 Protein of unknown function (DUF620) | 3.7e-135 | 58.33
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIS---LKDHQPITR
        MR LCPNFDREDGL+TVLEVP+PEE+F S+ NK+   +W+++KS +  +  D SS   S+A+LFGGR+++IQ+LLG+VGAP IPLPIS    K   PI+ 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIS---LKDHQPITR

Query:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVK--VKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSG
         IK+  IE++MAKYIV+QY AA GGE AL++++SMYAMGKVKM  +EF + +  LNGK  K  V+        G GGEMGGFV+W+K    W LEL++SG
Subjt:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVK--VKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
        CK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE + S L++RS S +E ++HTVWG F QRTG
Subjt:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDL-KKE
        LLVQLED++L+RIK G    D + WETT ETLIQDY++IDG+ IAH GKT VSL R  ES E HSKT MEE WEIEEV FN++GLS DFFLPP DL  KE
Subjt:  LLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDL-KKE

Query:  EEGVGVITSNGKFPLAMRCAAAAGS-KICSSRVAAI-DVDESEG
        EE  G    +   P+ +    +  S KI SS+V AI D  E EG
Subjt:  EEGVGVITSNGKFPLAMRCAAAAGS-KICSSRVAAI-DVDESEG

AT5G05840.1 Protein of unknown function (DUF620) | 6.6e-161 | 65.27
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKS-WVK-SNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDH----QP
        MRKLCPN++ EDGL+TVLEVP+PEE+F+++  K     W  MKS W K +     ++   ++  LFGGRNAEIQLLLGVVGAPLIPLP+    H     P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKS-WVK-SNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDH----QP

Query:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLS
        I ++IKD P+E SMA+YIV+QY+AAVGG+ ALN+++SMYAMGKV+M ASEF +GEG LN K VK ++ K      GGGE+GGFV+WQK  ELWCLEL++S
Subjt:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLS

Query:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
        GCKISAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE S L+ARSSS+VEIIRHTVWG FSQRTG
Subjt:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKK--
        LL+QLEDSHLLRIKA    +++IFWETTME+LIQDYRT+DG+ +AHAGK++VSLFRFGE+++ HS+T+MEE WEIEE+DFNI+GLSMD FLPPSDLKK  
Subjt:  LLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKK--

Query:  -EEEGV--GVITSNGKFPLAMRCAAAAGSKICSSRVAAI--DVDESEGSNQS
         EEE +  G+  +N K P+ +R   +A  +I SS+V AI  + DESE + ++
Subjt:  -EEEGV--GVITSNGKFPLAMRCAAAAGSKICSSRVAAI--DVDESEGSNQS

AT5G66740.1 Protein of unknown function (DUF620) | 7.2e-115 | 52.79
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK
        MRKLCPN D++DGL+TVLEVPIPEEMFS   N    + WQ M +W+K+   DK S       L   R  E++ LL +VG+PLIPL + +     + + +K
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        D  I+AS AKYIVQQY+AA GG  ALN+++SM   G+VKM ASEF  G+       V +K+           EMGGFV+WQK P+LWCLEL++SGCK+  
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GS+G+++WR +    + AS G PRPLRRFLQGLDP+STA LF +++CIGEK IN EDCFILKLE   +V  A+S  + EII HT+WGYFSQR+GLL+Q E
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEE
        DS LLR++     ++++FWET+ E+++ DYR +D VNIAH GKT+V++FR+GE++  H + +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  DSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEE
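
In the alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how a percent identity could be recomputed from such blocks. It is only an illustration: the helper names are hypothetical, the input is assumed to be already split into (query, midline) pairs, and the result may differ from the reported values if the page was computed over a different alignment length.

```python
def block_identity(query, midline):
    """Return (identical, aligned) counts for one alignment block."""
    # Pad the midline, because trailing spaces are often lost in copied text.
    midline = midline.ljust(len(query))[:len(query)]
    # Letters on the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for symbol in midline if symbol.isalpha())
    return identical, len(query)

def percent_identity(blocks):
    """blocks: iterable of (query, midline) string pairs, one per block."""
    identical = aligned = 0
    for query, midline in blocks:
        block_identical, block_aligned = block_identity(query, midline)
        identical += block_identical
        aligned += block_aligned
    return 100.0 * identical / aligned if aligned else 0.0

# Last block of the AT1G75160.1 alignment: query 'DLKKE', midline 'D+  +'.
example = percent_identity([("DLKKE", "D+  +")])
```

Gap characters in the query are counted in the aligned length here, which matches the common practice of dividing by the full alignment length.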


Sequences
CDS sequence
ATGAGGAAGCTTTGTCCCAACTTCGACCGCGAAGACGGTCTCGATACTGTCCTTGAAGTTCCCATCCCCGAGGAGATGTTCTCTTCCAACACCAACAAAGCCCACATGAT
TTCATGGCAAGCCATGAAATCATGGGTCAAATCCAATCACCACGATAAATCATCACATGTTAATTCAATCGCTTCTCTTTTCGGCGGCCGCAACGCTGAGATCCAGCTTC
TCCTCGGCGTCGTTGGAGCTCCCTTAATTCCTCTACCTATCTCTTTGAAAGACCATCAACCCATTACTCGCAACATCAAAGACAATCCCATTGAGGCGTCAATGGCGAAG
TACATAGTGCAACAATATGTGGCCGCCGTGGGAGGGGAACATGCGTTGAATTCAATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTTTCTTC
CGGCGAAGGGTGTTTGAACGGAAAAGCGGTGAAGGTGAAGAATGGGAAAGGCGGCGGCGGTGGCGGAGGAGGAGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAAAGGC
CGGAGTTATGGTGCTTGGAACTGATGTTGTCGGGCTGTAAAATCAGCGCCGGCAGCGACGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCACTCTCATGCTTCTCGT
GGCCCACCTCGTCCCCTTCGACGATTCTTGCAGGGACTCGATCCAAAATCGACGGCGACTCTGTTCTCAAACTCTTCCTGCATCGGCGAGAAAACAATCAACGACGAAGA
TTGCTTCATTCTAAAGCTAGAAGCCGAATCATCAGTTCTCCGAGCAAGAAGCAGTAGCAGCGTCGAAATAATCCGCCACACAGTTTGGGGATATTTCAGCCAAAGAACCG
GCCTCCTCGTGCAGCTCGAAGATTCGCATCTCCTCCGAATCAAAGCCGGCGGATCTCGAAACGACAACATCTTCTGGGAGACGACAATGGAAACCTTAATTCAGGACTAC
AGAACGATCGACGGCGTCAACATTGCACACGCCGGAAAAACAACCGTCTCGCTTTTTCGATTTGGCGAAAGCGCTGAAGGCCATTCGAAAACGAAGATGGAGGAGATTTG
GGAGATCGAAGAAGTTGATTTCAATATCCAGGGTTTATCGATGGACTTCTTTTTGCCTCCGAGTGATTTGAAGAAGGAGGAAGAAGGAGTTGGTGTAATTACGAGTAATG
GAAAGTTTCCGTTGGCAATGAGATGTGCGGCGGCGGCTGGTTCGAAGATTTGTTCGTCGAGAGTGGCGGCCATTGATGTTGATGAATCGGAGGGGAGTAATCAGAGTGAT
GATGATGAAGATTTGTGA
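
The CDS above should translate to the protein sequence listed at the end of this record. A minimal sketch of that check, assuming the standard nuclear genetic code applies (the page does not state which translation table was used); `translate` and `CODON_TABLE` are illustrative names, not a CuGenDB API.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for position in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[position:position + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons and last six codons of the CDS shown above.
n_terminus = translate("ATGAGGAAGCTTTGTCCCAACTTCGACCGC")
c_terminus = translate("GATGATGAAGATTTGTGA")
```

Pasting the full CDS into `translate` allows a direct comparison with the protein sequence on this page. Codons containing ambiguous bases such as `N` raise a `KeyError` in this sketch.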
mRNA sequence
CTCCCTCACAATAATCTCTCTCTCTCTCTCTCTCACTGAGCAAGGAAAAAAAAAATTCATTCTTTCTTTGCTTCAATGAGGAAGCTTTGTCCCAACTTCGACCGCGAAGA
CGGTCTCGATACTGTCCTTGAAGTTCCCATCCCCGAGGAGATGTTCTCTTCCAACACCAACAAAGCCCACATGATTTCATGGCAAGCCATGAAATCATGGGTCAAATCCA
ATCACCACGATAAATCATCACATGTTAATTCAATCGCTTCTCTTTTCGGCGGCCGCAACGCTGAGATCCAGCTTCTCCTCGGCGTCGTTGGAGCTCCCTTAATTCCTCTA
CCTATCTCTTTGAAAGACCATCAACCCATTACTCGCAACATCAAAGACAATCCCATTGAGGCGTCAATGGCGAAGTACATAGTGCAACAATATGTGGCCGCCGTGGGAGG
GGAACATGCGTTGAATTCAATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTTTCTTCCGGCGAAGGGTGTTTGAACGGAAAAGCGGTGAAGG
TGAAGAATGGGAAAGGCGGCGGCGGTGGCGGAGGAGGAGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAAAGGCCGGAGTTATGGTGCTTGGAACTGATGTTGTCGGGC
TGTAAAATCAGCGCCGGCAGCGACGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCACTCTCATGCTTCTCGTGGCCCACCTCGTCCCCTTCGACGATTCTTGCAGGG
ACTCGATCCAAAATCGACGGCGACTCTGTTCTCAAACTCTTCCTGCATCGGCGAGAAAACAATCAACGACGAAGATTGCTTCATTCTAAAGCTAGAAGCCGAATCATCAG
TTCTCCGAGCAAGAAGCAGTAGCAGCGTCGAAATAATCCGCCACACAGTTTGGGGATATTTCAGCCAAAGAACCGGCCTCCTCGTGCAGCTCGAAGATTCGCATCTCCTC
CGAATCAAAGCCGGCGGATCTCGAAACGACAACATCTTCTGGGAGACGACAATGGAAACCTTAATTCAGGACTACAGAACGATCGACGGCGTCAACATTGCACACGCCGG
AAAAACAACCGTCTCGCTTTTTCGATTTGGCGAAAGCGCTGAAGGCCATTCGAAAACGAAGATGGAGGAGATTTGGGAGATCGAAGAAGTTGATTTCAATATCCAGGGTT
TATCGATGGACTTCTTTTTGCCTCCGAGTGATTTGAAGAAGGAGGAAGAAGGAGTTGGTGTAATTACGAGTAATGGAAAGTTTCCGTTGGCAATGAGATGTGCGGCGGCG
GCTGGTTCGAAGATTTGTTCGTCGAGAGTGGCGGCCATTGATGTTGATGAATCGGAGGGGAGTAATCAGAGTGATGATGATGAAGATTTGTGAAGAAATGTTTTAAGGTT
TTAAACTCAATTTTTGGAAATTTGTACAGAATCCATCCATGGTTAATATATATAAATATATGGAATATGGGACTTTAGGGCTAGTATGTTGTTTTGTTTTTATTTTGGTT
AAATTATAAGTTTAGTTCCTAAAGTTTGATGTTTATTTAGTCTCTAAACTTTAAAAAGTGTCTAATAGATTCTTGAACTTTCA
Protein sequence
MRKLCPNFDREDGLDTVLEVPIPEEMFSSNTNKAHMISWQAMKSWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPISLKDHQPITRNIKDNPIEASMAK
YIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGCLNGKAVKVKNGKGGGGGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASR
GPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDY
RTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGVITSNGKFPLAMRCAAAAGSKICSSRVAAIDVDESEGSNQSD
DDEDL