; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003423 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003423
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationscaffold234:2779816..2782143
RNA-Seq ExpressionMS003423
SyntenyMS003423
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602636.1 hypothetical protein SDJN03_07869, partial [Cucurbita argyrosperma subsp. sororia]5.6e-22389.29Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVK-SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVK +H+D  SH NS+A+LFGGRNAEIQLLLGVVGAPLIPLP+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVK-SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT
        SHLLRIKAGGSRND IFWETTMETLIQDYRTID VNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT

Query:  SNGKFPLTIRCTAAA----SKICSSRVAAIDVDEISEGSSNQSDEDED
        SNGK PLT+RC AAA    SKICSSRVAAIDVDE SEG SNQSDEDE+
Subjt:  SNGKFPLTIRCTAAA----SKICSSRVAAIDVDEISEGSSNQSDEDED

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia]5.8e-25299.32Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDN
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIP+PVPFDHRPITRNIKDN
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
        PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV

Query:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
        AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSC+GEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
Subjt:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR

Query:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
        IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
Subjt:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF

Query:  PLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE
        PLT+RCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE
Subjt:  PLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE

XP_022957502.1 uncharacterized protein LOC111458877 [Cucurbita moschata]1.3e-22490.16Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIPLP+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT
        SHLLRIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT

Query:  SNGKFPLTIRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDED
        SNGK PLT+RC AAA   SKICSSRVAAIDVDE SEG SNQSDEDE+
Subjt:  SNGKFPLTIRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDED

XP_022990734.1 uncharacterized protein LOC111487531 [Cucurbita maxima]1.3e-22490.45Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS+  HK SHVNS+A+LFGGRNAEIQLLLGVVGAPLIPLP+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK

Query:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
        VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LEDSHLL
Subjt:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL

Query:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK
        RIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   TSNGK
Subjt:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK

Query:  FPLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDED
         PLT+RC A+ SKIC SRVAAIDVDE SEG SNQSDEDE+
Subjt:  FPLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDED

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida]3.5e-21787Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS--HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIK
        MRKLCPNFDRE GLDTVLEVPIPEEMFS  T KTH ISWQAMKSWVKS  H+D  SHV S+++LFGGRNAEIQLLLGVVGAPLIPLP+ FD +PITRNIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS--HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKA--KNGK--GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKM ASEF SGEG LNGK +K    NGK  GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKA--KNGK--GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVI
        DSHLLRIK  GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV +I
Subjt:  DSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVI

Query:  TSNGKFPLTIRCTAAASKICSSRVAAID-VDEISEGSSNQSDEDED
        TSNGKFP+T+RC +A S++ SSRV AID  DE     SNQSDEDED
Subjt:  TSNGKFPLTIRCTAAASKICSSRVAAID-VDEISEGSSNQSDEDED

TrEMBL top hitse value%identityAlignment
A0A6J1BVR5 uncharacterized protein LOC1110061912.8e-25299.32Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDN
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIP+PVPFDHRPITRNIKDN
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
        PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV

Query:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
        AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSC+GEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
Subjt:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR

Query:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
        IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
Subjt:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF

Query:  PLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE
        PLT+RCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE
Subjt:  PLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE

A0A6J1FGT4 uncharacterized protein LOC1114452687.2e-20883.33Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+HAISWQAMKSWVKS   +K SH+ S+A+LFGGRNAEIQLLLGVVGAPLIPLP+ F H+ ITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
        NPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKMVASEF+SGEG  NGK +KAKNGKGG   GEMG FV+WQKRP+LWCLE+MLSGCKISAGSDG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG

Query:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
        KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKT+N EDCFILKLEAES+VLRARSSS VEIIRHTVWGYFSQRTGLLV LEDSHL
Subjt:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL

Query:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG
        LRIK GGSRND++FWETTME+ IQDYRTIDGVNIAHAGKT+VSL RFG+ AEGHS+TKMEEIW+IEEVDFNIKGLSM+FFLPPSDLKKEEE +  I S+ 
Subjt:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG

Query:  KFPLTI--RCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE
        KFPL +  R   A S+I  SRVAA+D DE SEGSS +SDED+D+
Subjt:  KFPLTI--RCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE

A0A6J1GZA9 uncharacterized protein LOC1114588776.5e-22590.16Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIPLP+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT
        SHLLRIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT

Query:  SNGKFPLTIRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDED
        SNGK PLT+RC AAA   SKICSSRVAAIDVDE SEG SNQSDEDE+
Subjt:  SNGKFPLTIRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDED

A0A6J1JRQ6 uncharacterized protein LOC1114891903.6e-20783.97Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+HAISWQAMKSWVKS   +K SHV S+A+LFGGRNAEIQLLLGVVGAPLIPLP+ F H+ ITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
        NPIEASMAKYIVQQY+AAVGGEHALNSI SMYAMGKVKMVASEF+SGEG  NGK +KAKNGK G   GEMG FV+WQKRP+LWCLE+MLSGCKISAGSDG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG

Query:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
        KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+CIGEKT+N EDCFILKLEAES+VLRARSSS VEIIRHTVWGYFSQRTGLLV LEDSHL
Subjt:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL

Query:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG
        LRIK GGSRND +FWETTME+ I+DYRTIDGVNIAHAGKT+VSL RFG+ AEGHS+TKMEEIW+IEEVDFNIKGLSM+FFLPPSDLKKEEE + V+TS+ 
Subjt:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG

Query:  KFPLTI-RCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE
        KFP T+ R + A S+I SSRVAA+D DE SEGSS +SDEDED+
Subjt:  KFPLTI-RCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE

A0A6J1JSU8 uncharacterized protein LOC1114875316.5e-22590.45Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS+  HK SHVNS+A+LFGGRNAEIQLLLGVVGAPLIPLP+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK

Query:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
        VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LEDSHLL
Subjt:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL

Query:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK
        RIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   TSNGK
Subjt:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK

Query:  FPLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDED
         PLT+RC A+ SKIC SRVAAIDVDE SEG SNQSDEDE+
Subjt:  FPLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75160.1 Protein of unknown function (DUF620)1.9e-11252.25Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSH-------NDHKSHVNSMAA------LFGGRNAEIQLLLGVVGAPLIPLPVP
        MRKLCPN DREDGL+TVLEVP+PEEMF+   +      W+ M + +K+H        D ++  +S +       L    + E   LL +VG+PLIP  VP
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSH-------NDHKSHVNSMAA------LFGGRNAEIQLLLGVVGAPLIPLPVP

Query:  FDHRPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELML
         +   ++R I D  IEAS AKYIVQQYVAA GG  ALN+++SMYA+G+V+M  SE  +GE    G  ++     G G  E+GGFV+WQK P LW LEL++
Subjt:  FDHRPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELML

Query:  SGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRT
        SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWGYFSQRT
Subjt:  SGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRT

Query:  GLLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE
        GLLV+  D+ L+R+K+G  +ND +FWET+ME++I DY  +D VNIAH G+T  +L+R+G +   + R ++EE W IEEVDFNI GL ++ FLPPSD+  +
Subjt:  GLLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE

AT3G19540.1 Protein of unknown function (DUF620)5.1e-9745.43Show/hide
Query:  REDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDNPIEASMAKY
        R   L  V+E P P+E                +  WVK      S   S+AA    R  +++LLLGV+GAPL P+ V         +IK+ PIE S A+Y
Subjt:  REDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDNPIEASMAKY

Query:  IVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHH
        I+QQY AA GG+   NSI++ YAMGK+KM+ SE  +          +    +     E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR TPW  
Subjt:  IVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHH

Query:  SHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRND
        SH ++GP RPLRR LQGLDP++TA +F+ + CIGEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++ G   +
Subjt:  SHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRND

Query:  SIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKFPLTIR---C
        ++FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A  H+RTKMEE W IEEV FN+ GLS+D F+PP+DLK        +T + ++P   R    
Subjt:  SIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKFPLTIR---C

Query:  TAAASKICSSRVAAID
        T A S    ++VAA++
Subjt:  TAAASKICSSRVAAID

AT3G55720.1 Protein of unknown function (DUF620)1.6e-13557.21Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMK-SWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDH----RPITR
        MR LCPNFDREDGL+TVLEVP+PEE+F  + NK+ A  W+++K S ++S  D+ S   S+A LFGGR+++IQ+LLG+VGAP IPLP+  D      PI+ 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMK-SWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDH----RPITR

Query:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAK------NGKGGGGGEMGGFVVWQKRPELWCLELMLSG
         IK+  IE++MAKYIV+QY AA GGE AL+++ESMYAMGKVKM  +EF + + +LNGK  K        N   G GGEMGGFV+W+K    W LEL++SG
Subjt:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAK------NGKGGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTG
        CK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE + + L++RS S +E ++HTVWG F QRTG
Subjt:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL---K
        LLVQLED++L+RIK G    D + WETT ETLIQDY++IDG+ IAH GKT VSL R  ES E HS+T MEE WEIEEV FN+KGLS DFFLPP DL   +
Subjt:  LLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL---K

Query:  KEEEGVAVITSNGKFPLTIRCTAAASKICSSRVAAID
        +EE G +         L ++ +  + KI SS+V AI+
Subjt:  KEEEGVAVITSNGKFPLTIRCTAAASKICSSRVAAID

AT5G05840.1 Protein of unknown function (DUF620)2.2e-16467.87Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKS-WVK--SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDH-----RP
        MRKLCPN++ EDGL+TVLEVP+PEE+F+ +  K     W  MKS W K  +     +   +M  LFGGRNAEIQLLLGVVGAPLIPLPV  DH      P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKS-WVK--SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDH-----RP

Query:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKI
        I ++IKD P+E SMA+YIV+QY+AAVGG+ ALN++ESMYAMGKV+M ASEF +GEGSLN K++KA++ K  GGGE+GGFV+WQK  ELWCLEL++SGCKI
Subjt:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKI

Query:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
        SAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE + L+ARSSS+VEIIRHTVWG FSQRTGLL+Q
Subjt:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ

Query:  LEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK---EEE
        LEDSHLLRIKA    ++SIFWETTME+LIQDYRT+DG+ +AHAGK+SVSLFRFGE+++ HSRT+MEE WEIEE+DFNIKGLSMD FLPPSDLKK   EEE
Subjt:  LEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK---EEE

Query:  GV--AVITSNGKFPLTIRCTAAASKICSSRVAAI----DVDEISE
         +   +  +N K P+ IR  +A+ +I SS+V AI    D  E++E
Subjt:  GV--AVITSNGKFPLTIRCTAAASKICSSRVAAI----DVDEISE

AT5G66740.1 Protein of unknown function (DUF620)9.9e-11755.15Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDN
        MRKLCPN D++DGL+TVLEVPIPEEMFS   N   A+ WQ M +W+K+    K     +AA    R  E++ LL +VG+PLIPL V   H  + + +KD 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
         I+AS AKYIVQQY+AA GG  ALN++ SM   G+VKM ASEF  G+ S  G  LK+ +       EMGGFV+WQK P+LWCLEL++SGCK+  GS+G++
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV

Query:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
        +WR +    + AS G PRPLRRFLQGLDP+STA LF +++CIGEK IN EDCFILKLE    V  A+S  + EII HT+WGYFSQR+GLL+Q EDS LLR
Subjt:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR

Query:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        ++     ++ +FWET+ E+++ DYR +D VNIAH GKTSV++FR+GE++  H R +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGCTTTGTCCGAATTTCGACCGAGAAGACGGTCTCGACACTGTCCTTGAGGTTCCCATTCCTGAAGAGATGTTCTCTTGCAACACTAACAAAACCCACGCGAT
TTCATGGCAAGCTATGAAATCGTGGGTGAAGTCCCATAATGATCATAAATCGCACGTGAATTCCATGGCTGCACTTTTCGGCGGCCGGAACGCCGAGATCCAGCTCCTCC
TCGGCGTCGTCGGAGCTCCTTTAATCCCTCTCCCTGTCCCTTTCGATCACCGACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCGATGGCGAAGTATATA
GTGCAGCAATATGTGGCGGCGGTGGGAGGGGAGCATGCGTTGAACTCGATTGAGAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAATTCTCGTCCGGCGA
GGGGAGTTTGAACGGAAAGGTGTTAAAGGCGAAGAACGGGAAAGGCGGCGGCGGCGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAACGGCCGGAGTTATGGTGCTTGG
AACTGATGCTGTCCGGTTGTAAAATCAGCGCCGGCAGCGATGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCATTCTCATGCTTCTCGTGGCCCTCCTCGTCCCCTC
CGACGCTTCTTGCAGGGGCTTGATCCGAAATCGACGGCGACTCTGTTTTCGAATTCCAGCTGCATCGGCGAGAAAACAATCAACGATGAAGATTGTTTCATCCTAAAACT
GGAAGCGGAATCAACGGTGCTGAGAGCGAGAAGCAGTAGTAGCGTGGAGATCATCCGCCACACGGTTTGGGGATACTTCAGCCAGAGAACCGGCCTCCTGGTCCAGCTCG
AAGACTCGCATCTCCTCCGAATCAAGGCCGGTGGATCCCGAAACGACAGTATTTTCTGGGAAACGACAATGGAAACCCTAATTCAAGACTACAGAACGATCGACGGCGTC
AATATCGCACACGCCGGGAAAACGTCCGTTTCGCTCTTTCGATTCGGCGAGAGCGCTGAAGGCCATTCGAGAACGAAAATGGAAGAGATTTGGGAGATCGAAGAAGTGGA
TTTCAATATCAAGGGTTTGTCGATGGACTTCTTTTTGCCTCCTAGCGATCTGAAGAAGGAGGAAGAAGGAGTCGCCGTGATTACGAGTAACGGAAAGTTTCCGTTGACGA
TTCGATGTACGGCTGCTGCTTCGAAGATTTGTTCGTCGCGAGTGGCGGCCATTGATGTTGATGAAATTTCGGAGGGGAGCAGCAACCAGAGTGATGAGGATGAAGATGAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGCTTTGTCCGAATTTCGACCGAGAAGACGGTCTCGACACTGTCCTTGAGGTTCCCATTCCTGAAGAGATGTTCTCTTGCAACACTAACAAAACCCACGCGAT
TTCATGGCAAGCTATGAAATCGTGGGTGAAGTCCCATAATGATCATAAATCGCACGTGAATTCCATGGCTGCACTTTTCGGCGGCCGGAACGCCGAGATCCAGCTCCTCC
TCGGCGTCGTCGGAGCTCCTTTAATCCCTCTCCCTGTCCCTTTCGATCACCGACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCGATGGCGAAGTATATA
GTGCAGCAATATGTGGCGGCGGTGGGAGGGGAGCATGCGTTGAACTCGATTGAGAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAATTCTCGTCCGGCGA
GGGGAGTTTGAACGGAAAGGTGTTAAAGGCGAAGAACGGGAAAGGCGGCGGCGGCGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAACGGCCGGAGTTATGGTGCTTGG
AACTGATGCTGTCCGGTTGTAAAATCAGCGCCGGCAGCGATGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCATTCTCATGCTTCTCGTGGCCCTCCTCGTCCCCTC
CGACGCTTCTTGCAGGGGCTTGATCCGAAATCGACGGCGACTCTGTTTTCGAATTCCAGCTGCATCGGCGAGAAAACAATCAACGATGAAGATTGTTTCATCCTAAAACT
GGAAGCGGAATCAACGGTGCTGAGAGCGAGAAGCAGTAGTAGCGTGGAGATCATCCGCCACACGGTTTGGGGATACTTCAGCCAGAGAACCGGCCTCCTGGTCCAGCTCG
AAGACTCGCATCTCCTCCGAATCAAGGCCGGTGGATCCCGAAACGACAGTATTTTCTGGGAAACGACAATGGAAACCCTAATTCAAGACTACAGAACGATCGACGGCGTC
AATATCGCACACGCCGGGAAAACGTCCGTTTCGCTCTTTCGATTCGGCGAGAGCGCTGAAGGCCATTCGAGAACGAAAATGGAAGAGATTTGGGAGATCGAAGAAGTGGA
TTTCAATATCAAGGGTTTGTCGATGGACTTCTTTTTGCCTCCTAGCGATCTGAAGAAGGAGGAAGAAGGAGTCGCCGTGATTACGAGTAACGGAAAGTTTCCGTTGACGA
TTCGATGTACGGCTGCTGCTTCGAAGATTTGTTCGTCGCGAGTGGCGGCCATTGATGTTGATGAAATTTCGGAGGGGAGCAGCAACCAGAGTGATGAGGATGAAGATGAA
Protein sequenceShow/hide protein sequence
MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPLPVPFDHRPITRNIKDNPIEASMAKYI
VQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPL
RRFLQGLDPKSTATLFSNSSCIGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGV
NIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKFPLTIRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE