CuGenDBv2

MC04g1884 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g1884
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF620)
Genome location: MC04:25956707..25960019
RNA-Seq Expression: MC04g1884
Synteny: MC04g1884
Gene Ontology terms: NA
InterPro domains: IPR006873 - Protein of unknown function DUF620
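The genome location string above can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'CHROM:START..END' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("MC04:25956707..25960019")
# Under the inclusive-coordinate assumption, the locus span in base pairs:
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span covers the whole gene model on MC04, which is longer than the mRNA because it would include any introns.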


Homology
GenBank top hits (e value, %identity, alignment)
KAG6602636.1 hypothetical protein SDJN03_07869, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.77e-283; %identity: 89.26)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVK-SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVK +H+D  SH NS+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVK-SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGG----GEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGGGG    GEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGG----GEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT
        SHLLRIKAGGSRND IFWETTMETLIQDYRTID VNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT

Query:  SNGKFPLTMRCTAAA----SKICSSRVAAIDVDEISEGSSNQSDEDE
        SNGK PLTMRC AAA    SKICSSRVAAIDVDE SEGS NQSDEDE
Subjt:  SNGKFPLTMRCTAAA----SKICSSRVAAIDVDEISEGSSNQSDEDE

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia] (e value: 0.0; %identity: 100)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
        PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV

Query:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
        AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
Subjt:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR

Query:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
        IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
Subjt:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF

Query:  PLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS
        PLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS
Subjt:  PLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS

XP_022957502.1 uncharacterized protein LOC111458877 [Cucurbita moschata] (e value: 1.97e-285; %identity: 90.13)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGG----GEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGGGG    GEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGG----GEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT
        SHLLRIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT

Query:  SNGKFPLTMRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDE
        SNGK PLTMRC AAA   SKICSSRVAAIDVDE SEGS NQSDEDE
Subjt:  SNGKFPLTMRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDE

XP_022990734.1 uncharacterized protein LOC111487531 [Cucurbita maxima] (e value: 1.03e-285; %identity: 90.66)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKS-HVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS+  HKS HVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKS-HVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK

Query:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
        VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LEDSHLL
Subjt:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL

Query:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK
        RIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   TSNGK
Subjt:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK

Query:  FPLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDE
         PLTMRC A+ SKICS RVAAIDVDE SEGS NQSDEDE
Subjt:  FPLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDE

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida] (e value: 1.06e-275; %identity: 86.77)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS--HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIK
        MRKLCPNFDRE GLDTVLEVPIPEEMFS  T KTH ISWQAMKSWVKS  H+D  SHV S+++LFGGRNAEIQLLLGVVGAPLIP+P+ FD +PITRNIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS--HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKA--KNGKGGGGG--EMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKM ASEF SGEG LNGK +K    NGKGGGGG  EMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKA--KNGKGGGGG--EMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVI
        DSHLLRIK  GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV +I
Subjt:  DSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVI

Query:  TSNGKFPLTMRCTAAASKICSSRVAAIDV-DEISEGSSNQSDEDED
        TSNGKFP+TMRC+A  S++ SSRV AID  DE     SNQSDEDED
Subjt:  TSNGKFPLTMRCTAAASKICSSRVAAIDV-DEISEGSSNQSDEDED
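The %identity values reported for these hits can be roughly cross-checked from the alignment blocks themselves. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is counted over all aligned columns, gaps included; the helper name `percent_identity` and the toy strings are hypothetical.

```python
def percent_identity(query_blocks: list[str], midline_blocks: list[str]) -> float:
    """Estimate percent identity from paired query/midline alignment blocks.

    Assumptions: letters in the midline mark identical residues, and the
    denominator is the total number of aligned columns (query residues
    plus gap characters).
    """
    if len(query_blocks) != len(midline_blocks):
        raise ValueError("each query block needs a matching midline block")
    identical = 0
    columns = 0
    for query, midline in zip(query_blocks, midline_blocks):
        columns += len(query)
        # Trailing spaces of a midline are often trimmed, so count letters only.
        identical += sum(1 for ch in midline if ch.isalpha())
    if columns == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / columns


# Toy example (not taken from the record): 4 identical columns out of 5.
toy_identity = percent_identity(["MRKLC"], ["MRK+C"])
print(toy_identity)
```

To apply it to a hit, pass every `Query:` block and its midline in order, with the `Query:  ` prefix and the matching leading spaces stripped first.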

TrEMBL top hits (e value, %identity, alignment)
A0A6J1BVR5 uncharacterized protein LOC111006191 (e value: 0.0; %identity: 100)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
        PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV

Query:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
        AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
Subjt:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR

Query:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
        IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF
Subjt:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKF

Query:  PLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS
        PLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS
Subjt:  PLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS

A0A6J1FGT4 uncharacterized protein LOC111445268 (e value: 9.12e-264; %identity: 83.15)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+HAISWQAMKSWVKS   +K SH+ S+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+ ITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
        NPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKMVASEF+SGEG  NGK +KAKNGKGG   GEMG FV+WQKRP+LWCLE+MLSGCKISAGSDG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG

Query:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
        KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKT+N EDCFILKLEAES+VLRARSSS VEIIRHTVWGYFSQRTGLLV LEDSHL
Subjt:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL

Query:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG
        LRIK GGSRND++FWETTME+ IQDYRTIDGVNIAHAGKT+VSL RFG+ AEGHS+TKMEEIW+IEEVDFNIKGLSM+FFLPPSDLKKEEE +  I S+ 
Subjt:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG

Query:  KFPLTMR--CTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDED
        KFPL MR     A S+I  SRVAA+D DE SEGSS +SDED+D+D
Subjt:  KFPLTMR--CTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDED

A0A6J1GZA9 uncharacterized protein LOC111458877 (e value: 9.52e-286; %identity: 90.13)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKS-HNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGG----GEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGGGG    GEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGG----GEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT
        SHLLRIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVIT

Query:  SNGKFPLTMRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDE
        SNGK PLTMRC AAA   SKICSSRVAAIDVDE SEGS NQSDEDE
Subjt:  SNGKFPLTMRCTAAA---SKICSSRVAAIDVDEISEGSSNQSDEDE

A0A6J1JRQ6 uncharacterized protein LOC111489190 (e value: 2.25e-263; %identity: 83.82)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+HAISWQAMKSWVKS   +K SHV S+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+ ITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHK-SHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
        NPIEASMAKYIVQQY+AAVGGEHALNSI SMYAMGKVKMVASEF+SGEG  NGK +KAKNGK G   GEMG FV+WQKRP+LWCLE+MLSGCKISAGSDG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG

Query:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
        KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKT+N EDCFILKLEAES+VLRARSSS VEIIRHTVWGYFSQRTGLLV LEDSHL
Subjt:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL

Query:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG
        LRIK GGSRND +FWETTME+ I+DYRTIDGVNIAHAGKT+VSL RFG+ AEGHS+TKMEEIW+IEEVDFNIKGLSM+FFLPPSDLKKEEE + V+TS+ 
Subjt:  LRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNG

Query:  KFPLTMR-CTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS
        KFP TMR  + A S+I SSRVAA+D DE SEGSS +SDEDED+DS
Subjt:  KFPLTMR-CTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDEDS

A0A6J1JSU8 uncharacterized protein LOC111487531 (e value: 4.96e-286; %identity: 90.66)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKS-HVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS+  HKS HVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKS-HVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK

Query:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
        VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LEDSHLL
Subjt:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL

Query:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK
        RIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   TSNGK
Subjt:  RIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGK

Query:  FPLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDE
         PLTMRC A+ SKICS RVAAIDVDE SEGS NQSDEDE
Subjt:  FPLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDE

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT1G75160.1 Protein of unknown function (DUF620) (e value: 5.6e-112; %identity: 52)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSH-------NDHKSHVNSMAA------LFGGRNAEIQLLLGVVGAPLIPIPVP
        MRKLCPN DREDGL+TVLEVP+PEEMF+   +      W+ M + +K+H        D ++  +S +       L    + E   LL +VG+PLIP  VP
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSH-------NDHKSHVNSMAA------LFGGRNAEIQLLLGVVGAPLIPIPVP

Query:  FDHRPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELML
         +   ++R I D  IEAS AKYIVQQYVAA GG  ALN+++SMYA+G+V+M  SE  +GE    G  ++     G G  E+GGFV+WQK P LW LEL++
Subjt:  FDHRPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELML

Query:  SGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRT
        SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ C+GE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWGYFSQRT
Subjt:  SGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRT

Query:  GLLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE
        GLLV+  D+ L+R+K+G  +ND +FWET+ME++I DY  +D VNIAH G+T  +L+R+G +   + R ++EE W IEEVDFNI GL ++ FLPPSD+  +
Subjt:  GLLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE

AT3G19540.1 Protein of unknown function (DUF620) (e value: 6.7e-97; %identity: 45.43)
Query:  REDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDNPIEASMAKY
        R   L  V+E P P+E                +  WVK      S   S+AA    R  +++LLLGV+GAPL PI V         +IK+ PIE S A+Y
Subjt:  REDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDNPIEASMAKY

Query:  IVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHH
        I+QQY AA GG+   NSI++ YAMGK+KM+ SE  +          +    +     E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR TPW  
Subjt:  IVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHH

Query:  SHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRND
        SH ++GP RPLRR LQGLDP++TA +F+ + C+GEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++ G   +
Subjt:  SHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRND

Query:  SIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKFPLTMR---C
        ++FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A  H+RTKMEE W IEEV FN+ GLS+D F+PP+DLK        +T + ++P   R    
Subjt:  SIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKFPLTMR---C

Query:  TAAASKICSSRVAAID
        T A S    ++VAA++
Subjt:  TAAASKICSSRVAAID

AT3G55720.1 Protein of unknown function (DUF620) (e value: 6.2e-135; %identity: 56.98)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMK-SWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDH----RPITR
        MR LCPNFDREDGL+TVLEVP+PEE+F  + NK+ A  W+++K S ++S  D+ S   S+A LFGGR+++IQ+LLG+VGAP IP+P+  D      PI+ 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMK-SWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDH----RPITR

Query:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAK------NGKGGGGGEMGGFVVWQKRPELWCLELMLSG
         IK+  IE++MAKYIV+QY AA GGE AL+++ESMYAMGKVKM  +EF + + +LNGK  K        N   G GGEMGGFV+W+K    W LEL++SG
Subjt:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAK------NGKGGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTG
        CK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE + + L++RS S +E ++HTVWG F QRTG
Subjt:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL---K
        LLVQLED++L+RIK G    D + WETT ETLIQDY++IDG+ IAH GKT VSL R  ES E HS+T MEE WEIEEV FN+KGLS DFFLPP DL   +
Subjt:  LLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL---K

Query:  KEEEGVAVITSNGKFPLTMRCTAAASKICSSRVAAID
        +EE G +         L ++ +  + KI SS+V AI+
Subjt:  KEEEGVAVITSNGKFPLTMRCTAAASKICSSRVAAID

AT5G05840.1 Protein of unknown function (DUF620) (e value: 3.7e-164; %identity: 67.64)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKS-WVK--SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDH-----RP
        MRKLCPN++ EDGL+TVLEVP+PEE+F+ +  K     W  MKS W K  +     +   +M  LFGGRNAEIQLLLGVVGAPLIP+PV  DH      P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKS-WVK--SHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDH-----RP

Query:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKI
        I ++IKD P+E SMA+YIV+QY+AAVGG+ ALN++ESMYAMGKV+M ASEF +GEGSLN K++KA++ K  GGGE+GGFV+WQK  ELWCLEL++SGCKI
Subjt:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKI

Query:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
        SAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S CMGEK INDEDCFILKL+AE + L+ARSSS+VEIIRHTVWG FSQRTGLL+Q
Subjt:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ

Query:  LEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK---EEE
        LEDSHLLRIKA    ++SIFWETTME+LIQDYRT+DG+ +AHAGK+SVSLFRFGE+++ HSRT+MEE WEIEE+DFNIKGLSMD FLPPSDLKK   EEE
Subjt:  LEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK---EEE

Query:  GV--AVITSNGKFPLTMRCTAAASKICSSRVAAI----DVDEISE
         +   +  +N K P+ +R  +A+ +I SS+V AI    D  E++E
Subjt:  GV--AVITSNGKFPLTMRCTAAASKICSSRVAAI----DVDEISE

AT5G66740.1 Protein of unknown function (DUF620) (e value: 4.9e-116; %identity: 54.64)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN
        MRKLCPN D++DGL+TVLEVPIPEEMFS   N   A+ WQ M +W+K+    K     +AA    R  E++ LL +VG+PLIP+ V   H  + + +KD 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
         I+AS AKYIVQQY+AA GG  ALN++ SM   G+VKM ASEF  G+ S  G  LK+ +       EMGGFV+WQK P+LWCLEL++SGCK+  GS+G++
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV

Query:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
        +WR +    + AS G PRPLRRFLQGLDP+STA LF +++C+GEK IN EDCFILKLE    V  A+S  + EII HT+WGYFSQR+GLL+Q EDS LLR
Subjt:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR

Query:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        ++     ++ +FWET+ E+++ DYR +D VNIAH GKTSV++FR+GE++  H R +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  IKAGGSRNDSIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE


Sequences
CDS sequence
ATGAGGAAGCTTTGTCCGAATTTCGACCGAGAAGACGGTCTCGACACTGTCCTTGAGGTTCCCATTCCTGAGGAGATGTTCTCTTGCAACACTAACAAAACCCACGCGAT
TTCATGGCAAGCTATGAAATCGTGGGTGAAGTCCCATAATGATCATAAATCGCACGTGAATTCCATGGCTGCACTTTTCGGCGGCCGGAACGCCGAGATCCAGCTCCTCC
TCGGCGTCGTCGGAGCTCCTTTAATCCCTATCCCTGTCCCTTTCGATCACCGACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCGATGGCGAAGTATATA
GTGCAGCAATATGTGGCGGCGGTGGGAGGAGAGCATGCGTTGAACTCGATTGAGAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAATTCTCGTCCGGCGA
GGGGAGTTTGAACGGAAAGGTGTTAAAGGCGAAGAACGGGAAAGGCGGCGGCGGCGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAACGGCCGGAGTTATGGTGCTTGG
AACTGATGCTGTCCGGTTGTAAAATCAGCGCCGGCAGCGATGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCATTCTCATGCTTCTCGTGGCCCTCCTCGTCCCCTC
CGACGCTTCTTGCAGGGGCTTGATCCGAAATCGACGGCGACTCTGTTTTCGAATTCCAGCTGCATGGGCGAGAAAACAATCAACGATGAAGATTGTTTCATCCTAAAACT
GGAAGCGGAATCAACGGTGCTGAGAGCGAGAAGCAGTAGTAGCGTGGAGATCATCCGCCACACGGTTTGGGGATACTTCAGCCAGAGAACCGGCCTCCTGGTCCAGCTCG
AAGACTCGCATCTCCTCCGAATCAAGGCCGGTGGATCCCGAAACGACAGTATTTTCTGGGAAACGACAATGGAAACCCTAATTCAAGACTACAGAACGATCGACGGCGTC
AATATCGCACACGCCGGGAAAACGTCCGTTTCGCTCTTTCGATTCGGCGAGAGCGCTGAAGGCCATTCGAGAACGAAAATGGAAGAGATTTGGGAGATCGAAGAAGTGGA
TTTCAATATCAAGGGTTTGTCGATGGACTTCTTTTTGCCTCCTAGCGATCTGAAGAAGGAGGAAGAAGGAGTCGCCGTGATTACGAGTAACGGAAAGTTTCCGTTGACGA
TGCGATGTACGGCTGCTGCTTCGAAGATTTGTTCGTCGCGAGTGGCGGCCATTGATGTTGATGAAATATCGGAGGGGAGCAGCAACCAGAGTGATGAAGATGAAGATGAA
GATTCGTGA
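The CDS above can be checked against the listed protein by translating it codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code applies and that the sequence is in frame from its first base; `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons of the CDS shown above.
print(translate("ATGAGGAAGCTTTGTCCGAATTTCGAC"))
```

Passing the full CDS, with its line breaks left in, should give a string that can be compared directly with the protein sequence in this record.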
mRNA sequence
CCGACATAAAAACTAATATTTATGGTATATAACTGAGAATAATTATCTTCATAATTGGTATGAGATTTTTTGGGTGAACTCAAAAGTAAAGTCGTGTTATGTTCAAAACA
GATAATTTCATACTATTGTAGAGATACTTGGAATACGAAAGAACTGTTATCCAGAAAAGGAGGAAAAGAGACTGAAAACTATTTATTACATAACATTATCCTAAGAAGAA
AAACAAACCCATAAAAATTCAACAGACGAAAAGATTATGCAATTTGAAAACAAGAAAAAAAAAAGGTCAATACAGCAGTTGAAAAAAAGAAACTACAATAGAGGGAGGAA
GAGGAAGCCAGCCAGTTGATGTTGGAAAAAAGTGGCCCTATGGCTATGGTTGGGTTGAAAAGGAAGTAACTTGAGAAACATTACACTTACAAAACAAACTAATCAAAATA
ATAAATGAATAATGAACAATAATTCCCAAAGAAAGCCATCCATCGCCATCCCCCAATTTGAATACTAAATTAAAATTAATTACCACCAAATTAAGTTTCCAGAGAGAGAG
AGAGAGCGTGCAAACACGACATCTACTTAGCGGACCACGTTACATTGAGGGGCCATGGTCGGTTGGTATTGCATTTCCATTCTTCCTTTTCTTCTCATCTTTCTCCTCTC
TTTCACTTGATCATCACATATATTCATAATCCTCTCTCTGATCACTGATCTGATCTCTCTCACTGAAAAAATCATTTCTTTGATCTGATGAGGAAGCTTTGTCCGAATTT
CGACCGAGAAGACGGTCTCGACACTGTCCTTGAGGTTCCCATTCCTGAGGAGATGTTCTCTTGCAACACTAACAAAACCCACGCGATTTCATGGCAAGCTATGAAATCGT
GGGTGAAGTCCCATAATGATCATAAATCGCACGTGAATTCCATGGCTGCACTTTTCGGCGGCCGGAACGCCGAGATCCAGCTCCTCCTCGGCGTCGTCGGAGCTCCTTTA
ATCCCTATCCCTGTCCCTTTCGATCACCGACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCGATGGCGAAGTATATAGTGCAGCAATATGTGGCGGCGGT
GGGAGGAGAGCATGCGTTGAACTCGATTGAGAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAATTCTCGTCCGGCGAGGGGAGTTTGAACGGAAAGGTGT
TAAAGGCGAAGAACGGGAAAGGCGGCGGCGGCGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAACGGCCGGAGTTATGGTGCTTGGAACTGATGCTGTCCGGTTGTAAA
ATCAGCGCCGGCAGCGATGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCATTCTCATGCTTCTCGTGGCCCTCCTCGTCCCCTCCGACGCTTCTTGCAGGGGCTTGA
TCCGAAATCGACGGCGACTCTGTTTTCGAATTCCAGCTGCATGGGCGAGAAAACAATCAACGATGAAGATTGTTTCATCCTAAAACTGGAAGCGGAATCAACGGTGCTGA
GAGCGAGAAGCAGTAGTAGCGTGGAGATCATCCGCCACACGGTTTGGGGATACTTCAGCCAGAGAACCGGCCTCCTGGTCCAGCTCGAAGACTCGCATCTCCTCCGAATC
AAGGCCGGTGGATCCCGAAACGACAGTATTTTCTGGGAAACGACAATGGAAACCCTAATTCAAGACTACAGAACGATCGACGGCGTCAATATCGCACACGCCGGGAAAAC
GTCCGTTTCGCTCTTTCGATTCGGCGAGAGCGCTGAAGGCCATTCGAGAACGAAAATGGAAGAGATTTGGGAGATCGAAGAAGTGGATTTCAATATCAAGGGTTTGTCGA
TGGACTTCTTTTTGCCTCCTAGCGATCTGAAGAAGGAGGAAGAAGGAGTCGCCGTGATTACGAGTAACGGAAAGTTTCCGTTGACGATGCGATGTACGGCTGCTGCTTCG
AAGATTTGTTCGTCGCGAGTGGCGGCCATTGATGTTGATGAAATATCGGAGGGGAGCAGCAACCAGAGTGATGAAGATGAAGATGAAGATTCGTGAAGAGAACATTTGAA
ATCAATTTTTGCAACTCTGTACATAATCCATGGCTGATACATATATATGAAGCATGGACCTTTTACTAATTACAAACGCCAGTATTTTGGTTTTTGTTTTTCAAGATCGT
AACTGTACTCTGTATCATACAATTCTCGCCGACTCGAAGTCTGGTGGAAGAAATTACAATGTATTCAAATCACAAATCAAGGGTTTGTTTGGGAGGGTT
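Because the mRNA appears to contain the CDS as a contiguous substring, the untranslated regions can be recovered by locating the CDS within it. The sketch below assumes an exact, single match and uses short made-up sequences rather than the full record; `split_transcript` is a hypothetical helper.

```python
def split_transcript(mrna: str, cds: str) -> tuple[str, str, str]:
    """Return (5' UTR, CDS, 3' UTR) by locating the CDS inside the mRNA.

    Assumption: the CDS occurs in the mRNA as one exact, contiguous match.
    """
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Toy sequences for illustration only (not the full record).
five_utr, coding, three_utr = split_transcript(
    "CCGACATGAGGAAGTGAAGAG", "ATGAGGAAGTGA"
)
print(len(five_utr), len(coding), len(three_utr))
```

With the full sequences from this record, the lengths of the first and last elements would give the 5' and 3' UTR sizes.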
Protein sequence
MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHAISWQAMKSWVKSHNDHKSHVNSMAALFGGRNAEIQLLLGVVGAPLIPIPVPFDHRPITRNIKDNPIEASMAKYI
VQQYVAAVGGEHALNSIESMYAMGKVKMVASEFSSGEGSLNGKVLKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPL
RRFLQGLDPKSTATLFSNSSCMGEKTINDEDCFILKLEAESTVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRNDSIFWETTMETLIQDYRTIDGV
NIAHAGKTSVSLFRFGESAEGHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVAVITSNGKFPLTMRCTAAASKICSSRVAAIDVDEISEGSSNQSDEDEDE
DS