CuGenDBv2

Sgr023885 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023885
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF620)
Genome location: tig00001047:1139483..1141765
RNA-Seq Expression: Sgr023885
Synteny: Sgr023885
Gene Ontology terms: NA
InterPro domains: IPR006873 - Protein of unknown function DUF620


Homology

GenBank top hits (e value, % identity, alignment)
KAG6602636.1 hypothetical protein SDJN03_07869, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.1e-221; % identity: 88.2)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+K NH+   SH NSIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK G G  GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEK+INDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV
        EDSHLLRIKA GSRND+IFWETTMETLIQDYRTID VNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV

Query:  ITSNGKFPLTMRCTTA-----SKITSSRVAAIDVDESEESSHQSDEDED
         TSNGK PLTMRC  A     SKI SSRVAAIDVDES E S+QSDEDE+
Subjt:  ITSNGKFPLTMRCTTA-----SKITSSRVAAIDVDESEESSHQSDEDED

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia] (e value: 4.1e-229; % identity: 91.67)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTH ISWQAMKSW+KS++ H KSHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+PF H+PITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKMVASEFSSGEG  SLN KV+K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEK+INDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGE+AE HSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV VIT
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED
        SNGKFPLTMRCT  ASKI SSRVAAIDVDE SE SS+QSDEDED
Subjt:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED

XP_022957502.1 uncharacterized protein LOC111458877 [Cucurbita moschata] (e value: 1.5e-223; % identity: 88.86)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSNH+   SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK G G  GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEK+INDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV
        EDSHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV

Query:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS
         TSNGK PLTMRC  A    SKI SSRVAAIDVDES E S+QSDEDE++
Subjt:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS

XP_022990734.1 uncharacterized protein LOC111487531 [Cucurbita maxima] (e value: 1.8e-221; % identity: 88.46)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSN +H  SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEK+INDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG  T
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED
        SNGK PLTMRC  +     SRVAAIDVDES E S+QSDEDE+
Subjt:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida] (e value: 6.7e-224; % identity: 89.26)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDK-SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIK
        MRKLCPNFDRE GLDTVLEVPIPEEMFS  T KTHTISWQAMKSW+KSNH+HDK SHV SI+SLFGGRNAEIQLLLGVVGAPLIPLPI F  QPITRNIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDK-SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIK

Query:  DNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVK--TKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI
        DNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKM ASEF SGEG   LN K VK    NGK G GGGGEMGGFVVWQKRPELWCLELMLSGCKI
Subjt:  DNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVK--TKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI

Query:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
        SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEK+INDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
Subjt:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ

Query:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
        LEDSHLLRIK AGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGETAE HS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
Subjt:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG

Query:  VITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSDEDED
        +ITSNGKFP+TMRC+  S++ SSRV AID  + +ES  S+QSDEDED
Subjt:  VITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSDEDED

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KBJ0 Uncharacterized protein (e value: 1.5e-208; % identity: 85.75)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPF-HHQPITR-NI
        MRKLCPNFDRE GLDTVLEVPIPEEMFS NT KTH ISWQAMKSW+KSN     SH  SI SLFGGRNAEIQLLLGVVGAPLIPLPI F   QPI R NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPF-HHQPITR-NI

Query:  KDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAK-VVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI
        KDNPIEASMAKYIVQQY+AAVGGEHALN I+SMYAMGKVKM ASEF SGEG  ++  K   K   G  G GGGGEMGGFVVWQKRPELWCLELML G KI
Subjt:  KDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAK-VVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI

Query:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
        SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEK+INDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
Subjt:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ

Query:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
        LEDSHLLRIK AGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGETAE HS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
Subjt:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG

Query:  VI-TSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSD-EDED
        +I TS GKFPLTMRC+  S+  SSRVAAID  E EES  S++SD EDED
Subjt:  VI-TSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSD-EDED

A0A6J1BVR5 uncharacterized protein LOC111006191 (e value: 2.0e-229; % identity: 91.67)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTH ISWQAMKSW+KS++ H KSHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+PF H+PITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKMVASEFSSGEG  SLN KV+K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEK+INDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGE+AE HSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV VIT
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED
        SNGKFPLTMRCT  ASKI SSRVAAIDVDE SE SS+QSDEDED
Subjt:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED

A0A6J1FGT4 uncharacterized protein LOC111445268 (e value: 1.7e-209; % identity: 83.82)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+H ISWQAMKSW+KS++Y+  SH+ SIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQ ITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEF+SGEG    N K +K KNGK G    GEMG FV+WQKRP+LWCLE+MLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEK++N EDCFILKLEAESSVLRARSSS VEIIRHTVWGYFSQRTGLLV LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIK  GSRNDN+FWETTME+ IQDYRTIDGVNIAHAGKT+VSL RFG+ AE HS+TKMEEIW+IEEVDFNIKGLSM+FFLPPSDLKKEEE +G I 
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCTTA---SKITSSRVAAIDVDESEESSHQSDEDED
        S+ KFPL MR  +A   S+I  SRVAA+D DESE SS +SDED+D
Subjt:  SNGKFPLTMRCTTA---SKITSSRVAAIDVDESEESSHQSDEDED

A0A6J1GZA9 uncharacterized protein LOC111458877 (e value: 7.2e-224; % identity: 88.86)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSNH+   SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK G G  GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEK+INDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV
        EDSHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV

Query:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS
         TSNGK PLTMRC  A    SKI SSRVAAIDVDES E S+QSDEDE++
Subjt:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS

A0A6J1JSU8 uncharacterized protein LOC111487531 (e value: 8.9e-222; % identity: 88.46)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSN +H  SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEK+INDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG  T
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED
        SNGK PLTMRC  +     SRVAAIDVDES E S+QSDEDE+
Subjt:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G75160.1 Protein of unknown function (DUF620) (e value: 7.6e-109; % identity: 50.74)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHY------------HDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIP
        MRKLCPN DREDGL+TVLEVP+PEEMF+          W+ M + +K++                 S  N    L    + E   LL +VG+PLIP  +P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHY------------HDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIP

Query:  FHHQPITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCL
             ++R I D  IEAS AKYIVQQY+AA GG  ALN++ SMYA+G+V+M  SE  +GE            + GK    G  E+GGFV+WQK P LW L
Subjt:  FHHQPITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCL

Query:  ELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYF
        EL++SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWGYF
Subjt:  ELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYF

Query:  SQRTGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSD
        SQRTGLLV+  D+ L+R+K+   +ND +FWET+ME++I DY  +D VNIAH G+T  +L+R+G     + R ++EE W IEEVDFNI GL ++ FLPPSD
Subjt:  SQRTGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSD

Query:  LKKE
        +  +
Subjt:  LKKE

AT3G19540.1 Protein of unknown function (DUF620) (e value: 6.7e-97; % identity: 44.5)
Query:  REDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKDNPIEASMAK
        R   L  V+E P P+E                +  W+K       S   S+A+    R  +++LLLGV+GAPL P+ +         +IK+ PIE S A+
Subjt:  REDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKDNPIEASMAK

Query:  YIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQ
        YI+QQY AA GG+   NSI + YAMGK+KM+ SE       L    + V+ +N         E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR 
Subjt:  YIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQ

Query:  TPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAA
        TPW  SH ++GP RPLRR LQGLDP++TA +F+ + CIGEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++ 
Subjt:  TPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAA

Query:  GSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVITSNGKFPLTM
        G   + +FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A SH+RTKMEE W IEEV FN+ GLS+D F+PP+DLK      G +T + ++P   
Subjt:  GSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVITSNGKFPLTM

Query:  R----CTTASKITSSRVAAIDVDESEE
        R        S    ++VAA++    E+
Subjt:  R----CTTASKITSSRVAAIDVDESEE

AT3G55720.1 Protein of unknown function (DUF620) (e value: 3.6e-135; % identity: 57.5)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHH----QPITR
        MR LCPNFDREDGL+TVLEVP+PEE+F  + NK+   +W+++KS L  +   + S   S+A+LFGGR+++IQ+LLG+VGAP IPLPI         PI+ 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHH----QPITR

Query:  NIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNA----KVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELML
         IK+  IE++MAKYIV+QY AA GGE AL++++SMYAMGKVKM  +EF + +   +LN     K+V+ +N  +  G GGEMGGFV+W+K    W LEL++
Subjt:  NIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNA----KVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELML

Query:  SGCKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQR
        SGCK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK++N+E+CF+LKLE + S L++RS S +E ++HTVWG F QR
Subjt:  SGCKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQR

Query:  TGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL--
        TGLLVQLED++L+RIK      D + WETT ETLIQDY++IDG+ IAH GKT VSL R  E+ ESHS+T MEE WEIEEV FN+KGLS DFFLPP DL  
Subjt:  TGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL--

Query:  KKEEE---GVGVITSNGKFPLTMRCTTASKITSSRVAAID
        K+EEE     G  TS    PL +  TT+ KI SS+V AI+
Subjt:  KKEEE---GVGVITSNGKFPLTMRCTTASKITSSRVAAID

AT5G05840.1 Protein of unknown function (DUF620) (e value: 8.3e-164; % identity: 67.87)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKS-WLK-SNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI-PFHH----QP
        MRKLCPN++ EDGL+TVLEVP+PEE+F+ +  K     W  MKS W K +      +   ++  LFGGRNAEIQLLLGVVGAPLIPLP+ P HH     P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKS-WLK-SNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI-PFHH----QP

Query:  ITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLS
        I ++IKD P+E SMA+YIV+QYIAAVGG+ ALN+++SMYAMGKV+M ASEF +GEG  SLN+K+VK ++ KS   GGGE+GGFV+WQK  ELWCLEL++S
Subjt:  ITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLS

Query:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
        GCKISAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE S L+ARSSS+VEIIRHTVWG FSQRTG
Subjt:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK--
        LL+QLEDSHLLRIKA    +++IFWETTME+LIQDYRT+DG+ +AHAGK+SVSLFRFGE +++HSRT+MEE WEIEE+DFNIKGLSMD FLPPSDLKK  
Subjt:  LLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK--

Query:  -EEEGV--GVITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES
         EEE +  G+  +N K P+ +R + + +I+SS+V AI V+E +ES
Subjt:  -EEEGV--GVITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES

AT5G66740.1 Protein of unknown function (DUF620) (e value: 1.9e-115; % identity: 53.94)
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD
        MRKLCPN D++DGL+TVLEVPIPEEMFS   N    + WQ M +W+K+      S       L   R  E++ LL +VG+PLIPL +   H  + + +KD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
          I+AS AKYIVQQYIAA GG  ALN+++SM   G+VKM ASEF  G+ S  +N K               EMGGFV+WQK P+LWCLEL++SGCK+  G
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        S+G+++WR +    + AS G PRPLRRFLQGLDP+STA LF ++TCIGEK IN EDCFILKLE   +V  A+S  + EII HT+WGYFSQR+GLL+Q ED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        S LLR++     ++++FWET+ E+++ DYR +D VNIAH GKTSV++FR+GE + +H R +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE


Sequences

CDS sequence
ATGAGGAAGCTTTGTCCGAACTTCGACAGAGAAGACGGCCTCGACACCGTCCTCGAGGTTCCGATCCCTGAAGAGATGTTTTCCTGCAATACTAACAAGACCCATACGAT
TTCATGGCAAGCTATGAAGTCCTGGCTCAAGTCCAATCATTATCACGATAAATCCCACGTTAATTCCATAGCTTCCCTTTTCGGCGGCCGCAACGCAGAGATCCAGCTCC
TCCTTGGCGTCGTCGGAGCTCCCTTGATCCCTCTCCCCATCCCTTTCCATCATCAACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCCATGGCGAAGTAC
ATAGTGCAACAGTATATAGCGGCGGTGGGAGGGGAGCATGCGTTGAACTCGATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTCTCCTCCGG
CGAAGGGAGTTTGAGTTTGAACGCCAAGGTGGTGAAGACGAAGAACGGGAAAAGCGGTGCCGGCGGTGGTGGGGAGATGGGTGGGTTTGTGGTGTGGCAGAAGCGGCCGG
AATTATGGTGCTTGGAACTGATGCTCTCCGGCTGCAAGATCAGCGCCGGCAGCGACGGCAAGGTCGCTTGGAGACAAACTCCATGGCATCACTCCCATGCTTCTCGCGGC
CCTCCTCGTCCCCTCCGACGCTTCTTGCAGGGACTTGATCCGAAATCGACGGCAACTCTGTTTTCGAACTCCACCTGCATCGGCGAGAAGTCAATCAACGATGAAGACTG
CTTCATTCTAAAACTGGAAGCCGAATCCTCGGTCCTAAGAGCCAGAAGCAGTAGCAGCGTCGAGATCATCCGCCACACCGTGTGGGGATACTTCAGCCAGAGAACCGGCC
TCCTCGTCCAGCTCGAAGACTCCCACCTCCTCCGCATCAAAGCCGCCGGCTCTCGAAACGACAACATCTTCTGGGAAACCACAATGGAAACCCTAATCCAAGACTACAGG
ACCATCGACGGCGTCAACATCGCACACGCCGGGAAAACGTCCGTTTCGCTTTTCCGGTTCGGCGAAACCGCAGAGAGCCATTCCAGAACGAAAATGGAAGAGATTTGGGA
GATCGAGGAAGTGGATTTCAACATCAAGGGTTTGTCCATGGACTTCTTTTTGCCTCCGAGTGACCTGAAGAAGGAAGAAGAAGGAGTCGGTGTAATAACAAGTAACGGAA
AGTTTCCGTTGACGATGCGATGTACGACCGCTTCGAAGATTACCTCGTCCAGAGTGGCGGCCATCGATGTTGATGAATCGGAGGAAAGCAGCCACCAGAGTGATGAAGAT
GAAGATTCGTGA
mRNA sequence
ATGAGGAAGCTTTGTCCGAACTTCGACAGAGAAGACGGCCTCGACACCGTCCTCGAGGTTCCGATCCCTGAAGAGATGTTTTCCTGCAATACTAACAAGACCCATACGAT
TTCATGGCAAGCTATGAAGTCCTGGCTCAAGTCCAATCATTATCACGATAAATCCCACGTTAATTCCATAGCTTCCCTTTTCGGCGGCCGCAACGCAGAGATCCAGCTCC
TCCTTGGCGTCGTCGGAGCTCCCTTGATCCCTCTCCCCATCCCTTTCCATCATCAACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCCATGGCGAAGTAC
ATAGTGCAACAGTATATAGCGGCGGTGGGAGGGGAGCATGCGTTGAACTCGATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTCTCCTCCGG
CGAAGGGAGTTTGAGTTTGAACGCCAAGGTGGTGAAGACGAAGAACGGGAAAAGCGGTGCCGGCGGTGGTGGGGAGATGGGTGGGTTTGTGGTGTGGCAGAAGCGGCCGG
AATTATGGTGCTTGGAACTGATGCTCTCCGGCTGCAAGATCAGCGCCGGCAGCGACGGCAAGGTCGCTTGGAGACAAACTCCATGGCATCACTCCCATGCTTCTCGCGGC
CCTCCTCGTCCCCTCCGACGCTTCTTGCAGGGACTTGATCCGAAATCGACGGCAACTCTGTTTTCGAACTCCACCTGCATCGGCGAGAAGTCAATCAACGATGAAGACTG
CTTCATTCTAAAACTGGAAGCCGAATCCTCGGTCCTAAGAGCCAGAAGCAGTAGCAGCGTCGAGATCATCCGCCACACCGTGTGGGGATACTTCAGCCAGAGAACCGGCC
TCCTCGTCCAGCTCGAAGACTCCCACCTCCTCCGCATCAAAGCCGCCGGCTCTCGAAACGACAACATCTTCTGGGAAACCACAATGGAAACCCTAATCCAAGACTACAGG
ACCATCGACGGCGTCAACATCGCACACGCCGGGAAAACGTCCGTTTCGCTTTTCCGGTTCGGCGAAACCGCAGAGAGCCATTCCAGAACGAAAATGGAAGAGATTTGGGA
GATCGAGGAAGTGGATTTCAACATCAAGGGTTTGTCCATGGACTTCTTTTTGCCTCCGAGTGACCTGAAGAAGGAAGAAGAAGGAGTCGGTGTAATAACAAGTAACGGAA
AGTTTCCGTTGACGATGCGATGTACGACCGCTTCGAAGATTACCTCGTCCAGAGTGGCGGCCATCGATGTTGATGAATCGGAGGAAAGCAGCCACCAGAGTGATGAAGAT
GAAGATTCGTGA
Protein sequence
MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPFHHQPITRNIKDNPIEASMAKY
IVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRG
PPRPLRRFLQGLDPKSTATLFSNSTCIGEKSINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYR
TIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVITSNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDED
EDS