; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr001666 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr001666
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationtig00001058:12554..14822
RNA-Seq ExpressionSgr001666
SyntenySgr001666
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602636.1 hypothetical protein SDJN03_07869, partial [Cucurbita argyrosperma subsp. sororia]4.1e-22188.2Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+K NH+   SH NSIASLFGGRNAEIQLLLGVVGAPLIPLPI  HHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK G G  GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV
        EDSHLLRIKA GSRND+IFWETTMETLIQDYRTID VNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV

Query:  ITSNGKFPLTMRCTTA-----SKITSSRVAAIDVDESEESSHQSDEDED
         TSNGK PLTMRC  A     SKI SSRVAAIDVDES E S+QSDEDE+
Subjt:  ITSNGKFPLTMRCTTA-----SKITSSRVAAIDVDESEESSHQSDEDED

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia]1.5e-22891.67Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTH ISWQAMKSW+KS++ H KSHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+P  H+PITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKMVASEFSSGEG  SLN KV+K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGE+AE HSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV VIT
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED
        SNGKFPLTMRCT  ASKI SSRVAAIDVDE SE SS+QSDEDED
Subjt:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED

XP_022957502.1 uncharacterized protein LOC111458877 [Cucurbita moschata]5.7e-22388.86Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSNH+   SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI  HHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK G G  GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV
        EDSHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV

Query:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS
         TSNGK PLTMRC  A    SKI SSRVAAIDVDES E S+QSDEDE++
Subjt:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS

XP_022990734.1 uncharacterized protein LOC111487531 [Cucurbita maxima]6.9e-22188.46Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSN +H  SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI  HHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG  T
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED
        SNGK PLTMRC  +     SRVAAIDVDES E S+QSDEDE+
Subjt:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida]2.6e-22389.26Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDK-SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIK
        MRKLCPNFDRE GLDTVLEVPIPEEMFS  T KTHTISWQAMKSW+KSNH+HDK SHV SI+SLFGGRNAEIQLLLGVVGAPLIPLPI    QPITRNIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDK-SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIK

Query:  DNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVK--TKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI
        DNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKM ASEF SGEG   LN K VK    NGK G GGGGEMGGFVVWQKRPELWCLELMLSGCKI
Subjt:  DNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVK--TKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI

Query:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
        SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
Subjt:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ

Query:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
        LEDSHLLRIK AGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGETAE HS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
Subjt:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG

Query:  VITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSDEDED
        +ITSNGKFP+TMRC+  S++ SSRV AID  + +ES  S+QSDEDED
Subjt:  VITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSDEDED

TrEMBL top hitse value%identityAlignment
A0A0A0KBJ0 Uncharacterized protein3.3e-20885.75Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIP-SHHQPITR-NI
        MRKLCPNFDRE GLDTVLEVPIPEEMFS NT KTH ISWQAMKSW+KSN     SH  SI SLFGGRNAEIQLLLGVVGAPLIPLPI     QPI R NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIP-SHHQPITR-NI

Query:  KDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAK-VVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI
        KDNPIEASMAKYIVQQY+AAVGGEHALN I+SMYAMGKVKM ASEF SGEG  ++  K   K   G  G GGGGEMGGFVVWQKRPELWCLELML G KI
Subjt:  KDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAK-VVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKI

Query:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
        SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ
Subjt:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQ

Query:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
        LEDSHLLRIK AGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGETAE HS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG
Subjt:  LEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG

Query:  VI-TSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSD-EDED
        +I TS GKFPLTMRC+  S+  SSRVAAID  E EES  S++SD EDED
Subjt:  VI-TSNGKFPLTMRCTTASKITSSRVAAIDVDESEES--SHQSD-EDED

A0A6J1BVR5 uncharacterized protein LOC1110061917.5e-22991.67Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTH ISWQAMKSW+KS++ H KSHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+P  H+PITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSI+SMYAMGKVKMVASEFSSGEG  SLN KV+K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGE+AE HSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV VIT
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED
        SNGKFPLTMRCT  ASKI SSRVAAIDVDE SE SS+QSDEDED
Subjt:  SNGKFPLTMRCT-TASKITSSRVAAIDVDE-SEESSHQSDEDED

A0A6J1FGT4 uncharacterized protein LOC1114452685.0e-20983.82Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+H ISWQAMKSW+KS++Y+  SH+ SIASLFGGRNAEIQLLLGVVGAPLIPLPI  HHQ ITRNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEF+SGEG    N K +K KNGK G    GEMG FV+WQKRP+LWCLE+MLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKT+N EDCFILKLEAESSVLRARSSS VEIIRHTVWGYFSQRTGLLV LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIK  GSRNDN+FWETTME+ IQDYRTIDGVNIAHAGKT+VSL RFG+ AE HS+TKMEEIW+IEEVDFNIKGLSM+FFLPPSDLKKEEE +G I 
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCTTA---SKITSSRVAAIDVDESEESSHQSDEDED
        S+ KFPL MR  +A   S+I  SRVAA+D DESE SS +SDED+D
Subjt:  SNGKFPLTMRCTTA---SKITSSRVAAIDVDESEESSHQSDEDED

A0A6J1GZA9 uncharacterized protein LOC1114588772.8e-22388.86Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSNH+   SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI  HHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK G G  GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAG--GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV
        EDSHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGV

Query:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS
         TSNGK PLTMRC  A    SKI SSRVAAIDVDES E S+QSDEDE++
Subjt:  ITSNGKFPLTMRCTTA----SKITSSRVAAIDVDESEESSHQSDEDEDS

A0A6J1JSU8 uncharacterized protein LOC1114875313.4e-22188.46Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK H ISWQAMK+W+KSN +H  SHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI  HHQPIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEFSSGE    LN K  K KNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT
        SHLLRIKA GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AE HS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG  T
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVIT

Query:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED
        SNGK PLTMRC  +     SRVAAIDVDES E S+QSDEDE+
Subjt:  SNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDEDED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75160.1 Protein of unknown function (DUF620)1.7e-10850.74Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHY------------HDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIP
        MRKLCPN DREDGL+TVLEVP+PEEMF+          W+ M + +K++                 S  N    L    + E   LL +VG+PLIP  +P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHY------------HDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIP

Query:  SHHQPITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCL
             ++R I D  IEAS AKYIVQQY+AA GG  ALN++ SMYA+G+V+M  SE  +GE            + GK    G  E+GGFV+WQK P LW L
Subjt:  SHHQPITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCL

Query:  ELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYF
        EL++SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWGYF
Subjt:  ELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYF

Query:  SQRTGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSD
        SQRTGLLV+  D+ L+R+K+   +ND +FWET+ME++I DY  +D VNIAH G+T  +L+R+G     + R ++EE W IEEVDFNI GL ++ FLPPSD
Subjt:  SQRTGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSD

Query:  LKKE
        +  +
Subjt:  LKKE

AT3G19540.1 Protein of unknown function (DUF620)2.3e-9744.73Show/hide
Query:  REDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKDNPIEASMAK
        R   L  V+E P P+E                +  W+K       S   S+A+    R  +++LLLGV+GAPL P+ + S       +IK+ PIE S A+
Subjt:  REDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKDNPIEASMAK

Query:  YIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQ
        YI+QQY AA GG+   NSI + YAMGK+KM+ SE       L    + V+ +N         E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR 
Subjt:  YIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQ

Query:  TPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAA
        TPW  SH ++GP RPLRR LQGLDP++TA +F+ + CIGEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++ 
Subjt:  TPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAA

Query:  GSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVITSNGKFPLTM
        G   + +FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A SH+RTKMEE W IEEV FN+ GLS+D F+PP+DLK      G +T + ++P   
Subjt:  GSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVITSNGKFPLTM

Query:  R----CTTASKITSSRVAAIDVDESEE
        R        S    ++VAA++    E+
Subjt:  R----CTTASKITSSRVAAIDVDESEE

AT3G55720.1 Protein of unknown function (DUF620)1.3e-13557.73Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHH----QPITR
        MR LCPNFDREDGL+TVLEVP+PEE+F  + NK+   +W+++KS L  +   + S   S+A+LFGGR+++IQ+LLG+VGAP IPLPI S       PI+ 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHH----QPITR

Query:  NIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNA----KVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELML
         IK+  IE++MAKYIV+QY AA GGE AL++++SMYAMGKVKM  +EF + +   +LN     K+V+ +N  +  G GGEMGGFV+W+K    W LEL++
Subjt:  NIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNA----KVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELML

Query:  SGCKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQR
        SGCK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE + S L++RS S +E ++HTVWG F QR
Subjt:  SGCKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQR

Query:  TGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL--
        TGLLVQLED++L+RIK      D + WETT ETLIQDY++IDG+ IAH GKT VSL R  E+ ESHS+T MEE WEIEEV FN+KGLS DFFLPP DL  
Subjt:  TGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL--

Query:  KKEEE---GVGVITSNGKFPLTMRCTTASKITSSRVAAID
        K+EEE     G  TS    PL +  TT+ KI SS+V AI+
Subjt:  KKEEE---GVGVITSNGKFPLTMRCTTASKITSSRVAAID

AT5G05840.1 Protein of unknown function (DUF620)6.4e-16467.87Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKS-WLK-SNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI-PSHH----QP
        MRKLCPN++ EDGL+TVLEVP+PEE+F+ +  K     W  MKS W K +      +   ++  LFGGRNAEIQLLLGVVGAPLIPLP+ P HH     P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKS-WLK-SNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPI-PSHH----QP

Query:  ITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLS
        I ++IKD P+E SMA+YIV+QYIAAVGG+ ALN+++SMYAMGKV+M ASEF +GEG  SLN+K+VK ++ KS   GGGE+GGFV+WQK  ELWCLEL++S
Subjt:  ITRNIKDNPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLS

Query:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
        GCKISAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE S L+ARSSS+VEIIRHTVWG FSQRTG
Subjt:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK--
        LL+QLEDSHLLRIKA    +++IFWETTME+LIQDYRT+DG+ +AHAGK+SVSLFRFGE +++HSRT+MEE WEIEE+DFNIKGLSMD FLPPSDLKK  
Subjt:  LLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK--

Query:  -EEEGV--GVITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES
         EEE +  G+  +N K P+ +R + + +I+SS+V AI V+E +ES
Subjt:  -EEEGV--GVITSNGKFPLTMRCTTASKITSSRVAAIDVDESEES

AT5G66740.1 Protein of unknown function (DUF620)2.5e-11553.94Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD
        MRKLCPN D++DGL+TVLEVPIPEEMFS   N    + WQ M +W+K+      S       L   R  E++ LL +VG+PLIPL +   H  + + +KD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKD

Query:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
          I+AS AKYIVQQYIAA GG  ALN+++SM   G+VKM ASEF  G+ S  +N K               EMGGFV+WQK P+LWCLEL++SGCK+  G
Subjt:  NPIEASMAKYIVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        S+G+++WR +    + AS G PRPLRRFLQGLDP+STA LF ++TCIGEK IN EDCFILKLE   +V  A+S  + EII HT+WGYFSQR+GLL+Q ED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        S LLR++     ++++FWET+ E+++ DYR +D VNIAH GKTSV++FR+GE + +H R +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  SHLLRIKAAGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGCTTTGTCCGAACTTCGACAGAGAAGACGGCCTCGACACCGTCCTCGAGGTTCCGATTCCTGAAGAGATGTTTTCCTGCAATACTAACAAGACCCATACGAT
TTCATGGCAAGCTATGAAGTCCTGGCTCAAGTCCAATCATTATCACGATAAATCCCACGTTAATTCCATAGCTTCCCTTTTCGGCGGCCGCAACGCCGAGATCCAGCTCC
TCCTTGGCGTCGTCGGAGCTCCCTTGATCCCTCTCCCCATCCCTTCCCATCATCAACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCCATGGCGAAGTAC
ATAGTGCAACAGTATATAGCGGCGGTGGGAGGGGAGCATGCGTTGAACTCGATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTCTCCTCAGG
CGAAGGGAGTTTGAGTTTGAACGCCAAGGTGGTGAAGACGAAGAACGGGAAAAGCGGTGCCGGCGGTGGTGGGGAGATGGGTGGGTTTGTGGTGTGGCAGAAGCGGCCGG
AATTATGGTGCTTGGAACTGATGCTCTCCGGCTGCAAGATCAGCGCCGGCAGCGACGGCAAGGTCGCTTGGAGACAAACTCCATGGCATCACTCCCATGCTTCTCGCGGC
CCTCCTCGTCCCCTCCGACGCTTCTTGCAGGGACTTGATCCGAAATCGACGGCAACTCTGTTTTCGAACTCCACCTGCATCGGCGAGAAGACAATCAACGATGAAGACTG
CTTCATTCTAAAACTGGAAGCCGAATCCTCGGTCCTAAGAGCCAGAAGCAGTAGCAGCGTCGAGATCATCCGCCACACCGTATGGGGATACTTCAGCCAGAGAACCGGCC
TCCTCGTCCAGCTCGAAGACTCCCACCTCCTCCGGATCAAAGCCGCCGGCTCTCGAAACGACAACATCTTCTGGGAAACCACAATGGAAACCCTAATCCAAGACTACAGG
ACCATCGACGGCGTCAACATCGCACACGCCGGGAAGACATCCGTTTCGCTTTTCCGATTCGGCGAAACCGCAGAAAGCCATTCCAGAACGAAAATGGAAGAGATTTGGGA
GATCGAAGAAGTGGATTTCAACATCAAGGGTTTGTCCATGGACTTCTTTTTGCCTCCGAGTGACCTGAAGAAGGAAGAAGAAGGAGTCGGTGTAATAACAAGTAACGGAA
AGTTTCCGTTGACGATGCGATGTACGACCGCTTCGAAGATTACCTCGTCTAGAGTGGCGGCCATTGATGTTGATGAATCGGAGGAAAGCAGCCATCAGAGTGATGAAGAT
GAAGATTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGCTTTGTCCGAACTTCGACAGAGAAGACGGCCTCGACACCGTCCTCGAGGTTCCGATTCCTGAAGAGATGTTTTCCTGCAATACTAACAAGACCCATACGAT
TTCATGGCAAGCTATGAAGTCCTGGCTCAAGTCCAATCATTATCACGATAAATCCCACGTTAATTCCATAGCTTCCCTTTTCGGCGGCCGCAACGCCGAGATCCAGCTCC
TCCTTGGCGTCGTCGGAGCTCCCTTGATCCCTCTCCCCATCCCTTCCCATCATCAACCCATTACCCGCAACATCAAAGACAACCCCATTGAGGCGTCCATGGCGAAGTAC
ATAGTGCAACAGTATATAGCGGCGGTGGGAGGGGAGCATGCGTTGAACTCGATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTCTCCTCAGG
CGAAGGGAGTTTGAGTTTGAACGCCAAGGTGGTGAAGACGAAGAACGGGAAAAGCGGTGCCGGCGGTGGTGGGGAGATGGGTGGGTTTGTGGTGTGGCAGAAGCGGCCGG
AATTATGGTGCTTGGAACTGATGCTCTCCGGCTGCAAGATCAGCGCCGGCAGCGACGGCAAGGTCGCTTGGAGACAAACTCCATGGCATCACTCCCATGCTTCTCGCGGC
CCTCCTCGTCCCCTCCGACGCTTCTTGCAGGGACTTGATCCGAAATCGACGGCAACTCTGTTTTCGAACTCCACCTGCATCGGCGAGAAGACAATCAACGATGAAGACTG
CTTCATTCTAAAACTGGAAGCCGAATCCTCGGTCCTAAGAGCCAGAAGCAGTAGCAGCGTCGAGATCATCCGCCACACCGTATGGGGATACTTCAGCCAGAGAACCGGCC
TCCTCGTCCAGCTCGAAGACTCCCACCTCCTCCGGATCAAAGCCGCCGGCTCTCGAAACGACAACATCTTCTGGGAAACCACAATGGAAACCCTAATCCAAGACTACAGG
ACCATCGACGGCGTCAACATCGCACACGCCGGGAAGACATCCGTTTCGCTTTTCCGATTCGGCGAAACCGCAGAAAGCCATTCCAGAACGAAAATGGAAGAGATTTGGGA
GATCGAAGAAGTGGATTTCAACATCAAGGGTTTGTCCATGGACTTCTTTTTGCCTCCGAGTGACCTGAAGAAGGAAGAAGAAGGAGTCGGTGTAATAACAAGTAACGGAA
AGTTTCCGTTGACGATGCGATGTACGACCGCTTCGAAGATTACCTCGTCTAGAGTGGCGGCCATTGATGTTGATGAATCGGAGGAAAGCAGCCATCAGAGTGATGAAGAT
GAAGATTCGTGA
Protein sequenceShow/hide protein sequence
MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKTHTISWQAMKSWLKSNHYHDKSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIPSHHQPITRNIKDNPIEASMAKY
IVQQYIAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEGSLSLNAKVVKTKNGKSGAGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRG
PPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAAGSRNDNIFWETTMETLIQDYR
TIDGVNIAHAGKTSVSLFRFGETAESHSRTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVITSNGKFPLTMRCTTASKITSSRVAAIDVDESEESSHQSDED
EDS