; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G030500 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G030500
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationCmo_Chr04:21352322..21354420
RNA-Seq ExpressionCmoCh04G030500
SyntenyCmoCh04G030500
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602636.1 hypothetical protein SDJN03_07869, partial [Cucurbita argyrosperma subsp. sororia]7.4e-25598.88Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVK NHHDKSSH NSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
        SHLLRIKAGGSRNDDIFWETTMETLIQDYRTID VNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS

Query:  NGKLPLTMRC-AAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP
        NGKLPLTMRC AAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEE EP
Subjt:  NGKLPLTMRC-AAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia]8.0e-22589.53Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGA-T
        SHLLRIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGA-T

Query:  SNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDE-SEG-SNQSDEDEEAE
        SNGK PLTMRC AAA   SKICSSRVAAIDVDE SEG SNQSDEDE+ +
Subjt:  SNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDE-SEG-SNQSDEDEEAE

XP_022957502.1 uncharacterized protein LOC111458877 [Cucurbita moschata]1.9e-258100Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
        SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS

Query:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP
        NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP
Subjt:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP

XP_022990734.1 uncharacterized protein LOC111487531 [Cucurbita maxima]5.7e-24796.87Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSN H KSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEKTINDEDCFIL LEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
        SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS

Query:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP
        NGKLPLTMRC   AA+GSKIC SRVAAIDVDESEGSNQSDEDEE EP
Subjt:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida]1.2e-22088.25Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSN-HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIK
        MRKLCPNFDRE GLDTVLEVPIPEEMFS  T K H ISWQAMK+WVKSN HHDKSSHV SI+SLFGGRNAEIQLLLGVVGAPLIPLPI F  QPIT NIK
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSN-HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATK--VKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKI
        DNPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKM ASEF SGE CLNGKA K    NGKGGGG  GGGEMGGFVVWQKRPELWCLELMLSGCKI
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATK--VKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKI

Query:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVE
        SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+
Subjt:  SAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVE

Query:  LEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVG
        LEDSHLLRIK  GSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG
Subjt:  LEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVG

Query:  -ATSNGKLPLTMRCAAAAAAGSKICSSRVAAI---DVDESEGSNQSDEDEE
          TSNGK P+TMRC    +AGS++ SSRV AI   D DESE SNQSDEDE+
Subjt:  -ATSNGKLPLTMRCAAAAAAGSKICSSRVAAI---DVDESEGSNQSDEDEE

TrEMBL top hitse value%identityAlignment
A0A6J1BVR5 uncharacterized protein LOC1110061913.9e-22589.53Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK HAISWQAMK+WVKS H+D  SHVNS+A+LFGGRNAEIQLLLGVVGAPLIP+P+ F H+PIT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKMVASEFSSGE  LNGK  K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGA-T
        SHLLRIKAGGSRND IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGESAEGHS+TKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGV   T
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGA-T

Query:  SNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDE-SEG-SNQSDEDEEAE
        SNGK PLTMRC AAA   SKICSSRVAAIDVDE SEG SNQSDEDE+ +
Subjt:  SNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDE-SEG-SNQSDEDEEAE

A0A6J1FGT4 uncharacterized protein LOC1114452689.0e-21483.67Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+HAISWQAMK+WVKS++++K SH+ SIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQ IT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMVASEF+SGE C NGK  K KNGKGG   +  GEMG FV+WQKRP+LWCLE+MLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKT+N EDCFILKLEAES VLRARSSS VEIIRHTVWGYFSQRTGLLV LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGA-T
        SHLLRIK GGSRND++FWETTME+ IQDYRTIDGVNIAHAGKTTVSL RFG+ AEGHSKTKMEE W+IEEVDFNI+GLSM+FFLPPSDLKKE+E +GA  
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGA-T

Query:  SNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAE
        S+ K PL MR   +A AGS+I  SRVAA+D DESEGS++SDED++ +
Subjt:  SNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAE

A0A6J1GZA9 uncharacterized protein LOC1114588779.1e-259100Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
        SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS

Query:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP
        NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP
Subjt:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP

A0A6J1JRQ6 uncharacterized protein LOC1114891901.4e-21184.01Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T K+HAISWQAMK+WVKS++++K SHV SIASLFGGRNAEIQLLLGVVGAPLIPLPI FHHQ IT NIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQY+AAVGGEHALNSI SMYAMGKVKMVASEF+SGE   NGK  K KNGK G   +  GEMG FV+WQKRP+LWCLE+MLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKT+N EDCFILKLEAES VLRARSSS VEIIRHTVWGYFSQRTGLLV LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
        SHLLRIK GGSRNDD+FWETTME+ I+DYRTIDGVNIAHAGKTTVSL RFG+ AEGHSKTKMEE W+IEEVDFNI+GLSM+FFLPPSDLKKE+E    TS
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS

Query:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEE
        + K P TMR   A+ AGS+I SSRVAA+D DESEGS++SDEDE+
Subjt:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEE

A0A6J1JSU8 uncharacterized protein LOC1114875312.8e-24796.87Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSN H KSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEKTINDEDCFIL LEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
        SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATS

Query:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP
        NGKLPLTMRC   AA+GSKIC SRVAAIDVDESEGSNQSDEDEE EP
Subjt:  NGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDEDEEAEP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75160.1 Protein of unknown function (DUF620)8.2e-11151.97Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSN------------HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIP--LP
        MRKLCPN DREDGL+TVLEVP+PEEMF+   + A    W+ M   +K++                SS  N    L    + E   LL +VG+PLIP  +P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSN------------HHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIP--LP

Query:  IIFHHQPITCNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELW
        + F    ++  I D  IEAS AKYIVQQYVAA GG  ALN++ SMYA+G+V+M  SE  +GE    G  T V+ GK      G  E+GGFV+WQK P LW
Subjt:  IIFHHQPITCNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELW

Query:  CLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWG
         LEL++SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWG
Subjt:  CLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWG

Query:  YFSQRTGLLVELEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPP
        YFSQRTGLLV+  D+ L+R+K+G  +ND +FWET+ME++I DY  +D VNIAH G+T  +L+R+G +   + + ++EE W IEEVDFNI GL ++ FLPP
Subjt:  YFSQRTGLLVELEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPP

Query:  SDLKKE
        SD+  +
Subjt:  SDLKKE

AT3G19540.1 Protein of unknown function (DUF620)2.6e-9646.98Show/hide
Query:  REDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKDNPIEASMAK
        R   L  V+E P P+E                +  WVK     + S   S+A+    R  +++LLLGV+GAPL P+ +         +IK+ PIE S A+
Subjt:  REDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKDNPIEASMAK

Query:  YIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQ
        YI+QQY AA GG+   NSI + YAMGK+KM+ SE  +        AT+    +         E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR 
Subjt:  YIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQ

Query:  TPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELEDSHLLRIKAG
        TPW  SH ++GP RPLRR LQGLDP++TA +F+ + CIGEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++ 
Subjt:  TPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELEDSHLLRIKAG

Query:  GSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLK
        G   + +FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A  H++TKMEE+W IEEV FN+ GLS+D F+PP+DLK
Subjt:  GSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLK

AT3G55720.1 Protein of unknown function (DUF620)8.5e-13256.28Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHH----QPITC
        MR LCPNFDREDGL+TVLEVP+PEE+F  + NK+ A  W+++K+ +  +  D SS   S+A+LFGGR+++IQ+LLG+VGAP IPLPI         PI+ 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHH----QPITC

Query:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATK----VKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELML
         IK+  IE++MAKYIV+QY AA GGE AL++++SMYAMGKVKM  +EF + +  LNGK  K    ++N     G   GGEMGGFV+W+K    W LEL++
Subjt:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATK----VKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELML

Query:  SGCKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQR
        SGCK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE +   L++RS S +E ++HTVWG F QR
Subjt:  SGCKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQR

Query:  TGLLVELEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDL-K
        TGLLV+LED++L+RIK G    D + WETT ETLIQDY++IDG+ IAH GKT VSL R  ES E HSKT MEE+WEIEEV FN++GLS DFFLPP DL  
Subjt:  TGLLVELEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDL-K

Query:  KEKEGVGATSNGKLPLTMRCAAAAAAGSKICSSRVAAI-DVDESEG
        KE+E  G +        +     +    KI SS+V AI D  E EG
Subjt:  KEKEGVGATSNGKLPLTMRCAAAAAAGSKICSSRVAAI-DVDESEG

AT5G05840.1 Protein of unknown function (DUF620)1.5e-15764.1Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKT-WVK-SNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHH-----QP
        MRKLCPN++ EDGL+TVLEVP+PEE+F+ +  K     W  MK+ W K +     ++   ++  LFGGRNAEIQLLLGVVGAPLIPLP+   H      P
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKT-WVK-SNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHH-----QP

Query:  ITCNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLS
        I  +IKD P+E SMA+YIV+QY+AAVGG+ ALN+++SMYAMGKV+M ASEF +GE  LN K  K ++ K      GGGE+GGFV+WQK  ELWCLEL++S
Subjt:  ITCNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLS

Query:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTG
        GCKISAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE   L+ARSSS+VEIIRHTVWG FSQRTG
Subjt:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVELEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKK--
        LL++LEDSHLLRIKA    ++ IFWETTME+LIQDYRT+DG+ +AHAGK++VSLFRFGE+++ HS+T+MEETWEIEE+DFNI+GLSMD FLPPSDLKK  
Subjt:  LLVELEDSHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKK--

Query:  -EKEGVG---ATSNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDE
         E+E +    A +N KLP+ +R     +A  +I SS+V AI V+E + S  ++E
Subjt:  -EKEGVG---ATSNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSDE

AT5G66740.1 Protein of unknown function (DUF620)5.9e-11753.94Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD
        MRKLCPN D++DGL+TVLEVPIPEEMFS   N   A+ WQ M TW+K+   DK S       L   R  E++ LL +VG+PLIPL +   H  +   +KD
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG
          I+AS AKYIVQQY+AA GG  ALN+++SM   G+VKM ASEF  G+               G  +    EMGGFV+WQK P+LWCLEL++SGCK+  G
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED
        S+G+++WR +    + AS G PRPLRRFLQGLDP+STA LF ++TCIGEK IN EDCFILKLE    V  A+S  + EII HT+WGYFSQR+GLL++ ED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELED

Query:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEK
        S LLR++     ++D+FWET+ E+++ DYR +D VNIAH GKT+V++FR+GE++  H + +M E W IEEVDFN+ GLS+D FLPP++L+ EK
Subjt:  SHLLRIKAGGSRNDDIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGCTTTGTCCCAACTTCGACAGGGAAGACGGCCTCGACACTGTCCTTGAGGTTCCCATCCCCGAGGAGATGTTCTCTTGCAACACCAACAAAGCCCACGCTAT
CTCATGGCAAGCCATGAAAACATGGGTCAAATCCAACCATCACGACAAATCCTCACATGTTAATTCAATCGCCTCCTTATTCGGCGGCCGCAACGCCGAGATCCAGCTCC
TCCTCGGCGTCGTCGGAGCTCCCTTAATCCCACTCCCCATCATTTTTCACCACCAGCCCATTACTTGCAACATCAAAGACAATCCCATTGAGGCGTCAATGGCGAAGTAC
ATAGTGCAACAATACGTGGCCGCAGTGGGAGGGGAGCATGCCTTGAATTCGATTGATAGTATGTATGCCATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTTTCTTCCGG
CGAAGCGTGTTTGAATGGTAAAGCGACGAAGGTAAAGAACGGGAAAGGCGGCGGCGGCGTCGTCGGCGGCGGAGAAATGGGTGGGTTTGTAGTGTGGCAGAAACGGCCGG
AGCTATGGTGTTTGGAACTGATGCTGTCGGGGTGTAAAATCAGCGCCGGTAGCGACGGGAAAGTGGCTTGGAGACAAACTCCTTGGCATCACTCTCATGCTTCTCGTGGC
CCTCCACGTCCGCTCCGACGATTCTTGCAGGGACTGGATCCAAAATCGACGGCGACTCTGTTCTCAAACTCGACCTGCATCGGCGAGAAAACGATCAACGACGAAGATTG
CTTCATTCTAAAACTAGAAGCCGAATCGCCAGTCCTACGCGCAAGAAGCAGTAGCAGCGTTGAAATAATCCGCCACACAGTTTGGGGATATTTCAGCCAAAGAACCGGCC
TCCTCGTGGAGCTAGAAGATTCGCATCTCCTCCGAATCAAAGCCGGAGGATCACGAAACGACGATATATTCTGGGAGACGACAATGGAAACCCTAATTCAGGACTACAGA
ACGATCGACGGCGTAAACATTGCACACGCTGGAAAAACAACCGTCTCGCTTTTTCGATTTGGCGAAAGCGCTGAAGGCCATTCGAAAACGAAGATGGAAGAGACTTGGGA
GATCGAAGAGGTTGATTTTAATATCCAGGGTTTATCGATGGACTTCTTTTTACCTCCCAGCGATTTGAAGAAGGAGAAGGAAGGAGTTGGTGCAACGAGTAATGGAAAGT
TACCGTTGACGATGAGATGTGCGGCGGCGGCGGCTGCTGGTTCGAAGATTTGTTCGTCAAGAGTGGCGGCCATTGATGTCGATGAATCGGAGGGCAGTAATCAGAGTGAT
GAAGATGAAGAGGCAGAGCCGTGA
mRNA sequenceShow/hide mRNA sequence
TCATTTCTTTTCTTCTCTTCTCTTCTTCTTTCACTTCAAACTCCCTCGATAACCTCTCTCTCCCTCTCTCCCTCTCTCTCCGATGAGGAAGCTTTGTCCCAACTTCGACA
GGGAAGACGGCCTCGACACTGTCCTTGAGGTTCCCATCCCCGAGGAGATGTTCTCTTGCAACACCAACAAAGCCCACGCTATCTCATGGCAAGCCATGAAAACATGGGTC
AAATCCAACCATCACGACAAATCCTCACATGTTAATTCAATCGCCTCCTTATTCGGCGGCCGCAACGCCGAGATCCAGCTCCTCCTCGGCGTCGTCGGAGCTCCCTTAAT
CCCACTCCCCATCATTTTTCACCACCAGCCCATTACTTGCAACATCAAAGACAATCCCATTGAGGCGTCAATGGCGAAGTACATAGTGCAACAATACGTGGCCGCAGTGG
GAGGGGAGCATGCCTTGAATTCGATTGATAGTATGTATGCCATGGGGAAGGTGAAGATGGTGGCGTCGGAGTTTTCTTCCGGCGAAGCGTGTTTGAATGGTAAAGCGACG
AAGGTAAAGAACGGGAAAGGCGGCGGCGGCGTCGTCGGCGGCGGAGAAATGGGTGGGTTTGTAGTGTGGCAGAAACGGCCGGAGCTATGGTGTTTGGAACTGATGCTGTC
GGGGTGTAAAATCAGCGCCGGTAGCGACGGGAAAGTGGCTTGGAGACAAACTCCTTGGCATCACTCTCATGCTTCTCGTGGCCCTCCACGTCCGCTCCGACGATTCTTGC
AGGGACTGGATCCAAAATCGACGGCGACTCTGTTCTCAAACTCGACCTGCATCGGCGAGAAAACGATCAACGACGAAGATTGCTTCATTCTAAAACTAGAAGCCGAATCG
CCAGTCCTACGCGCAAGAAGCAGTAGCAGCGTTGAAATAATCCGCCACACAGTTTGGGGATATTTCAGCCAAAGAACCGGCCTCCTCGTGGAGCTAGAAGATTCGCATCT
CCTCCGAATCAAAGCCGGAGGATCACGAAACGACGATATATTCTGGGAGACGACAATGGAAACCCTAATTCAGGACTACAGAACGATCGACGGCGTAAACATTGCACACG
CTGGAAAAACAACCGTCTCGCTTTTTCGATTTGGCGAAAGCGCTGAAGGCCATTCGAAAACGAAGATGGAAGAGACTTGGGAGATCGAAGAGGTTGATTTTAATATCCAG
GGTTTATCGATGGACTTCTTTTTACCTCCCAGCGATTTGAAGAAGGAGAAGGAAGGAGTTGGTGCAACGAGTAATGGAAAGTTACCGTTGACGATGAGATGTGCGGCGGC
GGCGGCTGCTGGTTCGAAGATTTGTTCGTCAAGAGTGGCGGCCATTGATGTCGATGAATCGGAGGGCAGTAATCAGAGTGATGAAGATGAAGAGGCAGAGCCGTGAAGAA
ATATATTTAAATTTTGAAACTCAAATTTTTGAAAATTTGTACATAATCCATGATTTTATATATATGTAAAGAGAGAGCTCTCCATTTATAGACTTTTCCAGGTATGTAGG
CCTGGGCTTTAATGGTTTTGGGCCGAACCAATAAACTTGGGCTTTATGAACTTTGAGCCCAAGCCCTTTGAACTTGGGCTTTTGGCTCCCCTTCACTGTGAG
Protein sequenceShow/hide protein sequence
MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKAHAISWQAMKTWVKSNHHDKSSHVNSIASLFGGRNAEIQLLLGVVGAPLIPLPIIFHHQPITCNIKDNPIEASMAKY
IVQQYVAAVGGEHALNSIDSMYAMGKVKMVASEFSSGEACLNGKATKVKNGKGGGGVVGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRG
PPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVELEDSHLLRIKAGGSRNDDIFWETTMETLIQDYR
TIDGVNIAHAGKTTVSLFRFGESAEGHSKTKMEETWEIEEVDFNIQGLSMDFFLPPSDLKKEKEGVGATSNGKLPLTMRCAAAAAAGSKICSSRVAAIDVDESEGSNQSD
EDEEAEP