; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030299 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030299
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationchr8:46243971..46246451
RNA-Seq ExpressionLag0030299
SyntenyLag0030299
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602636.1 hypothetical protein SDJN03_07869, partial [Cucurbita argyrosperma subsp. sororia]1.6e-22289.29Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSH---LTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK+H  ISWQAMK+ +   NHH KSSH   + S+FGGRNA IQLLLGVVGAPLIPLPI F  QPIT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSH---LTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        +DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKM ASEFSSGE CLNGKA K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL
        EDSHLLRIKAGGSRND+IFWETTMETLIQDYRTID VNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL

Query:  IPTNGKFPLTMRC---SAAAGSKICSSRVAAIDVDESEGSNPSDEDED
          +NGK PLTMRC   +AAAGSKICSSRVAAIDVDESEGSN SDEDE+
Subjt:  IPTNGKFPLTMRC---SAAAGSKICSSRVAAIDVDESEGSNPSDEDED

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia]6.3e-22289.21Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKS--SHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNIR
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK+H  ISWQAMKS +  S++ HKS  + + ++FGGRNA IQLLLGVVGAPLIP+P+ FD +PIT NI+
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKS--SHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNIR

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
        DNPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKM ASEFSSGEG LNGK  KAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG

Query:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
        KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
Subjt:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL

Query:  LRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTNG
        LRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEEIWEIEEVDFNI+GLSMDFFLPPSDLKKEEEGV +I +NG
Subjt:  LRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTNG

Query:  KFPLTMRCSAAAGSKICSSRVAAIDVDE-SEG-SNPSDEDEDEDS
        KFPLTMRC+AAA SKICSSRVAAIDVDE SEG SN SDEDEDEDS
Subjt:  KFPLTMRCSAAAGSKICSSRVAAIDVDE-SEG-SNPSDEDEDEDS

XP_022957502.1 uncharacterized protein LOC111458877 [Cucurbita moschata]1.1e-22389.76Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK+H  ISWQAMK+ +  SNHH KSSH+ SI   FGGRNA IQLLLGVVGAPLIPLPI F  QPIT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        +DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKM ASEFSSGE CLNGKA K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL
        EDSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL

Query:  IPTNGKFPLTMRC--SAAAGSKICSSRVAAIDVDESEGSNPSDEDEDED
          +NGK PLTMRC  +AAAGSKICSSRVAAIDVDESEGSN SDEDE+ +
Subjt:  IPTNGKFPLTMRC--SAAAGSKICSSRVAAIDVDESEGSNPSDEDEDED

XP_022990734.1 uncharacterized protein LOC111487531 [Cucurbita maxima]4.3e-22390.48Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK+H  ISWQAMK+ +  SN HHKSSH+ SI   FGGRNA IQLLLGVVGAPLIPLPI F  QPIT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
        +DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKM ASEFSSGE CLNGKA K KNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD

Query:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH
        GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LEDSH
Subjt:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH

Query:  LLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTN
        LLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG   +N
Subjt:  LLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTN

Query:  GKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDEDED
        GK PLTMRC AA+GSKIC SRVAAIDVDESEGSN SDEDE+
Subjt:  GKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDEDED

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida]3.8e-21988.39Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSH---LTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDRE GLDTVLEVPIPEEMFS  T K+HT ISWQAMKS +  ++HH KSSH   ++S+FGGRNA IQLLLGVVGAPLIPLPITFD+QPIT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSH---LTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKA--KNGK--GGGGGEMGGFVVWQKRPELWCLELMLSGCKIS
        +DNPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMAASEF SGEGCLNGKA K    NGK  GGGGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKA--KNGK--GGGGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL
        EDSHLLRIK  GSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKEEEGVG+
Subjt:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL

Query:  IPTNGKFPLTMRCSAAAGSKICSSRVAAI---DVDESEGSNPSDEDED
        I +NGKFP+TMRCS  AGS++ SSRV AI   D DESE SN SDEDED
Subjt:  IPTNGKFPLTMRCSAAAGSKICSSRVAAI---DVDESEGSNPSDEDED

TrEMBL top hitse value%identityAlignment
A0A6J1BVR5 uncharacterized protein LOC1110061913.0e-22289.21Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKS--SHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNIR
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK+H  ISWQAMKS +  S++ HKS  + + ++FGGRNA IQLLLGVVGAPLIP+P+ FD +PIT NI+
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKS--SHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNIR

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
        DNPIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKM ASEFSSGEG LNGK  KAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG

Query:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
        KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
Subjt:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL

Query:  LRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTNG
        LRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEEIWEIEEVDFNI+GLSMDFFLPPSDLKKEEEGV +I +NG
Subjt:  LRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTNG

Query:  KFPLTMRCSAAAGSKICSSRVAAIDVDE-SEG-SNPSDEDEDEDS
        KFPLTMRC+AAA SKICSSRVAAIDVDE SEG SN SDEDEDEDS
Subjt:  KFPLTMRCSAAAGSKICSSRVAAIDVDE-SEG-SNPSDEDEDEDS

A0A6J1FGT4 uncharacterized protein LOC1114452681.5e-20884.27Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T KSH  ISWQAMKS +  S++++K SHLTSI   FGGRNA IQLLLGVVGAPLIPLPI F  Q IT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGS
        +DNPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKM ASEF+SGEGC NGK  KAKNGKGG   GEMG FV+WQKRP+LWCLE+MLSGCKISAGS
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGS

Query:  DGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDS
        DGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKT+N EDCFILKLEAESSVLRARSSS VEIIRHTVWGYFSQRTGLLV LEDS
Subjt:  DGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDS

Query:  HLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPT
        HLLRIK GGSRNDN+FWETTME+ IQDYRTIDGVNIAHAGKTTVSL RFG+ AEGHSKTKMEEIW+IEEVDFNI+GLSM+FFLPPSDLKKEEE +G I +
Subjt:  HLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPT

Query:  NGKFPLTM-RCSAAAGSKICSSRVAAIDVDESEGSNPSDEDEDED
        + KFPL M R SA AGS+I  SRVAA+D DESEGS+ SDED+D+D
Subjt:  NGKFPLTM-RCSAAAGSKICSSRVAAIDVDESEGSNPSDEDEDED

A0A6J1GZA9 uncharacterized protein LOC1114588775.5e-22489.76Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK+H  ISWQAMK+ +  SNHH KSSH+ SI   FGGRNA IQLLLGVVGAPLIPLPI F  QPIT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKIS
        +DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKM ASEFSSGE CLNGKA K KNGKGG    GGGEMGGFVVWQKRPELWCLELMLSGCKIS
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG----GGGEMGGFVVWQKRPELWCLELMLSGCKIS

Query:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL
        AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+L
Subjt:  AGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQL

Query:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL
        EDSHLLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG 
Subjt:  EDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGL

Query:  IPTNGKFPLTMRC--SAAAGSKICSSRVAAIDVDESEGSNPSDEDEDED
          +NGK PLTMRC  +AAAGSKICSSRVAAIDVDESEGSN SDEDE+ +
Subjt:  IPTNGKFPLTMRC--SAAAGSKICSSRVAAIDVDESEGSNPSDEDEDED

A0A6J1JRQ6 uncharacterized protein LOC1114891901.3e-20482.7Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDREDGLDTVLEVPIPEEMFSC T KSH  ISWQAMKS +  S++++K SH+ SI   FGGRNA IQLLLGVVGAPLIPLPI F  Q IT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGS
        +DNPIEASMAKYIVQQY+AAVGGEHALNSI SMYAMGKVKM ASEF+SGEG  NGK  KAKNGK G   GEMG FV+WQKRP+LWCLE+MLSGCKISAGS
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGG-GGGEMGGFVVWQKRPELWCLELMLSGCKISAGS

Query:  DGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDS
        DGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKT+N EDCFILKLEAESSVLRARSSS VEIIRHTVWGYFSQRTGLLV LEDS
Subjt:  DGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDS

Query:  HLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPT
        HLLRIK GGSRND++FWETTME+ I+DYRTIDGVNIAHAGKTTVSL RFG+ AEGHSKTKMEEIW+IEEVDFNI+GLSM+FFLPPSDLKKEEE +G++ +
Subjt:  HLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPT

Query:  NGKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDEDEDEDS
        + KFP TMR ++ AGS+I SSRVAA+D DESEGS+ SDEDED+DS
Subjt:  NGKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDEDEDEDS

A0A6J1JSU8 uncharacterized protein LOC1114875312.1e-22390.48Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI
        MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNK+H  ISWQAMK+ +  SN HHKSSH+ SI   FGGRNA IQLLLGVVGAPLIPLPI F  QPIT NI
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSI---FGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNI

Query:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
        +DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKM ASEFSSGE CLNGKA K KNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
Subjt:  RDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD

Query:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH
        GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTC+GEKTINDEDCFIL LEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LEDSH
Subjt:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH

Query:  LLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTN
        LLRIKAGGSRND+IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNIQGLSMDFFLPPSDLKKE+EGVG   +N
Subjt:  LLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTN

Query:  GKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDEDED
        GK PLTMRC AA+GSKIC SRVAAIDVDESEGSN SDEDE+
Subjt:  GKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDEDED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49840.1 Protein of unknown function (DUF620)5.3e-9446.43Show/hide
Query:  RNAHIQLLLGVVGAPLIPLPITFDRQPITSNIRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGG
        R + ++LLLGV+GAPL P+ ++     +   IRD+P E S A+YI+QQY AA GG    N+I + YAMGK+KM  SE  +  G +  + +          
Subjt:  RNAHIQLLLGVVGAPLIPLPITFDRQPITSNIRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGG

Query:  GEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLR
         E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR TPW  SH ++GP RPLRR LQGLDP++TAT+F+ S C+GE+ +N EDCFILKL  +   LR
Subjt:  GEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLR

Query:  ARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEE
        ARS    EI+RH ++GYFSQRTGLL Q+EDS L RI++  +  D ++WETT+ + + DY+ ++G+ IAH+G++ V+LFRFGE A  H++TKMEE W IEE
Subjt:  ARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEE

Query:  VDFNIQGLSMDFFLPPSDLKK----EEEGVGLIPTNGKFPLTMRCSAAAGSKICSSRVAAIDVD
        V FN+ GLS+D F+PP+DL+     E          GK  L +  + A  +K+ +    + D D
Subjt:  VDFNIQGLSMDFFLPPSDLKK----EEEGVGLIPTNGKFPLTMRCSAAAGSKICSSRVAAIDVD

AT1G75160.1 Protein of unknown function (DUF620)3.8e-10851.49Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFS-CNTN------------KSHTPISWQA--MKSCLNYSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIP--
        MRKLCPN DREDGL+TVLEVP+PEEMF+   +N            K+H  ++  A  +++  + S+  + + HL S     +     LL +VG+PLIP  
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFS-CNTN------------KSHTPISWQA--MKSCLNYSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIP--

Query:  LPITFDRQPITSNIRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCL
        +P+ F    ++  I D  IEAS AKYIVQQYVAA GG  ALN++ SMYA+G+V+M  SE  +GE    G     + GK  G  E+GGFV+WQK P LW L
Subjt:  LPITFDRQPITSNIRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCL

Query:  ELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYF
        EL++SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWGYF
Subjt:  ELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYF

Query:  SQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSD
        SQRTGLLV+  D+ L+R+K+G  +ND +FWET+ME++I DY  +D VNIAH G+T  +L+R+G     + + ++EE W IEEVDFNI GL ++ FLPPSD
Subjt:  SQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSD

Query:  LKKE
        +  +
Subjt:  LKKE

AT3G55720.1 Protein of unknown function (DUF620)3.6e-13557.56Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDR----QPITSN
        MR LCPNFDREDGL+TVLEVP+PEE+F  + NKS    +W+++KS L  S   + SS L ++FGGR++ IQ+LLG+VGAP IPLPI+ D+     PI++ 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDR----QPITSN

Query:  IRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAK------NGKGGGGGEMGGFVVWQKRPELWCLELMLSGC
        I++  IE++MAKYIV+QY AA GGE AL++++SMYAMGKVKM  +EF + +  LNGK  K        N   G GGEMGGFV+W+K    W LEL++SGC
Subjt:  IRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAK------NGKGGGGGEMGGFVVWQKRPELWCLELMLSGC

Query:  KISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
        K+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE + S L++RS S +E ++HTVWG F QRTGL
Subjt:  KISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDL-KKEE
        LVQLED++L+RIK G    D + WETT ETLIQDY++IDG+ IAH GKT VSL R  E+ E HSKT MEE WEIEEV FN++GLS DFFLPP DL  KEE
Subjt:  LVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDL-KKEE

Query:  EGVGLIPTNGKFPLTMRCSAAAGS-KICSSRVAAI-DVDESEG
        E  G    +   P+ +    +  S KI SS+V AI D  E EG
Subjt:  EGVGLIPTNGKFPLTMRCSAAAGS-KICSSRVAAI-DVDESEG

AT5G05840.1 Protein of unknown function (DUF620)2.8e-15965.26Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLN----YSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFD-----RQ
        MRKLCPN++ EDGL+TVLEVP+PEE+F+     S T   W  MKS  +     +     ++++T +FGGRNA IQLLLGVVGAPLIPLP+  D       
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLN----YSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFD-----RQ

Query:  PITSNIRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCK
        PI  +I+D P+E SMA+YIV+QY+AAVGG+ ALN+++SMYAMGKV+M ASEF +GEG LN K  KA++ K  GGGE+GGFV+WQK  ELWCLEL++SGCK
Subjt:  PITSNIRDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCK

Query:  ISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLV
        ISAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE S L+ARSSS+VEIIRHTVWG FSQRTGLL+
Subjt:  ISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLV

Query:  QLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKK---EE
        QLEDSHLLRIKA    +++IFWETTME+LIQDYRT+DG+ +AHAGK++VSLFRFGE ++ HS+T+MEE WEIEE+DFNI+GLSMD FLPPSDLKK   EE
Subjt:  QLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKK---EE

Query:  EGV--GLIPTNGKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDE
        E +  GL   N K P+ +R   +A  +I SS+V AI V+E + S  ++E
Subjt:  EGV--GLIPTNGKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDE

AT5G66740.1 Protein of unknown function (DUF620)3.1e-11052.32Show/hide
Query:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNIRDN
        MRKLCPN D++DGL+TVLEVPIPEEMFS   N  +  + WQ M + +        S  L      R   ++ LL +VG+PLIPL +      +   ++D 
Subjt:  MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNIRDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV
         I+AS AKYIVQQY+AA GG  ALN+++SM   G+VKM ASEF  G+   +G   K+ +       EMGGFV+WQK P+LWCLEL++SGCK+  GS+G++
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKV

Query:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR
        +WR +    + AS G PRPLRRFLQGLDP+STA LF ++TCIGEK IN EDCFILKLE   +V  A+S  + EII HT+WGYFSQR+GLL+Q EDS LLR
Subjt:  AWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLR

Query:  IKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEE
        ++     ++++FWET+ E+++ DYR +D VNIAH GKT+V++FR+GE +  H + +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  IKAGGSRNDNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGCTCTGTCCCAACTTCGACCGAGAGGATGGCCTCGACACTGTCCTCGAGGTTCCCATCCCCGAGGAGATGTTCTCTTGCAACACCAACAAGTCCCACACGCC
CATCTCATGGCAAGCCATGAAATCATGCCTCAACTACTCCAATCATCACCACAAATCATCGCACCTTACTTCCATCTTCGGCGGCCGCAACGCCCACATCCAGCTCCTCC
TCGGCGTCGTCGGAGCTCCGTTAATCCCTCTCCCCATCACTTTCGATCGCCAACCCATTACGAGCAACATCAGAGACAATCCCATTGAGGCGTCCATGGCCAAGTACATA
GTGCAGCAATACGTGGCTGCAGTGGGAGGAGAACATGCGTTGAATTCCATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGCGGCGTCGGAGTTTTCCTCCGGCGA
AGGCTGTTTGAATGGAAAAGCGGCCAAGGCCAAGAACGGGAAAGGCGGTGGCGGCGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAGCGGCCGGAGTTATGGTGCTTGG
AATTGATGCTGTCGGGCTGTAAAATCAGCGCCGGCAGCGACGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCACTCTCATGCTTCTCGTGGCCCTCCTCGTCCCCTC
CGGCGATTCCTGCAGGGACTGGATCCAAAATCGACGGCGACTCTGTTCTCAAACTCCACCTGCATCGGCGAGAAAACAATCAACGACGAAGATTGCTTCATTCTAAAGCT
AGAAGCCGAATCCTCAGTTCTGAGAGCAAGAAGCAGTAGCAGCGTCGAGATAATCCGCCACACAGTTTGGGGATATTTCAGCCAGAGAACCGGCCTCCTAGTGCAGCTCG
AAGATTCGCATCTTCTCCGAATCAAAGCCGGCGGATCTCGAAACGACAACATCTTCTGGGAAACCACCATGGAAACCCTAATTCAGGACTACAGAACGATCGACGGAGTC
AACATCGCACACGCCGGAAAAACGACCGTCTCGCTTTTCCGATTCGGCGAAACCGCTGAAGGCCATTCGAAAACCAAGATGGAGGAGATTTGGGAGATCGAAGAGGTGGA
TTTCAATATCCAGGGCTTGTCCATGGATTTCTTTTTGCCTCCGAGTGATTTGAAGAAGGAGGAGGAAGGAGTTGGTTTGATTCCGACTAATGGAAAGTTTCCGTTGACGA
TGAGGTGTTCGGCTGCTGCTGGTTCCAAGATTTGTTCGTCTAGAGTGGCGGCCATTGATGTTGATGAATCGGAGGGGAGTAATCCGAGTGATGAAGATGAAGATGAAGAT
TCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGCTCTGTCCCAACTTCGACCGAGAGGATGGCCTCGACACTGTCCTCGAGGTTCCCATCCCCGAGGAGATGTTCTCTTGCAACACCAACAAGTCCCACACGCC
CATCTCATGGCAAGCCATGAAATCATGCCTCAACTACTCCAATCATCACCACAAATCATCGCACCTTACTTCCATCTTCGGCGGCCGCAACGCCCACATCCAGCTCCTCC
TCGGCGTCGTCGGAGCTCCGTTAATCCCTCTCCCCATCACTTTCGATCGCCAACCCATTACGAGCAACATCAGAGACAATCCCATTGAGGCGTCCATGGCCAAGTACATA
GTGCAGCAATACGTGGCTGCAGTGGGAGGAGAACATGCGTTGAATTCCATTGATAGTATGTATGCGATGGGGAAGGTGAAGATGGCGGCGTCGGAGTTTTCCTCCGGCGA
AGGCTGTTTGAATGGAAAAGCGGCCAAGGCCAAGAACGGGAAAGGCGGTGGCGGCGGAGAGATGGGTGGGTTTGTGGTGTGGCAGAAGCGGCCGGAGTTATGGTGCTTGG
AATTGATGCTGTCGGGCTGTAAAATCAGCGCCGGCAGCGACGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCACTCTCATGCTTCTCGTGGCCCTCCTCGTCCCCTC
CGGCGATTCCTGCAGGGACTGGATCCAAAATCGACGGCGACTCTGTTCTCAAACTCCACCTGCATCGGCGAGAAAACAATCAACGACGAAGATTGCTTCATTCTAAAGCT
AGAAGCCGAATCCTCAGTTCTGAGAGCAAGAAGCAGTAGCAGCGTCGAGATAATCCGCCACACAGTTTGGGGATATTTCAGCCAGAGAACCGGCCTCCTAGTGCAGCTCG
AAGATTCGCATCTTCTCCGAATCAAAGCCGGCGGATCTCGAAACGACAACATCTTCTGGGAAACCACCATGGAAACCCTAATTCAGGACTACAGAACGATCGACGGAGTC
AACATCGCACACGCCGGAAAAACGACCGTCTCGCTTTTCCGATTCGGCGAAACCGCTGAAGGCCATTCGAAAACCAAGATGGAGGAGATTTGGGAGATCGAAGAGGTGGA
TTTCAATATCCAGGGCTTGTCCATGGATTTCTTTTTGCCTCCGAGTGATTTGAAGAAGGAGGAGGAAGGAGTTGGTTTGATTCCGACTAATGGAAAGTTTCCGTTGACGA
TGAGGTGTTCGGCTGCTGCTGGTTCCAAGATTTGTTCGTCTAGAGTGGCGGCCATTGATGTTGATGAATCGGAGGGGAGTAATCCGAGTGATGAAGATGAAGATGAAGAT
TCGTGA
Protein sequenceShow/hide protein sequence
MRKLCPNFDREDGLDTVLEVPIPEEMFSCNTNKSHTPISWQAMKSCLNYSNHHHKSSHLTSIFGGRNAHIQLLLGVVGAPLIPLPITFDRQPITSNIRDNPIEASMAKYI
VQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFSSGEGCLNGKAAKAKNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPL
RRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRNDNIFWETTMETLIQDYRTIDGV
NIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIQGLSMDFFLPPSDLKKEEEGVGLIPTNGKFPLTMRCSAAAGSKICSSRVAAIDVDESEGSNPSDEDEDED
S