; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027636 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027636
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein LIKE COV 1-like isoform X1
Genome locationtig00153055:1353032..1373446
RNA-Seq ExpressionSgr027636
SyntenySgr027636
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007462 - Protein of unknown function DUF502


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591322.1 hypothetical protein SDJN03_13668, partial [Cucurbita argyrosperma subsp. sororia]8.1e-27871.11Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        MESDDDFQ LS+P+V SPLV+GRKLKRLKKAS +SE+LPR+DDQ  SGVLG+F  IDDRFDD  KM ELSAAE  ADDS+K NGQDLDDSDELR+SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL
        RDLD+G NLE  LGLD +EN SG EK LEFDAVAGI+E  EDQ   +GEESGD++VD    +R +   +   R  + ++  N+R   S   G PE     
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL

Query:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEK
           K   E  RREYVEQLRAESQRLLRDTRGAAFKPMPLV+KPISSVLEKIR+RKLELS KSIN+EN ILD  +DDN L TEVVIKHRL+VEGR DS+E+
Subjt:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEK

Query:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------
        E EDMDQ PAD+ N+K SMCIDERSNGTN+ SER  ATD+VTEAF  PINDTQ +FS                                           
Subjt:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------

Query:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA
             SHLQENFTPS+LAMNLRLDSAALD+DSDEEDNDKENVNPHPHG S+LP+ A+GDPVKAFVDDEAEEEDDSDHDMRFQDE+EDE TDSEEL+DMIA
Subjt:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA

Query:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS
        TAYEENPLD+EKRNELHQKWLE +DAAGTEDLLQKLK                           +AAEDLLPLNVARMNIRKVKQMLPQ+YTD DDQYMS
Subjt:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS

Query:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF
        DDEETERR+ RERV  KAE KSTFLSPAEDESTREVFGLIKKLNVV DVKKRPKAQSFLDPPLTGVGKN +SKSSFLGRSSNLSLSSS KHGSS NSRSF
Subjt:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF

Query:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM
        IFGRDDSN++SAI  +EESSD  QSENKP R   AKFSYSQV+PSAQN    E KSG+SLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHM
Subjt:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM

Query:  KKPIKSEGRF
        KKPIK+EGRF
Subjt:  KKPIKSEGRF

KAG7024204.1 hypothetical protein SDJN02_13018 [Cucurbita argyrosperma subsp. argyrosperma]8.1e-27871.11Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        MESDDDFQ LS+P+V SPLV+GRKLKRLKKAS +SE+LPR+DDQ  SGVLG+F  IDDRFDD  KM ELSAAE  ADDS+K NGQDLDDSDELR+SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL
        RDLD+G NLE  LGLD +EN SG EK LEFDAVAGI+E  EDQ   +GEESGD++VD    +R +   +   R  + ++  N+R   S   G PE     
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL

Query:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILD-DSEDDNLFTEVVIKHRLAVEGREDSLEK
           K   E  RREYVEQLRAESQRLLRDTRGAAFKPMPLV+KPISSVLEKIR+RKLELS KSIN+EN ILD D  D+ L TEVVIKHRL+VEGR DS+E+
Subjt:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILD-DSEDDNLFTEVVIKHRLAVEGREDSLEK

Query:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------
        E EDMDQ PAD+ N+K SMCIDERSNGTN+PSER  ATD+VTEAF  PINDTQ +FS                                           
Subjt:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------

Query:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA
             SHLQENFTPS+LAMNLRLDSAALD+DSDEEDNDKENVNPHPHG S+LP+ A+GDPVKAFVDDEAEEEDDSDHDMRFQDE+EDE TDSEEL+DMIA
Subjt:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA

Query:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS
        TAYEENPLD+EKRNELHQKWLE +DAAGTEDLLQKLK                           +AAEDLLPLNVARMNIRKVKQMLPQ+YTD DDQYMS
Subjt:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS

Query:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF
        DDEETERR+ RERV  KAE KSTFLSPAEDESTREVFGLIKKLNVV DVKKRPKAQSFLDPPLTGVGKN +SKSSFLGRSSNLSLSSS KHGSS NSRSF
Subjt:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF

Query:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM
        IFGRDDSN++SAI  +EESSD  QSENKP R   AKFSYSQV+PSAQN    E KSG+SLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHM
Subjt:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM

Query:  KKPIKSEGRF
        KKPIK+EGRF
Subjt:  KKPIKSEGRF

XP_022136352.1 uncharacterized protein LOC111008080 [Momordica charantia]9.2e-28271.41Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        MESDDDFQFLS+PE ASPLVSGRKLKRLKKAS VSEDLP IDDQS SGVLGDF  IDDRFD  ++MH LSA+E   D+ NKL+GQDL DSDEL++SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGT-PEISVR
        RDLD GDNLE ++GLD EEN SGVEK  EFDA AGI++  EDQ  G+GEE+GD ++D    +R +   +   R  + ++  N+R   S   G   E +V 
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGT-PEISVR

Query:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDNLFTEVVIKHRLAVEGREDSLEKERE
         R  K+  RREYVEQLRAESQRLLRDTRGAAFKP+P VQKPISSVLEKIR+RKLELS KSI V+NI+ D  + D LF+EVVIKHRL+VEGREDSLE+E E
Subjt:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDNLFTEVVIKHRLAVEGREDSLEKERE

Query:  DMDQCPADVENQKDSMCIDERSNGTNLPSER---ATDEVTEAFRAPINDTQ-VFS--------------------------------------------F
        DM+Q PAD+EN++DSMCIDER+NGTN+PSER     +EVTEAFRAPINDTQ +FS                                             
Subjt:  DMDQCPADVENQKDSMCIDERSNGTNLPSER---ATDEVTEAFRAPINDTQ-VFS--------------------------------------------F

Query:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY
          SHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHG SDLP+SA GDPVKAFVDDEAEEEDDSDHDMRFQDEEED D+DSEEL+DMIATAY
Subjt:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY

Query:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK--------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDEE
        EENPLD+E+RNELHQKWLE QDAAGTE+LLQKLK                           AAEDLLPLNVARMNIRKVKQMLPQ+YTDKDDQY+SDDEE
Subjt:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK--------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDEE

Query:  TERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFGR
        TERRLARERVFDKAEGKSTFLSPAEDE TREVFGLIKKLNVVPD KKR KA +FLDPPL GV +NTSSKSSFLGRSS+LSLSSSHK+GSSTN RSFIFGR
Subjt:  TERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFGR

Query:  DDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSG--TSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKK
        DDSN+RSAIA  EE SDP QSE +  R   AKFSYSQV+PS QN TAPEIKSG  TSLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHMKK
Subjt:  DDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSG--TSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKK

Query:  PIKSEGRF
        PIKSEGRF
Subjt:  PIKSEGRF

XP_022937257.1 uncharacterized protein LOC111443597 [Cucurbita moschata]1.2e-27871.23Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        MESDDDFQ LS+P+V SPLV+GRKLKRLKKAS +SE+LPR+DDQ  SGVLG+F  IDDRFDD  KM ELSAAE  ADDS+K NGQDLDDSDELR+SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL
        RDLD+G NLE  LGLD +EN SG EK LEFDAVAGI+E  EDQ   +GEESGDA+VD    +R +   +   R  + ++  N+R   S   G PE     
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL

Query:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEK
           K   E  RREYVEQLRAESQRLLRDTRGAAFKPMPLV+KPISSVLEKIR+RKLELS KSIN+EN ILD ++DDN L TEVVIKHRL+VEGR DS+E+
Subjt:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEK

Query:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------
        E EDMDQ PAD+ N+K SMCIDERSNGTN+PSER  AT++ TEAF  PINDTQ +FS                                           
Subjt:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------

Query:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA
             SHLQENFTPS+LAMNLRLDSAALD+DSDEEDNDKENVNPHPHG S+LP+ A+GDPVKAFVDDEAEEEDDSDHDMRFQDE+EDE TDSEEL+DMIA
Subjt:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA

Query:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS
        TAYEENPLD+EKRNELHQKWLE +DAAGTEDLLQKLK                           +AAEDLLPLNVARMNIRKVKQMLPQ+YTD DDQYMS
Subjt:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS

Query:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF
        DDEETERR+ RERV  KAE KSTFLSPAEDESTREVFGLIKKLNVV DVKKRPKAQSFLDPPLTGVGKN +SKSSFLGRSSNLSLSSS KHGSS NSRSF
Subjt:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF

Query:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM
        IFGRDDSN++SAI  +EESSD  QSENKP R   AKFSYSQVKPSAQN    E KSG+SLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHM
Subjt:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM

Query:  KKPIKSEGRF
        KKPIK+EGRF
Subjt:  KKPIKSEGRF

XP_038903146.1 putative leucine-rich repeat-containing protein DDB_G0290503 [Benincasa hispida]7.6e-27669.63Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        M+SDDD Q LS+P + SPLVSGRKLKRLKKAS  S+DL  ID++  SG+LG+F  IDDR DDG K+ ELSA E  A+DS+KLNGQDLD+ DE+++SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL
        RDLD+G NL + LGLDGEEN S VEK LEFDAVAG +E  EDQ  G+G ESGDA+VD    +R + + +   R  + ++  N+R   S   G    +   
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL

Query:  RPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIIL--DDSEDDNL-FTEVVIKHRLAVEGREDSLEKE
        +   E  RREYV+QLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELS KSIN+ N IL  DD +DDN  FTEVVIKHRL+VEGR DS+EKE
Subjt:  RPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIIL--DDSEDDNL-FTEVVIKHRLAVEGREDSLEKE

Query:  REDMDQCPADVENQKDSMCIDERSNGTNLP--SERATDEVTEAFRAPINDTQ-VFS--------------------------------------------
         E+MDQ PAD +NQ+DSMCIDERSNG N+P   ERATDEVTE FRAP+NDTQ +FS                                            
Subjt:  REDMDQCPADVENQKDSMCIDERSNGTNLP--SERATDEVTEAFRAPINDTQ-VFS--------------------------------------------

Query:  -FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIAT
            SHLQENFTPS+LAMNLRLDSAALD+DSDEEDNDKENVNPHPHG SDLP+SA+GDPVKAFVDDEAEEEDDSDHDMRFQD+EEDEDTD EEL+DMIAT
Subjt:  -FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIAT

Query:  AYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSD
        AYEENPLD+EKRNELHQKWLE QDAAGTEDLLQKLK                            AAED LPL+VARMNIRKVK+MLPQ+YTDKDD Y+SD
Subjt:  AYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSD

Query:  DEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFI
        D+ET+R+LARERVFDKA+GKSTFLSPAEDESTREVFGLIKKLNVVPDVKKR KAQ F D PLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSS N+RSFI
Subjt:  DEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFI

Query:  FGRDDSNTRSAIAPIEESSDPVQSENKPA----RAKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM
        FGRDDSN+RSAI  +EE+SD  QSEN  A     AKFSYSQV+PSAQN T PEIKSG+SLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHM
Subjt:  FGRDDSNTRSAIAPIEESSDPVQSENKPA----RAKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM

Query:  KKPIKSEGRF
        KKPIK+EGRF
Subjt:  KKPIKSEGRF

TrEMBL top hitse value%identityAlignment
A0A0A0LD80 Uncharacterized protein4.3e-26968.52Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKK-ASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSG
        M+SDDDFQ LS+P++ SPLVSGRKLKRLKK A+  S+ LP+ID Q   G LG+F  IDDRFDDG+K+ ELSA +  A+DS+KL GQDLDDSD+L++SGSG
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKK-ASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSG

Query:  SRDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVR
        S DLD+G NLE+ LGLDG+E  SGV K LEFDAVAGI E   DQ  G+G ESGDA+VD    +R +   +   R  + ++  N+R   S   G    +  
Subjt:  SRDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVR

Query:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIIL--DDSEDDNL-FTEVVIKHRLAVEGREDSLEK
         +   E  RREYV QLRAESQRLLRDTRGA FKPMP+VQKPISSVLEKIRRRKLELS KSIN+EN IL  DD +DDN  F +VV KHRL+VEGR DS+EK
Subjt:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIIL--DDSEDDNL-FTEVVIKHRLAVEGREDSLEK

Query:  EREDMDQCPADVENQKDSMCIDERSNGTNLP--SERATDEVTEAFRAPINDTQ-VFS-------------------------------------------
        E  DMD+ PAD EN+KD+MCI ERSNGTN+P   ERATDEVTE FRAP+NDTQ +FS                                           
Subjt:  EREDMDQCPADVENQKDSMCIDERSNGTNLP--SERATDEVTEAFRAPINDTQ-VFS-------------------------------------------

Query:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA
             SHLQENFTPS+LAMNLRLDSAALDD  +EEDNDKENVNPHPHG SDLP+SA+GDPVKAFVDDEAEEEDDSDHDMRFQD+EED+D D EEL+DMIA
Subjt:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA

Query:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS
        TAY+ENPLD+EKRNELHQKWLE QDAAGTEDLLQKLK                           + AED LPL+VARMNIRKVKQMLPQ+YTDKDD YMS
Subjt:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS

Query:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF
        DDEETERRL RERVFDKA+GKSTFLSPAE ESTREVFGLIKKLNVVPDVKKRPKAQ F DPPLTGVGKNTSSKSSFLGRSSN S SSSHKHGSSTNSRSF
Subjt:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF

Query:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM
        IFGRDD+N+RS+I  +EESSD  Q+ENK  R   AKFSYSQV+PSAQNS   EIKSG+SLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHM
Subjt:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM

Query:  KKPIKSEGRF
        KKPIK+EGRF
Subjt:  KKPIKSEGRF

A0A6J1C5B3 uncharacterized protein LOC1110080804.5e-28271.41Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        MESDDDFQFLS+PE ASPLVSGRKLKRLKKAS VSEDLP IDDQS SGVLGDF  IDDRFD  ++MH LSA+E   D+ NKL+GQDL DSDEL++SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGT-PEISVR
        RDLD GDNLE ++GLD EEN SGVEK  EFDA AGI++  EDQ  G+GEE+GD ++D    +R +   +   R  + ++  N+R   S   G   E +V 
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGT-PEISVR

Query:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDNLFTEVVIKHRLAVEGREDSLEKERE
         R  K+  RREYVEQLRAESQRLLRDTRGAAFKP+P VQKPISSVLEKIR+RKLELS KSI V+NI+ D  + D LF+EVVIKHRL+VEGREDSLE+E E
Subjt:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDNLFTEVVIKHRLAVEGREDSLEKERE

Query:  DMDQCPADVENQKDSMCIDERSNGTNLPSER---ATDEVTEAFRAPINDTQ-VFS--------------------------------------------F
        DM+Q PAD+EN++DSMCIDER+NGTN+PSER     +EVTEAFRAPINDTQ +FS                                             
Subjt:  DMDQCPADVENQKDSMCIDERSNGTNLPSER---ATDEVTEAFRAPINDTQ-VFS--------------------------------------------F

Query:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY
          SHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHG SDLP+SA GDPVKAFVDDEAEEEDDSDHDMRFQDEEED D+DSEEL+DMIATAY
Subjt:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY

Query:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK--------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDEE
        EENPLD+E+RNELHQKWLE QDAAGTE+LLQKLK                           AAEDLLPLNVARMNIRKVKQMLPQ+YTDKDDQY+SDDEE
Subjt:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK--------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDEE

Query:  TERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFGR
        TERRLARERVFDKAEGKSTFLSPAEDE TREVFGLIKKLNVVPD KKR KA +FLDPPL GV +NTSSKSSFLGRSS+LSLSSSHK+GSSTN RSFIFGR
Subjt:  TERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFGR

Query:  DDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSG--TSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKK
        DDSN+RSAIA  EE SDP QSE +  R   AKFSYSQV+PS QN TAPEIKSG  TSLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHMKK
Subjt:  DDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSG--TSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKK

Query:  PIKSEGRF
        PIKSEGRF
Subjt:  PIKSEGRF

A0A6J1FAP3 uncharacterized protein LOC1114435976.0e-27971.23Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        MESDDDFQ LS+P+V SPLV+GRKLKRLKKAS +SE+LPR+DDQ  SGVLG+F  IDDRFDD  KM ELSAAE  ADDS+K NGQDLDDSDELR+SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL
        RDLD+G NLE  LGLD +EN SG EK LEFDAVAGI+E  EDQ   +GEESGDA+VD    +R +   +   R  + ++  N+R   S   G PE     
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL

Query:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEK
           K   E  RREYVEQLRAESQRLLRDTRGAAFKPMPLV+KPISSVLEKIR+RKLELS KSIN+EN ILD ++DDN L TEVVIKHRL+VEGR DS+E+
Subjt:  RPPK---EFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEK

Query:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------
        E EDMDQ PAD+ N+K SMCIDERSNGTN+PSER  AT++ TEAF  PINDTQ +FS                                           
Subjt:  EREDMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS-------------------------------------------

Query:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA
             SHLQENFTPS+LAMNLRLDSAALD+DSDEEDNDKENVNPHPHG S+LP+ A+GDPVKAFVDDEAEEEDDSDHDMRFQDE+EDE TDSEEL+DMIA
Subjt:  --FRGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIA

Query:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS
        TAYEENPLD+EKRNELHQKWLE +DAAGTEDLLQKLK                           +AAEDLLPLNVARMNIRKVKQMLPQ+YTD DDQYMS
Subjt:  TAYEENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMS

Query:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF
        DDEETERR+ RERV  KAE KSTFLSPAEDESTREVFGLIKKLNVV DVKKRPKAQSFLDPPLTGVGKN +SKSSFLGRSSNLSLSSS KHGSS NSRSF
Subjt:  DDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSF

Query:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM
        IFGRDDSN++SAI  +EESSD  QSENKP R   AKFSYSQVKPSAQN    E KSG+SLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHM
Subjt:  IFGRDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHM

Query:  KKPIKSEGRF
        KKPIK+EGRF
Subjt:  KKPIKSEGRF

A0A6J1FJZ1 uncharacterized protein LOC1114461261.5e-26968.9Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKAS-AVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSG
        M+SDDDFQ LS+PE+ SPLVSGRKLKRLKK S AVSED PR DD+   G +G+F  IDDRFD+   M ELSA E  ADDS+ LNGQDL+DSDEL++SGSG
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKAS-AVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSG

Query:  SRDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVR
        SRDLD G +LE  LGLDGEEN SGVEK LEFDAVAG++E  EDQ  GVGEESG   +D    +R +   +   R  + ++  N+R   S   G    +  
Subjt:  SRDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVR

Query:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDNLFTEVVIKHRLAVEGREDSLEKERE
         +   E  RREYVEQLRAESQRLLRDTRGAAFKPMPLV+KPISSVLEKIR RKLELS KS  +EN I DD +DD  FTEVVIKHRL+VEGR DS++KE E
Subjt:  LRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDNLFTEVVIKHRLAVEGREDSLEKERE

Query:  DMDQCPADVENQKDSMCIDERSNGTNLPS--ERATDEVTEAFRAPINDTQ-VFS---------------------------------------------F
        D+DQ  ADVE+QKDS+CIDERSNG N+PS  ERATD ++EAFR+P+NDTQ +FS                                              
Subjt:  DMDQCPADVENQKDSMCIDERSNGTNLPS--ERATDEVTEAFRAPINDTQ-VFS---------------------------------------------F

Query:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY
          S LQENFTPS+LAMNLRLDSAA+DD+SDEEDNDKENVNP PH SS+LP+S +GDPVKAFVDDEAEEEDDSDHDMRFQDEEEDED+DSEEL+DMIATAY
Subjt:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY

Query:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDE
        EENPLD+E+RNELHQKWLE QDAAGTEDLLQKLK                           +A EDLLPLNVARMNIRKVKQMLPQ+YTD+DDQYMSDDE
Subjt:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDE

Query:  ETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFG
        ETERRLARE +FDKA+GKSTFLSPAEDEST++VFGLIKKLNVVPDVKKRPKAQSF DP L+G+GKNTSSKSSFLGRSSN SLSSSHKHGSSTN RSFIFG
Subjt:  ETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFG

Query:  RDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKKP
        RD+SN+RSAI  +EESS   Q ENKP R   AKFSYSQV+PSAQN TAPE KSG+SLFDILRQSSLQLQ RKPCTF EES+QMSSAFASFKLEKTHMKK 
Subjt:  RDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKKP

Query:  IKSEGRF
        IK+EGRF
Subjt:  IKSEGRF

A0A6J1ILI0 uncharacterized protein LOC1114766932.6e-27470.38Show/hide
Query:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS
        MESDDDFQ LS+P+V SPLV+GRKLKRLKKAS +SE+LP +DDQ  SGVLG+F  IDDRFDD  KM ELSAAE  ADDS+K +GQDL DSDELR+SGSGS
Subjt:  MESDDDFQFLSTPEVASPLVSGRKLKRLKKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGS

Query:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL
        RDLD+G NLE  LGLD +EN SG EK LEFDAVAGI+E  EDQ   +GEES DA+VD    +R +   +   R  + ++  N+R   S        +   
Subjt:  RDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINENPEDQRQGVGEESGDAVVD---NRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRL

Query:  RPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEKERE
        +   E  RREYVEQLRAESQRLLRDTRGAAFKPMPLV+KPISSVLEKIR+RKLELS KSIN+EN ILD  +DDN L TEVVIKHRL+VEGR DS+E+E E
Subjt:  RPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRRKLELSMKSINVENIILDDSEDDN-LFTEVVIKHRLAVEGREDSLEKERE

Query:  DMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS---------------------------------------------F
        DM Q PAD+ N+K SMCIDERSNGTN+PSER  ATD+VTEAF  PINDTQ +FS                                              
Subjt:  DMDQCPADVENQKDSMCIDERSNGTNLPSER--ATDEVTEAFRAPINDTQ-VFS---------------------------------------------F

Query:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY
          SHLQENFTPS+LAMNLRLDSAALD+DSDEEDNDKENVNPHP G S+LP+ A+GDPVKAFVDDEAEEEDDSDHDMRFQDE+EDE TDSEEL+DMIATAY
Subjt:  RGSHLQENFTPSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAY

Query:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDE
        EENPLD+EKRNELHQKWLE +DAAGTEDLLQKLK                           +AAEDLLPLNVARMNIRKVKQMLPQ+YTD DDQYMSDDE
Subjt:  EENPLDSEKRNELHQKWLEHQDAAGTEDLLQKLK---------------------------SAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDE

Query:  ETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFG
        ETERR+ RERV  KAE KSTFLSPAEDESTREVFGLIKKLNVV DVKKRPKAQSFLDPPLTGVGKN +SKSSFLGRSSNLSLSSS KHGSS NSRSFIFG
Subjt:  ETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLDPPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFG

Query:  RDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKKP
        RDDSN++SAI  +EESSD  QSENKP R   AKFSYSQV+PSAQN    E KSG+SLFDILRQSSLQLQ RKPCTF EESSQMSSAFASFKLEKTHMKKP
Subjt:  RDDSNTRSAIAPIEESSDPVQSENKPAR---AKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQKRKPCTFSEESSQMSSAFASFKLEKTHMKKP

Query:  IKSEGRF
        IK+EGRF
Subjt:  IKSEGRF

SwissProt top hitse value%identityAlignment
F4IQJ6 Protein LIKE COV 31.2e-9875.71Show/hide
Query:  RERDLELLIPVATVSENGSSKSSSAPAATLSH--HHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT
        RERDLE LIP+     +      S P + L+   H +G+EA+ KVIRSWASKKFMTGCVILLPIA+TFY TW FI FVDGFFSPIY HLGIN+FGLGFVT
Subjt:  RERDLELLIPVATVSENGSSKSSSAPAATLSH--HHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT

Query:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH
        SITFIF+VG+FMSSWLGASVL++GEWFIKKMPLVSYIY+ASKQIS AISPDQ+S AFKEVAIIRHP  GEYAFGFITS+V+L+ R   EEL CVYVPTNH
Subjt:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH

Query:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAI
        LYLGDIFL+SSKDI+RPNLSVREGIEIVISGGMS+P +LTT+D+  I
Subjt:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAI

F4IUE7 Protein CONTINUOUS VASCULAR RING 11.5e-9675.41Show/hide
Query:  RERDLELLIPVA---TVSENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFV
        RERD ELLIPVA      +  SSK SS+ +A+ S H S  E L+  IR WASKKFMTGCVILLPIAITFYITW FI FVDGFFSPIY  LGIN+FG GF+
Subjt:  RERDLELLIPVA---TVSENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFV

Query:  TSITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTN
        TSI FIFLVG+FMSSWLGASVL LGEWFIK+MP V +IY ASKQISTAISPDQ + AFKEVAIIRHPR GEYAFGFITS+VVLQ+   EEEL CVYVPTN
Subjt:  TSITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTN

Query:  HLYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVD
        HLY+GDI L++S D++RPNLSVREGIEIV+SGGMS+PQIL+TVD
Subjt:  HLYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVD

O49929 Outer envelope pore protein 24, chloroplastic1.3e-8474.65Show/hide
Query:  MKATLKTKYDAD-KSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTILD
        MKA LK KYD D  S  A+T+A NAGDVKLRASITDAT  N PSL GL LAVEKPG F VDYNVPKKD RFQFMNTV+VAEKPLNL YIHS+GDNRTILD
Subjt:  MKATLKTKYDAD-KSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTILD

Query:  GTLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESKR
        GTLV D +NKVSAN+ + SGNCKLKY+Y H+G+TT EP+YDVAKNSWDF+VS KVYGDD  KA+YQT++K LGLEWTRN K  G FK+VASVNLAEE K 
Subjt:  GTLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESKR

Query:  PKVTAESTWNFEI
        PK++ EST NFE+
Subjt:  PKVTAESTWNFEI

Q1H5C9 Outer envelope pore protein 24A, chloroplastic7.5e-8568.4Show/hide
Query:  MKATLKTKYDADKSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTILDG
        MKA+ K K+D DKSG+ ++L  NAG+ KLRA++TDA+ + GPS NGL+LAVEKPGFFI+DYNVPKKD+RFQFMNT+++AEKPLNLTYIH RGDNRTI+DG
Subjt:  MKATLKTKYDADKSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTILDG

Query:  TLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESKRP
        + V+D ANK+SAN+++G+ NCKLKYTYVH G+ TFEP YDVAKN WDF++S K+YG D  KATYQT++K LGLEW+ N KS G+FK+ AS+NLAEE K P
Subjt:  TLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESKRP

Query:  KVTAESTWNFEI
        K+TAE+TWN E+
Subjt:  KVTAESTWNFEI

Q8VY49 Protein LIKE COV 19.5e-9672.33Show/hide
Query:  RERDLELLIPVATV--SENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT
        RERD ELLIPVA     ++GSS   S+ ++  S H SG E L+  IR WASKKFMTGCVILLPIA+TFY TW FI FVDGFFSPIY  LGINIFG GF+T
Subjt:  RERDLELLIPVATV--SENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT

Query:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH
        SI FIFLVG+FMSSWLGASVL LGEWFIK+MP V +IY ASKQISTAISPDQ + AFKEVAIIRHPR GEYAFGFITS+VVLQ+   EEEL CVYVPTNH
Subjt:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH

Query:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAIPAVGIE
        LY+GDI L++S D++RPNLSVREGIEIV+SGGMS+PQIL+T+D + + ++G E
Subjt:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAIPAVGIE

Arabidopsis top hitse value%identityAlignment
AT1G45170.1 unknown protein5.3e-8668.4Show/hide
Query:  MKATLKTKYDADKSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTILDG
        MKA+ K K+D DKSG+ ++L  NAG+ KLRA++TDA+ + GPS NGL+LAVEKPGFFI+DYNVPKKD+RFQFMNT+++AEKPLNLTYIH RGDNRTI+DG
Subjt:  MKATLKTKYDADKSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTILDG

Query:  TLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESKRP
        + V+D ANK+SAN+++G+ NCKLKYTYVH G+ TFEP YDVAKN WDF++S K+YG D  KATYQT++K LGLEW+ N KS G+FK+ AS+NLAEE K P
Subjt:  TLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESKRP

Query:  KVTAESTWNFEI
        K+TAE+TWN E+
Subjt:  KVTAESTWNFEI

AT2G18460.1 like COV 38.5e-10075.71Show/hide
Query:  RERDLELLIPVATVSENGSSKSSSAPAATLSH--HHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT
        RERDLE LIP+     +      S P + L+   H +G+EA+ KVIRSWASKKFMTGCVILLPIA+TFY TW FI FVDGFFSPIY HLGIN+FGLGFVT
Subjt:  RERDLELLIPVATVSENGSSKSSSAPAATLSH--HHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT

Query:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH
        SITFIF+VG+FMSSWLGASVL++GEWFIKKMPLVSYIY+ASKQIS AISPDQ+S AFKEVAIIRHP  GEYAFGFITS+V+L+ R   EEL CVYVPTNH
Subjt:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH

Query:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAI
        LYLGDIFL+SSKDI+RPNLSVREGIEIVISGGMS+P +LTT+D+  I
Subjt:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAI

AT2G20120.1 Protein of unknown function (DUF502)1.8e-9775Show/hide
Query:  RERDLELLIPVA---TVSENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFV
        RERD ELLIPVA      +  SSK SS+ +A+ S H S  E L+  IR WASKKFMTGCVILLPIAITFYITW FI FVDGFFSPIY  LGIN+FG GF+
Subjt:  RERDLELLIPVA---TVSENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFV

Query:  TSITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTN
        TSI FIFLVG+FMSSWLGASVL LGEWFIK+MP V +IY ASKQISTAISPDQ + AFKEVAIIRHPR GEYAFGFITS+VVLQ+   EEEL CVYVPTN
Subjt:  TSITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTN

Query:  HLYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVD
        HLY+GD+ L++S D++RPNLSVREGIEIV+SGGMS+PQIL+TVD
Subjt:  HLYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVD

AT2G20130.1 like COV 16.7e-9772.33Show/hide
Query:  RERDLELLIPVATV--SENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT
        RERD ELLIPVA     ++GSS   S+ ++  S H SG E L+  IR WASKKFMTGCVILLPIA+TFY TW FI FVDGFFSPIY  LGINIFG GF+T
Subjt:  RERDLELLIPVATV--SENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVT

Query:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH
        SI FIFLVG+FMSSWLGASVL LGEWFIK+MP V +IY ASKQISTAISPDQ + AFKEVAIIRHPR GEYAFGFITS+VVLQ+   EEEL CVYVPTNH
Subjt:  SITFIFLVGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNH

Query:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAIPAVGIE
        LY+GDI L++S D++RPNLSVREGIEIV+SGGMS+PQIL+T+D + + ++G E
Subjt:  LYLGDIFLMSSKDILRPNLSVREGIEIVISGGMSVPQILTTVDARAIPAVGIE

AT5G42960.1 unknown protein7.7e-8567.76Show/hide
Query:  LTMKATLKTKYDADKSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTIL
        + MKA++K KYD DK+    +LA NAGD+KLRA++TDAT++ GP+L GLALAVEKPG FIV+YNVPKKD+RFQFMNTV++AEKPLNLTYIHSR DNRTI+
Subjt:  LTMKATLKTKYDADKSGAASTLAVNAGDVKLRASITDATIINGPSLNGLALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTIL

Query:  DGTLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESK
        DG+LV+DSANK+SANH++G+ NCK+KYTY H G+ TFEP YD+AKN+WDF+VSR+ Y  D  +ATYQT++K LG+EW+RN K++G FK+ ASVNLA+E K
Subjt:  DGTLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYGDDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESK

Query:  RPKVTAESTWNFEI
         PK+TAE+TWN E+
Subjt:  RPKVTAESTWNFEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTACAAGAGAGAGGGAAAGAGATCTAGAGCTTCTGATCCCAGTCGCTACAGTTTCTGAAAATGGCTCCTCCAAATCTTCGTCCGCACCCGCTGCGACTTTATCCCA
TCATCACTCCGGCCGAGAGGCATTGGCTAAAGTCATTCGCAGCTGGGCATCGAAAAAGTTCATGACAGGATGTGTCATTCTTCTACCAATTGCAATTACATTCTACATCA
CGTGGGGCTTCATTCGTTTTGTTGATGGTTTCTTCTCTCCAATATATAATCACTTAGGGATCAATATTTTTGGGCTTGGATTCGTTACCTCCATCACATTCATCTTCCTA
GTTGGTATTTTCATGTCATCTTGGTTGGGAGCTTCTGTACTAACTCTTGGTGAATGGTTCATTAAGAAAATGCCACTTGTAAGCTATATCTACACGGCCTCAAAGCAAAT
AAGTACTGCAATATCACCTGATCAAACTTCAAATGCCTTCAAGGAAGTGGCAATCATCAGGCACCCTCGTGCTGGAGAATATGCTTTTGGATTCATCACTTCCTCCGTTG
TTCTTCAGAGCAGAATGGAAGAGGAAGAACTCTATTGTGTTTATGTGCCTACCAATCATCTCTATCTTGGAGATATATTTCTTATGAGTTCCAAGGACATCCTGAGGCCG
AATCTGTCTGTTCGAGAGGGAATTGAAATTGTCATCTCTGGAGGGATGTCAGTACCTCAAATACTGACCACAGTGGATGCAAGAGCAATTCCAGCAGTTGGAATCGAAAA
ACTTTGCAAAACCAAAAGTATGAATGGGCATGCAACTGAATCTGGCTTCTCCTCCACCTTCACCAGCTGTAATGGAGCAAACTCTTGCAGAGATGATTTGTTAGTTCTCT
CTCGCCTCTTGATCTCCGCCGTCGGATTGTTCCCTCTTCATATCCGCCGGAGGCAAGAAGCAATCCATGGACAAACCACAAATATTAAAATCAACCTCTTCAATTCTCCA
ACTCTCCTCTATTTTCCGTCTGTGGTTGAGCGCCTCCCCGTACCGGTGCAGCCTGGCCGTCGTCCGGCCGCCGTGCGCGATGTTAATGTTGTCGACATATCGCTTCGTGT
CCTCGAATTGGACGAGGAGGCCGGTTCGCTGGCTGAAGTAGCCCCATATGGTGTGGTGGACGATCTCGACGTGCGGCGTGCTCTGCGACTTCAGCAACTCCGCCGCCGTC
TCCAGTTTCAGAATGAAACAGTCTTCTTTGTTTATGGCTTTCTCCCCAATGCAGACCGCGCCTTCGAACAGATTCGCCGTGTACCTTGGGTCCAATCCCTAGGTCCGGAT
TCTTCTGCCACAGCACAAACCCACCCACCTCGCATTTCCCCCTCGCCTGCACGCTATCTTCCCCCTGCTGCATCGCCGATCCCACCATCTTCACCTGCCCCACCGCGTAC
ATGCTGTTCACCGAGTTCAACGCCGCCACCCCTCCCGTCGCCGCCACGTATTGCTGCACTATATACTTCGCCGTCGACGCCTCCTACACGTGCGCATTTTTCACGGAGAG
GAACATATACTTTCATTTCAACGGGCATTGAAAACGAAATGGAATATCAGACAGAGTTATCAAAGGAAAAATCGGACGCTTACGATGGAATAATCTCTGAAAGAGCAGTT
AAAGGTGAGATCAAGATGGACCTGGAATGGAATGAGAGGAGAGCCAACGAGCTTGAGCAGAGCCATAAACTCGTTGTTAGAAGCCGCGGAGAGATGCGACGACTTGTTGA
TATTCTCAGCCGACTGCGCCTTCATCAAGGCGCGGAGGTTCTGCCAGCGGAGGGCGGCGTTGGTGCCCATGTTGGTGAACATCTCCTCCGGTATCGGCACCTCCAGCACC
GTCTCGAGTCCATTTTCTTTGTCGAAGTTAGGACAGAGTTTCCTCATGGAATTGAGAGAGAGGAATATTGGGGAAGTGAAAATTTGAGGGGAAAGGAAAGAGGAAGGAGA
AGAGAGGGTGGTGTTTTTTGGAAGGTGGTCCTTGCGCGTAGGTGTCGTGAGGGCACGATTTTAGGAGCGCGACTGCCTGGTTTTCTTTTCTTTTATTGGCTTTCTTCTCT
CTTCGCCGCGTGTCTTGCACGCCATTGGCAAATTCATGTTTTTTGTAAGGTTATACCTTGCTTATCTCCTCCGTTGCTCACCATGAAGGCAACTCTGAAGACCAAGTACG
ACGCTGACAAAAGCGGCGCCGCCTCGACTTTAGCTGTTAACGCCGGAGATGTCAAGCTCCGAGCTTCCATCACTGACGCTACGATTATCAACGGCCCTAGCTTAAATGGC
CTAGCTCTTGCCGTCGAGAAACCAGGCTTCTTCATAGTCGATTACAATGTCCCCAAAAAGGATCTTCGCTTTCAGTTTATGAACACAGTTAAGGTAGCCGAGAAGCCTTT
GAACTTGACTTACATCCATAGTAGGGGCGACAACAGAACAATTCTCGACGGGACCCTAGTCGTTGATTCGGCTAACAAGGTCTCTGCTAACCATGTTCTTGGCTCTGGAA
ATTGTAAATTGAAGTACACTTACGTGCATGAGGGGATGACCACTTTTGAACCTTCCTATGATGTTGCAAAGAACTCGTGGGACTTTTCTGTTTCAAGGAAGGTTTATGGT
GATGATGTATTCAAGGCTACGTACCAGACTACAACCAAGGCACTGGGATTAGAATGGACAAGGAATTTAAAGTCAAACGGAAATTTCAAGATTGTAGCATCGGTCAATCT
GGCAGAGGAATCAAAAAGGCCAAAAGTAACTGCAGAATCGACATGGAACTTTGAAATCATCAAAGATTCCGTCGAGTCGTCCAACGAACTGGTCACTCCTTCGTCTTCGA
TTATTGTTACAAATCGGTTGCGGAGAATGGAGAGCGACGACGATTTCCAGTTCCTGTCGACTCCCGAGGTCGCATCCCCTCTGGTTTCCGGGAGGAAGCTGAAGCGCTTG
AAGAAAGCAAGCGCAGTCTCGGAGGATTTGCCAAGGATCGACGATCAATCTGTAAGTGGAGTCTTGGGGGATTTTCCCATGATCGATGACCGATTCGATGATGGGTATAA
GATGCATGAATTGAGTGCTGCGGAAGTTGTAGCGGATGACTCGAACAAATTGAACGGCCAAGATCTGGATGACTCCGATGAGCTCCGACGATCAGGATCTGGATCGAGGG
ATTTGGATAACGGAGACAATTTGGAACTGATTCTTGGTCTGGATGGCGAGGAAAATGGCTCCGGGGTGGAGAAGACCTTGGAGTTTGATGCTGTGGCCGGGATTAATGAA
AATCCCGAAGATCAAAGACAGGGCGTAGGAGAGGAAAGCGGGGATGCGGTAGTAGACAACCGGAGAAGAAACGGCATAGTTTGGATGCGTTCGAGGACGAGAGAGAATCA
AAGAGGAGGAAATCGAAGAACAAACGGCTCAAGAGCAGTGGGAACCCCGGAAATTTCAGTGAGACTGCGGCCCCCAAAAGAATTCTCGAGAAGAGAATATGTTGAGCAAC
TCCGCGCCGAGTCTCAGAGACTCTTGCGAGATACTAGAGGAGCAGCATTCAAGCCTATGCCACTTGTTCAGAAGCCAATATCTTCAGTTTTGGAGAAGATCCGGCGAAGG
AAGCTTGAGCTCTCAATGAAATCTATCAATGTTGAGAACATCATTTTGGACGACAGCGAGGATGATAATCTATTCACAGAGGTCGTGATCAAGCATAGGTTGGCTGTAGA
AGGGAGGGAGGACTCTCTAGAAAAAGAACGTGAGGATATGGATCAGTGTCCTGCAGATGTGGAGAACCAAAAAGATTCAATGTGTATAGATGAACGTAGTAATGGGACAA
ACTTGCCCTCAGAAAGGGCTACCGATGAGGTCACAGAAGCATTTCGGGCTCCCATTAATGATACTCAGGTATTTTCATTCAGAGGCAGTCATCTGCAAGAAAATTTCACG
CCATCCATATTGGCAATGAATTTAAGGCTTGATTCTGCTGCCCTTGATGACGATTCTGATGAAGAGGATAATGATAAGGAGAATGTTAATCCACATCCACATGGTTCGTC
AGACTTACCTACATCGGCAAACGGGGATCCTGTTAAAGCTTTTGTTGATGATGAAGCTGAGGAAGAAGATGACAGTGACCATGATATGCGCTTCCAAGATGAAGAGGAAG
ACGAGGACACTGATTCAGAAGAACTTGAAGATATGATAGCAACTGCATATGAAGAAAATCCATTAGACAGTGAAAAACGCAACGAACTCCATCAGAAGTGGCTTGAACAT
CAAGATGCTGCTGGAACAGAAGATCTTCTGCAGAAATTGAAATCAGCTGCAGAAGATTTACTTCCACTGAATGTGGCTCGAATGAACATAAGAAAAGTAAAGCAAATGCT
TCCTCAATTGTATACAGATAAAGATGATCAGTACATGTCCGATGATGAGGAAACGGAGAGGAGGCTAGCAAGAGAACGTGTATTTGACAAAGCTGAGGGAAAATCCACAT
TCCTGTCACCTGCTGAGGACGAGAGCACTAGAGAAGTTTTTGGCCTTATTAAGAAGCTGAATGTTGTACCTGATGTAAAGAAGAGACCCAAAGCACAATCGTTTTTGGAC
CCTCCACTAACTGGTGTAGGCAAGAACACATCTTCAAAGTCATCTTTCTTAGGCCGAAGTTCAAATTTATCTCTTTCTTCATCCCATAAGCATGGATCATCAACCAACAG
CCGTTCTTTTATATTCGGTCGGGACGATAGCAACACCAGGAGTGCAATTGCACCAATAGAGGAATCTTCTGATCCGGTTCAGAGTGAAAATAAGCCTGCAAGAGCGAAGT
TTAGTTACTCCCAAGTGAAACCAAGTGCACAGAACAGTACTGCACCCGAAATTAAGTCAGGTACTTCATTGTTTGATATATTAAGGCAATCTTCCTTGCAACTGCAAAAA
CGCAAACCATGCACGTTCAGTGAGGAGTCTAGCCAAATGAGTTCTGCATTTGCTTCATTCAAATTGGAGAAGACACACATGAAGAAGCCAATCAAAAGTGAAGGAAGATT
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTACAAGAGAGAGGGAAAGAGATCTAGAGCTTCTGATCCCAGTCGCTACAGTTTCTGAAAATGGCTCCTCCAAATCTTCGTCCGCACCCGCTGCGACTTTATCCCA
TCATCACTCCGGCCGAGAGGCATTGGCTAAAGTCATTCGCAGCTGGGCATCGAAAAAGTTCATGACAGGATGTGTCATTCTTCTACCAATTGCAATTACATTCTACATCA
CGTGGGGCTTCATTCGTTTTGTTGATGGTTTCTTCTCTCCAATATATAATCACTTAGGGATCAATATTTTTGGGCTTGGATTCGTTACCTCCATCACATTCATCTTCCTA
GTTGGTATTTTCATGTCATCTTGGTTGGGAGCTTCTGTACTAACTCTTGGTGAATGGTTCATTAAGAAAATGCCACTTGTAAGCTATATCTACACGGCCTCAAAGCAAAT
AAGTACTGCAATATCACCTGATCAAACTTCAAATGCCTTCAAGGAAGTGGCAATCATCAGGCACCCTCGTGCTGGAGAATATGCTTTTGGATTCATCACTTCCTCCGTTG
TTCTTCAGAGCAGAATGGAAGAGGAAGAACTCTATTGTGTTTATGTGCCTACCAATCATCTCTATCTTGGAGATATATTTCTTATGAGTTCCAAGGACATCCTGAGGCCG
AATCTGTCTGTTCGAGAGGGAATTGAAATTGTCATCTCTGGAGGGATGTCAGTACCTCAAATACTGACCACAGTGGATGCAAGAGCAATTCCAGCAGTTGGAATCGAAAA
ACTTTGCAAAACCAAAAGTATGAATGGGCATGCAACTGAATCTGGCTTCTCCTCCACCTTCACCAGCTGTAATGGAGCAAACTCTTGCAGAGATGATTTGTTAGTTCTCT
CTCGCCTCTTGATCTCCGCCGTCGGATTGTTCCCTCTTCATATCCGCCGGAGGCAAGAAGCAATCCATGGACAAACCACAAATATTAAAATCAACCTCTTCAATTCTCCA
ACTCTCCTCTATTTTCCGTCTGTGGTTGAGCGCCTCCCCGTACCGGTGCAGCCTGGCCGTCGTCCGGCCGCCGTGCGCGATGTTAATGTTGTCGACATATCGCTTCGTGT
CCTCGAATTGGACGAGGAGGCCGGTTCGCTGGCTGAAGTAGCCCCATATGGTGTGGTGGACGATCTCGACGTGCGGCGTGCTCTGCGACTTCAGCAACTCCGCCGCCGTC
TCCAGTTTCAGAATGAAACAGTCTTCTTTGTTTATGGCTTTCTCCCCAATGCAGACCGCGCCTTCGAACAGATTCGCCGTGTACCTTGGGTCCAATCCCTAGGTCCGGAT
TCTTCTGCCACAGCACAAACCCACCCACCTCGCATTTCCCCCTCGCCTGCACGCTATCTTCCCCCTGCTGCATCGCCGATCCCACCATCTTCACCTGCCCCACCGCGTAC
ATGCTGTTCACCGAGTTCAACGCCGCCACCCCTCCCGTCGCCGCCACGTATTGCTGCACTATATACTTCGCCGTCGACGCCTCCTACACGTGCGCATTTTTCACGGAGAG
GAACATATACTTTCATTTCAACGGGCATTGAAAACGAAATGGAATATCAGACAGAGTTATCAAAGGAAAAATCGGACGCTTACGATGGAATAATCTCTGAAAGAGCAGTT
AAAGGTGAGATCAAGATGGACCTGGAATGGAATGAGAGGAGAGCCAACGAGCTTGAGCAGAGCCATAAACTCGTTGTTAGAAGCCGCGGAGAGATGCGACGACTTGTTGA
TATTCTCAGCCGACTGCGCCTTCATCAAGGCGCGGAGGTTCTGCCAGCGGAGGGCGGCGTTGGTGCCCATGTTGGTGAACATCTCCTCCGGTATCGGCACCTCCAGCACC
GTCTCGAGTCCATTTTCTTTGTCGAAGTTAGGACAGAGTTTCCTCATGGAATTGAGAGAGAGGAATATTGGGGAAGTGAAAATTTGAGGGGAAAGGAAAGAGGAAGGAGA
AGAGAGGGTGGTGTTTTTTGGAAGGTGGTCCTTGCGCGTAGGTGTCGTGAGGGCACGATTTTAGGAGCGCGACTGCCTGGTTTTCTTTTCTTTTATTGGCTTTCTTCTCT
CTTCGCCGCGTGTCTTGCACGCCATTGGCAAATTCATGTTTTTTGTAAGGTTATACCTTGCTTATCTCCTCCGTTGCTCACCATGAAGGCAACTCTGAAGACCAAGTACG
ACGCTGACAAAAGCGGCGCCGCCTCGACTTTAGCTGTTAACGCCGGAGATGTCAAGCTCCGAGCTTCCATCACTGACGCTACGATTATCAACGGCCCTAGCTTAAATGGC
CTAGCTCTTGCCGTCGAGAAACCAGGCTTCTTCATAGTCGATTACAATGTCCCCAAAAAGGATCTTCGCTTTCAGTTTATGAACACAGTTAAGGTAGCCGAGAAGCCTTT
GAACTTGACTTACATCCATAGTAGGGGCGACAACAGAACAATTCTCGACGGGACCCTAGTCGTTGATTCGGCTAACAAGGTCTCTGCTAACCATGTTCTTGGCTCTGGAA
ATTGTAAATTGAAGTACACTTACGTGCATGAGGGGATGACCACTTTTGAACCTTCCTATGATGTTGCAAAGAACTCGTGGGACTTTTCTGTTTCAAGGAAGGTTTATGGT
GATGATGTATTCAAGGCTACGTACCAGACTACAACCAAGGCACTGGGATTAGAATGGACAAGGAATTTAAAGTCAAACGGAAATTTCAAGATTGTAGCATCGGTCAATCT
GGCAGAGGAATCAAAAAGGCCAAAAGTAACTGCAGAATCGACATGGAACTTTGAAATCATCAAAGATTCCGTCGAGTCGTCCAACGAACTGGTCACTCCTTCGTCTTCGA
TTATTGTTACAAATCGGTTGCGGAGAATGGAGAGCGACGACGATTTCCAGTTCCTGTCGACTCCCGAGGTCGCATCCCCTCTGGTTTCCGGGAGGAAGCTGAAGCGCTTG
AAGAAAGCAAGCGCAGTCTCGGAGGATTTGCCAAGGATCGACGATCAATCTGTAAGTGGAGTCTTGGGGGATTTTCCCATGATCGATGACCGATTCGATGATGGGTATAA
GATGCATGAATTGAGTGCTGCGGAAGTTGTAGCGGATGACTCGAACAAATTGAACGGCCAAGATCTGGATGACTCCGATGAGCTCCGACGATCAGGATCTGGATCGAGGG
ATTTGGATAACGGAGACAATTTGGAACTGATTCTTGGTCTGGATGGCGAGGAAAATGGCTCCGGGGTGGAGAAGACCTTGGAGTTTGATGCTGTGGCCGGGATTAATGAA
AATCCCGAAGATCAAAGACAGGGCGTAGGAGAGGAAAGCGGGGATGCGGTAGTAGACAACCGGAGAAGAAACGGCATAGTTTGGATGCGTTCGAGGACGAGAGAGAATCA
AAGAGGAGGAAATCGAAGAACAAACGGCTCAAGAGCAGTGGGAACCCCGGAAATTTCAGTGAGACTGCGGCCCCCAAAAGAATTCTCGAGAAGAGAATATGTTGAGCAAC
TCCGCGCCGAGTCTCAGAGACTCTTGCGAGATACTAGAGGAGCAGCATTCAAGCCTATGCCACTTGTTCAGAAGCCAATATCTTCAGTTTTGGAGAAGATCCGGCGAAGG
AAGCTTGAGCTCTCAATGAAATCTATCAATGTTGAGAACATCATTTTGGACGACAGCGAGGATGATAATCTATTCACAGAGGTCGTGATCAAGCATAGGTTGGCTGTAGA
AGGGAGGGAGGACTCTCTAGAAAAAGAACGTGAGGATATGGATCAGTGTCCTGCAGATGTGGAGAACCAAAAAGATTCAATGTGTATAGATGAACGTAGTAATGGGACAA
ACTTGCCCTCAGAAAGGGCTACCGATGAGGTCACAGAAGCATTTCGGGCTCCCATTAATGATACTCAGGTATTTTCATTCAGAGGCAGTCATCTGCAAGAAAATTTCACG
CCATCCATATTGGCAATGAATTTAAGGCTTGATTCTGCTGCCCTTGATGACGATTCTGATGAAGAGGATAATGATAAGGAGAATGTTAATCCACATCCACATGGTTCGTC
AGACTTACCTACATCGGCAAACGGGGATCCTGTTAAAGCTTTTGTTGATGATGAAGCTGAGGAAGAAGATGACAGTGACCATGATATGCGCTTCCAAGATGAAGAGGAAG
ACGAGGACACTGATTCAGAAGAACTTGAAGATATGATAGCAACTGCATATGAAGAAAATCCATTAGACAGTGAAAAACGCAACGAACTCCATCAGAAGTGGCTTGAACAT
CAAGATGCTGCTGGAACAGAAGATCTTCTGCAGAAATTGAAATCAGCTGCAGAAGATTTACTTCCACTGAATGTGGCTCGAATGAACATAAGAAAAGTAAAGCAAATGCT
TCCTCAATTGTATACAGATAAAGATGATCAGTACATGTCCGATGATGAGGAAACGGAGAGGAGGCTAGCAAGAGAACGTGTATTTGACAAAGCTGAGGGAAAATCCACAT
TCCTGTCACCTGCTGAGGACGAGAGCACTAGAGAAGTTTTTGGCCTTATTAAGAAGCTGAATGTTGTACCTGATGTAAAGAAGAGACCCAAAGCACAATCGTTTTTGGAC
CCTCCACTAACTGGTGTAGGCAAGAACACATCTTCAAAGTCATCTTTCTTAGGCCGAAGTTCAAATTTATCTCTTTCTTCATCCCATAAGCATGGATCATCAACCAACAG
CCGTTCTTTTATATTCGGTCGGGACGATAGCAACACCAGGAGTGCAATTGCACCAATAGAGGAATCTTCTGATCCGGTTCAGAGTGAAAATAAGCCTGCAAGAGCGAAGT
TTAGTTACTCCCAAGTGAAACCAAGTGCACAGAACAGTACTGCACCCGAAATTAAGTCAGGTACTTCATTGTTTGATATATTAAGGCAATCTTCCTTGCAACTGCAAAAA
CGCAAACCATGCACGTTCAGTGAGGAGTCTAGCCAAATGAGTTCTGCATTTGCTTCATTCAAATTGGAGAAGACACACATGAAGAAGCCAATCAAAAGTGAAGGAAGATT
CTGA
Protein sequenceShow/hide protein sequence
MGTRERERDLELLIPVATVSENGSSKSSSAPAATLSHHHSGREALAKVIRSWASKKFMTGCVILLPIAITFYITWGFIRFVDGFFSPIYNHLGINIFGLGFVTSITFIFL
VGIFMSSWLGASVLTLGEWFIKKMPLVSYIYTASKQISTAISPDQTSNAFKEVAIIRHPRAGEYAFGFITSSVVLQSRMEEEELYCVYVPTNHLYLGDIFLMSSKDILRP
NLSVREGIEIVISGGMSVPQILTTVDARAIPAVGIEKLCKTKSMNGHATESGFSSTFTSCNGANSCRDDLLVLSRLLISAVGLFPLHIRRRQEAIHGQTTNIKINLFNSP
TLLYFPSVVERLPVPVQPGRRPAAVRDVNVVDISLRVLELDEEAGSLAEVAPYGVVDDLDVRRALRLQQLRRRLQFQNETVFFVYGFLPNADRAFEQIRRVPWVQSLGPD
SSATAQTHPPRISPSPARYLPPAASPIPPSSPAPPRTCCSPSSTPPPLPSPPRIAALYTSPSTPPTRAHFSRRGTYTFISTGIENEMEYQTELSKEKSDAYDGIISERAV
KGEIKMDLEWNERRANELEQSHKLVVRSRGEMRRLVDILSRLRLHQGAEVLPAEGGVGAHVGEHLLRYRHLQHRLESIFFVEVRTEFPHGIEREEYWGSENLRGKERGRR
REGGVFWKVVLARRCREGTILGARLPGFLFFYWLSSLFAACLARHWQIHVFCKVIPCLSPPLLTMKATLKTKYDADKSGAASTLAVNAGDVKLRASITDATIINGPSLNG
LALAVEKPGFFIVDYNVPKKDLRFQFMNTVKVAEKPLNLTYIHSRGDNRTILDGTLVVDSANKVSANHVLGSGNCKLKYTYVHEGMTTFEPSYDVAKNSWDFSVSRKVYG
DDVFKATYQTTTKALGLEWTRNLKSNGNFKIVASVNLAEESKRPKVTAESTWNFEIIKDSVESSNELVTPSSSIIVTNRLRRMESDDDFQFLSTPEVASPLVSGRKLKRL
KKASAVSEDLPRIDDQSVSGVLGDFPMIDDRFDDGYKMHELSAAEVVADDSNKLNGQDLDDSDELRRSGSGSRDLDNGDNLELILGLDGEENGSGVEKTLEFDAVAGINE
NPEDQRQGVGEESGDAVVDNRRRNGIVWMRSRTRENQRGGNRRTNGSRAVGTPEISVRLRPPKEFSRREYVEQLRAESQRLLRDTRGAAFKPMPLVQKPISSVLEKIRRR
KLELSMKSINVENIILDDSEDDNLFTEVVIKHRLAVEGREDSLEKEREDMDQCPADVENQKDSMCIDERSNGTNLPSERATDEVTEAFRAPINDTQVFSFRGSHLQENFT
PSILAMNLRLDSAALDDDSDEEDNDKENVNPHPHGSSDLPTSANGDPVKAFVDDEAEEEDDSDHDMRFQDEEEDEDTDSEELEDMIATAYEENPLDSEKRNELHQKWLEH
QDAAGTEDLLQKLKSAAEDLLPLNVARMNIRKVKQMLPQLYTDKDDQYMSDDEETERRLARERVFDKAEGKSTFLSPAEDESTREVFGLIKKLNVVPDVKKRPKAQSFLD
PPLTGVGKNTSSKSSFLGRSSNLSLSSSHKHGSSTNSRSFIFGRDDSNTRSAIAPIEESSDPVQSENKPARAKFSYSQVKPSAQNSTAPEIKSGTSLFDILRQSSLQLQK
RKPCTFSEESSQMSSAFASFKLEKTHMKKPIKSEGRF