CuGenDBv2

Sed0007936 (gene) of Chayote v1 genome

Gene ID: Sed0007936
Organism: Sechium edule (Chayote v1)
Description: Carboxypeptidase
Genome location: LG06:34715676..34722557
RNA-Seq Expression: Sed0007936
Synteny: Sed0007936
Gene Ontology terms:
  GO:0008152 - metabolic process (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0005488 - binding (molecular function)
  GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR040415 - SET domain-containing protein 9


Homology
GenBank top hits (E-value, % identity, alignment)
XP_022152712.1 uncharacterized protein LOC111020368 [Momordica charantia] | E-value: 9.7e-96 | Identity: 68.57%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK
        FL   YNRLGR+A EADAEEIIDMASKASFADQQKQVQ+NIHSQV++FC HMDE+L P  R SNEPAESP Q ND                    ADIPK
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK

Query:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------
         RPL RSELSQKLKD IGYTLDIKPSQIPHK+AGQGLFIDGEADVGSV+AIYPG+IYSPAHYQYIPGYPRVDAQNPYLITR                   
Subjt:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------

Query:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                 SNP KQGDEKSDRLWRMLSK LE           E  NPL+FAHFANHPAKDMVPNVM+CPYD+PLTEKDM
Subjt:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

XP_023523546.1 uncharacterized protein LOC111787739 [Cucurbita pepo subsp. pepo] | E-value: 2.2e-92 | Identity: 67.14%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA--------------------DIPK
        FL   YNRLGRDA EADA+EIIDMASKA FADQQKQVQ+NIHSQVE+FC HMDEIL P TR S  PAESP Q N A                    DIPK
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA--------------------DIPK

Query:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------
         RPL RSELSQKLKD IGYTLDI+PSQIPHK+AGQGLF+DGEA+VGSVIAIYPG+IYSPAHYQYIPGYPRVDAQNPYLITR                   
Subjt:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------

Query:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                 SNPTKQGDEKSDRLW+MLSK LE           E  NPLAFAH+ANHPAKDM PNVM+CPYDFP+TEKDM
Subjt:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

XP_038903268.1 uncharacterized protein LOC120089904 isoform X1 [Benincasa hispida] | E-value: 9.7e-96 | Identity: 68.93%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK
        FL   YNRLG +A EADAEEIIDMA+KASFADQQKQVQ+NIHSQV++FC HMDEIL P TR SNEPAESP + +D                    ADIP 
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK

Query:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------
         RPLGRSELSQKLKDEIGYTLDI+PSQIPHK+AGQGLFIDGEADVGSVIAIYPG+IYSPAHYQYIPGYPRVDAQNPYLITR                   
Subjt:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------

Query:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                 SNP KQGDEKSDRLWRMLSK LE           E  NPLAFAHFANHPAKDMVPNVM+CPYDFPLTEKDM
Subjt:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

XP_038903269.1 uncharacterized protein LOC120089904 isoform X2 [Benincasa hispida] | E-value: 9.7e-96 | Identity: 68.93%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK
        FL   YNRLG +A EADAEEIIDMA+KASFADQQKQVQ+NIHSQV++FC HMDEIL P TR SNEPAESP + +D                    ADIP 
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK

Query:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------
         RPLGRSELSQKLKDEIGYTLDI+PSQIPHK+AGQGLFIDGEADVGSVIAIYPG+IYSPAHYQYIPGYPRVDAQNPYLITR                   
Subjt:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------

Query:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                 SNP KQGDEKSDRLWRMLSK LE           E  NPLAFAHFANHPAKDMVPNVM+CPYDFPLTEKDM
Subjt:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

XP_038903270.1 uncharacterized protein LOC120089904 isoform X3 [Benincasa hispida] | E-value: 2.8e-95 | Identity: 69.45%
Query:  YNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPKMRPLG
        YNRLG +A EADAEEIIDMA+KASFADQQKQVQ+NIHSQV++FC HMDEIL P TR SNEPAESP + +D                    ADIP  RPLG
Subjt:  YNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPKMRPLG

Query:  RSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR------------------------
        RSELSQKLKDEIGYTLDI+PSQIPHK+AGQGLFIDGEADVGSVIAIYPG+IYSPAHYQYIPGYPRVDAQNPYLITR                        
Subjt:  RSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR------------------------

Query:  ----SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
            SNP KQGDEKSDRLWRMLSK LE           E  NPLAFAHFANHPAKDMVPNVM+CPYDFPLTEKDM
Subjt:  ----SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0LCB3 Uncharacterized protein | E-value: 7.7e-91 | Identity: 66.55%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA---------------------DIP
        FL   YNRLG DA EADAEEIIDMASKASFADQQKQVQ+NIHSQV++FC HMD IL P     +EPAESP +  DA                      IP
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA---------------------DIP

Query:  KMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR------------------
          RPLGRSELSQ+LKDEIGYTLDIKPS+IPHK AGQGLFIDGEADVGS+IAIYPG++YSPAHYQYIPGYPRVDAQNPYLITR                  
Subjt:  KMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR------------------

Query:  ----------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                  SNPTKQGDEKSDRLWRMLSK LE           E  NPLAFAHFANHPAKDMVPNVM+CPYDFPLTEKDM
Subjt:  ----------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

A0A1S3CFI8 uncharacterized protein LOC103500346 | E-value: 1.1e-89 | Identity: 65.84%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA---------------------DIP
        FL   Y RLG +A EADAEEIIDMA+KASFADQQKQVQ+NIHSQV++FC HMDEIL P     +EPAESP +  DA                      IP
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA---------------------DIP

Query:  KMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR------------------
          RPLGRSELSQKLKDEIGYTLDIKPSQIPHK+AGQGLFIDGEADVGS+IAIYPG++YSPAHYQYIPGYPRV AQNPYLITR                  
Subjt:  KMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR------------------

Query:  ----------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                  SNPTKQG EKSDRLWRMLSK LE           E  NPLAFAHFANHPAKDMVPNVM+CPYDFPLTEKDM
Subjt:  ----------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

A0A6J1DH10 uncharacterized protein LOC111020368 | E-value: 4.7e-96 | Identity: 68.57%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK
        FL   YNRLGR+A EADAEEIIDMASKASFADQQKQVQ+NIHSQV++FC HMDE+L P  R SNEPAESP Q ND                    ADIPK
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFND--------------------ADIPK

Query:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------
         RPL RSELSQKLKD IGYTLDIKPSQIPHK+AGQGLFIDGEADVGSV+AIYPG+IYSPAHYQYIPGYPRVDAQNPYLITR                   
Subjt:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------

Query:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                 SNP KQGDEKSDRLWRMLSK LE           E  NPL+FAHFANHPAKDMVPNVM+CPYD+PLTEKDM
Subjt:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

A0A6J1GBM4 uncharacterized protein LOC111452691 | E-value: 9.1e-92 | Identity: 66.9%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA--------------------DIPK
        FL   YNRLGRDA EADAEEIIDMASKA FADQQKQVQ+NIHSQVE+FC HMDEIL P  R S  PAESP Q N A                    DIPK
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA--------------------DIPK

Query:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------
         RPL RSELSQKLKD IGYTLDI+PSQIPHK+AGQGLF+DGEADVGSVIAIYPG+IYSPAHYQYIPGYPRVD QNPYLITR                   
Subjt:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------

Query:  ---------SNPTKQGDEKSDRLWRMLSKLLE------------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                 SNPTKQGDEKSDRLW+MLSK LE            E  NPLAFAH+ANHPAKDM PNVM+CPYDFP+TEKDM
Subjt:  ---------SNPTKQGDEKSDRLWRMLSKLLE------------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

A0A6J1KA93 uncharacterized protein LOC111493079 | E-value: 3.5e-91 | Identity: 66.79%
Query:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA--------------------DIPK
        FL   YNRLGRDA EADAEEIIDMASKA FADQQKQVQ+NIHSQVE+FC HMDEIL P TR S  PAES  Q N A                    DIPK
Subjt:  FLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPVQFNDA--------------------DIPK

Query:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------
         RPLGRSELSQKLKD IGYTLDI+PSQIPHK+AGQGLF+DGEADVGSVIAIYPG+IYSPAHYQYIPGYPRVDAQNPYLITR                   
Subjt:  MRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR-------------------

Query:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
                 SNP K G+EKSDRLW+MLSK LE           E  NPLAFAH+ANHPAKDM PNVM+CPYDFP+TEKDM
Subjt:  ---------SNPTKQGDEKSDRLWRMLSKLLE-----------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

SwissProt top hits (E-value, % identity, alignment)
No hits found

Arabidopsis top hits (E-value, % identity, alignment)
AT5G08270.1 unknown protein | E-value: 4.5e-67 | Identity: 48.96%
Query:  TYALNPTDKEPCTMQSMQSFLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPG---TRNSNEPAESPVQFN---D
        T+A NP   +       + F+   YNRLGRDA EADAEEII+MA KA+F++QQKQVQ+NIH Q++NFC+ M+EIL      T++ NEP     + N    
Subjt:  TYALNPTDKEPCTMQSMQSFLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPG---TRNSNEPAESPVQFN---D

Query:  AD---IPKMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLIT-------RSNPT
        AD   +P+ +PL   E+S +LKD +GYTL+IKPS IPHK+AGQG FI+GEADVG+V+A YPG+IYSPA ++YIPGYP VDAQN YLIT        + P 
Subjt:  AD---IPKMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLIT-------RSNPT

Query:  KQG-----------------------DEKSDRLWRMLSKLLE---------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
         +G                       +  SD++W+MLS+ LE         E  NPLAF HF NHP K+M  NVM+CPYDF L+E +M
Subjt:  KQG-----------------------DEKSDRLWRMLSKLLE---------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM

AT5G23200.1 unknown protein | E-value: 9.1e-68 | Identity: 47.65%
Query:  TYALNPTDKEPCTMQSMQSFLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPV----------
        T+A NP   +       + F+   YNRLGR+A E DAEEII+MA KA+ ++QQKQVQ+NIH QVE FC+ MD IL P  R +   ++S            
Subjt:  TYALNPTDKEPCTMQSMQSFLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSNEPAESPV----------

Query:  -------QFNDAD---IPKMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR
                F  AD   +P+ +PL  +++SQ+L D++GYTL+ KPS IPHK+AGQG FI GEADVG+V+A YPG+IYSPA Y+YIPGYP+VD+QN YLITR
Subjt:  -------QFNDAD---IPKMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITR

Query:  SN-----------------------------PTKQGDEKSDRLWRMLSKLLE---------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM
         +                              TK  +  SDRLW+ LSK LE         E  NPLAF H ANHPAK+M PNVM+CPYDFPL  KD+
Subjt:  SN-----------------------------PTKQGDEKSDRLWRMLSKLLE---------ESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDM


Sequences
CDS sequence
ATGGCGGTACAAAAATGGAGACTGTATCTCCTTGGTAGAAATTTTATTATTAAAACCGACCAAAGAAGGGAGATTGGGTGTACTTACGCCTTAAACCCTACAGACAAGGA
ACCTTGTACCATGCAATCAATGCAAAGCTTTCTCCCCGAGATCTACAACCGGTTGGGGAGAGATGCTGTGGAGGCGGATGCAGAGGAGATCATTGACATGGCCAGTAAAG
CTTCTTTTGCTGATCAACAAAAGCAAGTGCAACAGAACATCCATTCGCAAGTTGAAAATTTTTGCAACCACATGGATGAAATTCTTTTTCCTGGGACGAGGAACAGCAAT
GAACCTGCTGAATCACCCGTACAGTTTAATGATGCTGATATTCCGAAAATGAGGCCGTTGGGACGTTCTGAACTTTCTCAGAAATTAAAGGATGAAATTGGCTACACTCT
CGATATCAAGCCATCTCAAATCCCTCACAAGGAAGCTGGACAAGGTCTTTTTATAGATGGTGAAGCTGATGTCGGATCGGTTATAGCAATATACCCCGGCATTATTTATT
CCCCTGCTCATTATCAGTACATTCCTGGATATCCAAGAGTTGATGCTCAGAACCCTTATCTGATCACGAGAAGTAATCCAACCAAACAAGGCGACGAGAAATCTGATCGT
CTTTGGCGAATGCTGAGCAAACTATTAGAGGAAAGCGAAAATCCTCTAGCCTTTGCTCATTTCGCCAACCACCCAGCTAAGGATATGGTTCCAAATGTCATGGTTTGTCC
TTATGATTTCCCCTTGACAGAGAAGGATATGGAGGAGATCGAGTTGAATCCTCTCGTCTAA
mRNA sequence
ATGGCGGTACAAAAATGGAGACTGTATCTCCTTGGTAGAAATTTTATTATTAAAACCGACCAAAGAAGGGAGATTGGGTGTACTTACGCCTTAAACCCTACAGACAAGGA
ACCTTGTACCATGCAATCAATGCAAAGCTTTCTCCCCGAGATCTACAACCGGTTGGGGAGAGATGCTGTGGAGGCGGATGCAGAGGAGATCATTGACATGGCCAGTAAAG
CTTCTTTTGCTGATCAACAAAAGCAAGTGCAACAGAACATCCATTCGCAAGTTGAAAATTTTTGCAACCACATGGATGAAATTCTTTTTCCTGGGACGAGGAACAGCAAT
GAACCTGCTGAATCACCCGTACAGTTTAATGATGCTGATATTCCGAAAATGAGGCCGTTGGGACGTTCTGAACTTTCTCAGAAATTAAAGGATGAAATTGGCTACACTCT
CGATATCAAGCCATCTCAAATCCCTCACAAGGAAGCTGGACAAGGTCTTTTTATAGATGGTGAAGCTGATGTCGGATCGGTTATAGCAATATACCCCGGCATTATTTATT
CCCCTGCTCATTATCAGTACATTCCTGGATATCCAAGAGTTGATGCTCAGAACCCTTATCTGATCACGAGAAGTAATCCAACCAAACAAGGCGACGAGAAATCTGATCGT
CTTTGGCGAATGCTGAGCAAACTATTAGAGGAAAGCGAAAATCCTCTAGCCTTTGCTCATTTCGCCAACCACCCAGCTAAGGATATGGTTCCAAATGTCATGGTTTGTCC
TTATGATTTCCCCTTGACAGAGAAGGATATGGAGGAGATCGAGTTGAATCCTCTCGTCTAA
Protein sequence
MAVQKWRLYLLGRNFIIKTDQRREIGCTYALNPTDKEPCTMQSMQSFLPEIYNRLGRDAVEADAEEIIDMASKASFADQQKQVQQNIHSQVENFCNHMDEILFPGTRNSN
EPAESPVQFNDADIPKMRPLGRSELSQKLKDEIGYTLDIKPSQIPHKEAGQGLFIDGEADVGSVIAIYPGIIYSPAHYQYIPGYPRVDAQNPYLITRSNPTKQGDEKSDR
LWRMLSKLLEESENPLAFAHFANHPAKDMVPNVMVCPYDFPLTEKDMEEIELNPLV