; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G003220 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G003220
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPeroxidase
Genome locationchr04:3288635..3291045
RNA-Seq ExpressionLsi04G003220
SyntenyLsi04G003220
Gene Ontology termsGO:0006979 - response to oxidative stress (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0020037 - heme binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000823 - Plant peroxidase
IPR002016 - Haem peroxidase
IPR010255 - Haem peroxidase superfamily
IPR019794 - Peroxidase, active site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034499.1 peroxidase 11 [Cucumis melo var. makuwa]1.5e-4960Show/hide
Query:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------
        MGISNKVYGV+MMI  C  FVV S SLFETGE  L LDYYAKTCPNVLQ+VRKEMECAVLSEPRNAA VVRLHFHDCFVQ                    
Subjt:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------

Query:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                                    VGGPYW+VPLGRKDST+ASYELAN+NLPSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

XP_008446499.1 PREDICTED: LOW QUALITY PROTEIN: peroxidase 11 [Cucumis melo]1.5e-4960Show/hide
Query:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------
        MGISNKVYGV+MMI  C  FVV S SLFETGE  L LDYYAKTCPNVLQ+VRKEMECAVLSEPRNAA VVRLHFHDCFVQ                    
Subjt:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------

Query:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                                    VGGPYW+VPLGRKDST+ASYELAN+NLPSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

XP_011655736.1 peroxidase 11 isoform X1 [Cucumis sativus]2.2e-4859Show/hide
Query:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------
        M ISNKVYGV+MMI  C  FVV S SLFETGE  L LDYY +TCPNVLQ+VRKEMECAVLSEPRNAAFVVRLHFHDCFVQ                    
Subjt:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------

Query:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                                    VGGPYW+VPLGRKDST+ASYELAN+NLPSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

XP_023548451.1 peroxidase 11-like isoform X1 [Cucurbita pepo subsp. pepo]1.1e-4756.54Show/hide
Query:  YGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ-----------------------------
        YG+M+++   F  +S+SL++TG+PPL LDYYAKTCPNVLQVVRKEMECAVLS+PRNAAF VRLHFHDCFVQ                             
Subjt:  YGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ-----------------------------

Query:  -----------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                           VGGPYWEVPLGRKDSTTASYELAN NLPSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  -----------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

XP_038892918.1 peroxidase 11 isoform X1 [Benincasa hispida]1.1e-5261.11Show/hide
Query:  MGISNKVYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------
        M ISNKVYGVMM++   F V+S SLFETGEP L LDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ                      
Subjt:  MGISNKVYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------

Query:  ------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                                  VGGPYW+VPLGRKDSTTASYELANSNLPSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  ------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

TrEMBL top hitse value%identityAlignment
A0A1S3BF69 Peroxidase7.4e-5060Show/hide
Query:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------
        MGISNKVYGV+MMI  C  FVV S SLFETGE  L LDYYAKTCPNVLQ+VRKEMECAVLSEPRNAA VVRLHFHDCFVQ                    
Subjt:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------

Query:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                                    VGGPYW+VPLGRKDST+ASYELAN+NLPSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

A0A314UFE3 Peroxidase1.2e-4471.2Show/hide
Query:  VMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQVGGPYWEVPLGRKDSTTASYELANSNLPSAN
        ++  + + F V++ SL    EPPL LDYYA  CPN+ ++V+KEMECAVLS+PRNAA +VRLHFHDCFVQVGGPYW+VPLGRKDS TAS ELA +NLP+AN
Subjt:  VMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQVGGPYWEVPLGRKDSTTASYELANSNLPSAN

Query:  EGLLSLISKFLYQGLSVTDMVALSG
        EGL ++ISKFLYQGLSVTDMVALSG
Subjt:  EGLLSLISKFLYQGLSVTDMVALSG

A0A5D3CB22 Peroxidase7.4e-5060Show/hide
Query:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------
        MGISNKVYGV+MMI  C  FVV S SLFETGE  L LDYYAKTCPNVLQ+VRKEMECAVLSEPRNAA VVRLHFHDCFVQ                    
Subjt:  MGISNKVYGVMMMI--CVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------

Query:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                                    VGGPYW+VPLGRKDST+ASYELAN+NLPSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  --------------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

A0A6J1GZT7 Peroxidase2.6e-4755.73Show/hide
Query:  VYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------
        VYG+M+++   F  +S+SL++TG+P L LDYYAKTCPNVLQVVRKEMECAVLS+PRNAAF VRLHFHDCFVQ                            
Subjt:  VYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------

Query:  ------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                            VGGPYWEVPLGRKDSTTASYELAN NLPSANEGL+S+ISKFLYQGLSVTDMVALSG
Subjt:  ------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

A0A6J1JHN4 Peroxidase7.0e-4855.73Show/hide
Query:  VYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------
        VYG+++++   F  +S+SL++TG+PPL LDYYAKTCPNVLQVVRKEMECAVLS+PRNAAF VRLHFHDCFVQ                            
Subjt:  VYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------

Query:  ------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                            VGGPYWEVPLGRKDSTTASYELAN N+PSANEGLLS+ISKFLYQGLSVTDMVALSG
Subjt:  ------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

SwissProt top hitse value%identityAlignment
O23237 Peroxidase 491.4e-1633.33Show/hide
Query:  YYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------------------------
        YYA +CP V ++VR  +  AV  E R AA ++RLHFHDCFVQ                                                          
Subjt:  YYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------------------------

Query:  ------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
               GGP W VPLGR+DS +AS   +N+N+P+ N    +++SKF  QGL +TD+VALSG
Subjt:  ------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

Q4W1I8 Basic peroxidase1.0e-1631.05Show/hide
Query:  NKVYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------------
        +K  G ++M+ ++ ++IS + F +    L   +Y  TCP  L  +R  +  +V S  RNAA V+RL FHDCFVQ                          
Subjt:  NKVYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------------

Query:  ----------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                          VGGP W V LGR+DSTT++   A ++LP  N  L  LIS F  +GL+  +MVALSG
Subjt:  ----------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

Q4W1I9 Basic peroxidase1.8e-1631.05Show/hide
Query:  NKVYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------------
        +K  G  +M+ ++ ++IS + F +    L   +Y  TCP  L  +R  +  +V S  RNAA V+RL FHDCFVQ                          
Subjt:  NKVYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ--------------------------

Query:  ----------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                          VGGP W V LGR+DSTT++   A ++LP  N  L  LIS F  +GL+  +MVALSG
Subjt:  ----------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

Q96519 Peroxidase 112.2e-3041.03Show/hide
Query:  VMMMICVWFVV--ISESLFETGEP----PLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ-------------------------
        +M ++ V+F+V  I    F    P    PL LDYY  TCP V  V++KEMEC V  +PRNAA ++RLHFHDCFVQ                         
Subjt:  VMMMICVWFVV--ISESLFETGEP----PLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ-------------------------

Query:  ---------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                               VGGPYW+VP+GRKDS TASYELA +NLP+  EGL+S+I+KF  QGLSV DMVAL G
Subjt:  ---------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

Q9FG34 Peroxidase 541.0e-1631.67Show/hide
Query:  VVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------
        +VI  SLF T    L   +Y+ TCPN   +VR  ++ A+ S+ R    ++RLHFHDCFV                                         
Subjt:  VVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------

Query:  ------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                 GGP W V LGR+D  TA+   ANS+LPS  EGL ++ SKF+  GL  TD+V+LSG
Subjt:  ------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

Arabidopsis top hitse value%identityAlignment
AT1G68850.1 Peroxidase superfamily protein1.5e-3141.03Show/hide
Query:  VMMMICVWFVV--ISESLFETGEP----PLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ-------------------------
        +M ++ V+F+V  I    F    P    PL LDYY  TCP V  V++KEMEC V  +PRNAA ++RLHFHDCFVQ                         
Subjt:  VMMMICVWFVV--ISESLFETGEP----PLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ-------------------------

Query:  ---------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                               VGGPYW+VP+GRKDS TASYELA +NLP+  EGL+S+I+KF  QGLSV DMVAL G
Subjt:  ---------------------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

AT2G35380.1 Peroxidase superfamily protein8.2e-1730.16Show/hide
Query:  VMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFV--------------------------------
        V + + V + + +  L + GEP L+  +Y ++CP   ++V+  +E AVL +PR AA ++RL FHDCFV                                
Subjt:  VMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFV--------------------------------

Query:  --------------------------------QVGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                          GGP+WEV LGR+DS  AS+  AN  +P+ N  L SLI  F  QGL++ D++ALSG
Subjt:  --------------------------------QVGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

AT4G36430.1 Peroxidase superfamily protein9.7e-1833.33Show/hide
Query:  YYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------------------------
        YYA +CP V ++VR  +  AV  E R AA ++RLHFHDCFVQ                                                          
Subjt:  YYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------------------------

Query:  ------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
               GGP W VPLGR+DS +AS   +N+N+P+ N    +++SKF  QGL +TD+VALSG
Subjt:  ------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

AT5G06730.1 Peroxidase superfamily protein7.4e-1831.67Show/hide
Query:  VVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------
        +VI  SLF T    L   +Y+ TCPN   +VR  ++ A+ S+ R    ++RLHFHDCFV                                         
Subjt:  VVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ----------------------------------------

Query:  ------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
                                 GGP W V LGR+D  TA+   ANS+LPS  EGL ++ SKF+  GL  TD+V+LSG
Subjt:  ------------------------VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG

AT5G19890.1 Peroxidase superfamily protein1.1e-1634.38Show/hide
Query:  DYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ---------------------------------------------------------
        D YAK+CPN++Q+VRK++  A+ +E R AA ++RLHFHDCFV                                                          
Subjt:  DYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQ---------------------------------------------------------

Query:  ----VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG
             GGP W V LGRKD   A+   AN NLPS  E L ++I+KF+   L++TD+VALSG
Subjt:  ----VGGPYWEVPLGRKDSTTASYELANSNLPSANEGLLSLISKFLYQGLSVTDMVALSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATCTCAAATAAAGTTTATGGAGTGATGATGATGATTTGTGTTTGGTTTGTTGTCATCAGCGAGAGCTTGTTTGAAACAGGGGAGCCCCCATTGAGATTGGATTA
CTATGCCAAAACTTGTCCCAATGTGTTGCAAGTTGTGAGGAAAGAAATGGAGTGTGCAGTGCTTTCTGAACCACGTAATGCAGCTTTTGTTGTTCGATTACACTTTCACG
ACTGCTTTGTTCAGGTGGGTGGGCCTTACTGGGAGGTTCCTCTTGGAAGAAAGGATTCCACAACTGCAAGTTATGAACTTGCAAACTCAAATCTTCCCTCTGCCAATGAG
GGGCTTCTCAGCCTCATCTCCAAGTTTCTTTATCAGGGTCTCTCTGTCACTGACATGGTAGCTCTATCAGGTATTACACTTTTGGTTTGA
mRNA sequenceShow/hide mRNA sequence
TTTGGTTTGCAAGATGCAAGCCAATAGTGTGTGAAAAGCCGCCTTGACGACAAAGCCATGACATGAAAACAGAGAGAAAAAGACAGAGAAGAAACTCAAAGTTGGTAATT
GGTATTAAGCTTTTTAAGGATCGATATCTTTGATTATTACTGCCTCCATCCACTTTCGTTTGGTGACTTGCTCTAATATTTTTATCCTCATACACCCAATGAGCTTTAAT
TCTCTAAATATTATTTTTTGGGTTTCGTTTGGGAAAGGGAAGGAAGACCCAACAACCAGACCAATATTAGTTGTTGCATCCTCTCTTGACTAAAACACTCAAGCGCAGCC
ATGCAAGAAAAGCAACTTCTTGTATAAAAATTTGATTCCCATTCCCATCTCTCCCACTTTCTAAAACCCTAAAAAAAGAAAGAAAGAAAAAGGAAAATGGGAATCTCAAA
TAAAGTTTATGGAGTGATGATGATGATTTGTGTTTGGTTTGTTGTCATCAGCGAGAGCTTGTTTGAAACAGGGGAGCCCCCATTGAGATTGGATTACTATGCCAAAACTT
GTCCCAATGTGTTGCAAGTTGTGAGGAAAGAAATGGAGTGTGCAGTGCTTTCTGAACCACGTAATGCAGCTTTTGTTGTTCGATTACACTTTCACGACTGCTTTGTTCAG
GTGGGTGGGCCTTACTGGGAGGTTCCTCTTGGAAGAAAGGATTCCACAACTGCAAGTTATGAACTTGCAAACTCAAATCTTCCCTCTGCCAATGAGGGGCTTCTCAGCCT
CATCTCCAAGTTTCTTTATCAGGGTCTCTCTGTCACTGACATGGTAGCTCTATCAGGTATTACACTTTTGGTTTGA
Protein sequenceShow/hide protein sequence
MGISNKVYGVMMMICVWFVVISESLFETGEPPLRLDYYAKTCPNVLQVVRKEMECAVLSEPRNAAFVVRLHFHDCFVQVGGPYWEVPLGRKDSTTASYELANSNLPSANE
GLLSLISKFLYQGLSVTDMVALSGITLLV