; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0009598 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0009598
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrotransposon protein
Genome locationchr11:4103984..4104984
RNA-Seq ExpressionPay0009598
SyntenyPay0009598
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031678.1 retrotransposon protein [Cucumis melo var. makuwa]1.3e-10970.47Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        MASSSRA KH WTKEEEAKLV+CLVELVS+GGWRSDNGTFR GYL QLQRMMAEKL DTN+QGSPTIDCR                              
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFDIKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRNCTSGSK
                  KGLLHKSFPYY+D+SYVFG D   GA SETFAD GSNV NMFND VPLGDSH++DI T+YSQGVHMS DE+F IRAGQAS+RRN +SGSK
Subjt:  IIAERDLFDIKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRNCTSGSK

Query:  RKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILLQNNV
        RKRGS+ YE +EVI    EFGN+QLK I DWPKEKRA  VE+R +VVKQLQDIP+LRSQDRAKLMQILFRS+E IEGFLSIPTKLKLEYCNILL+NNV
Subjt:  RKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILLQNNV

KAA0035621.1 retrotransposon protein [Cucumis melo var. makuwa]2.0e-8652.15Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        M SSSR  KHTWTKEEEA LV+CLVELV++GGWRSDNGTFRPGYL QL RMMA K+  +N+  S TID R+K +K+++HA+ EMRGP+CS FGWN+E +C
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFD------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRN
        I+AE+++FD       KGLL+KSF +Y+++SYVFGKDR  G R+E+FAD+GSN P  + D V      D D P MYS G++MSPD++   R  + S+RRN
Subjt:  IIAERDLFD------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRN

Query:  CTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILL
         +SGSKRKR     ++ +++   +E+GNEQL  IA+WP  +R    + R ++V+ L+ IPEL   DR +LM+IL R+++ ++ FL +P  +K  YC+++L
Subjt:  CTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILL

Query:  QNN
        Q N
Subjt:  QNN

KAA0038122.1 retrotransposon protein [Cucumis melo var. makuwa]9.8e-8651.47Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        M SSSR  KHTWTKEEEA LV+CLVELV++GGWRSDNGTFRPGYL QL RMMA K+  +N+  S TID R+K +K+++HA+ EMRGP+CS FGWN+E +C
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFD------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFN----DTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQAS
        I+AE+++FD       KGLL+KSF +Y+++SYVFGKDR  G R+E+FAD+GSN P  ++    D +P     D D P MYS G++MSPD++   R  + S
Subjt:  IIAERDLFD------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFN----DTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQAS

Query:  DRRNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYC
        +RRN +SGSKRKR     ++ +++   +E+GNEQL  IA+WP  +R    + R ++V+ L+ IPEL   DR +LM+IL R+++ ++ FL +P  +K  YC
Subjt:  DRRNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYC

Query:  NILLQNN
        +++LQ N
Subjt:  NILLQNN

KAA0062747.1 retrotransposon protein [Cucumis melo var. makuwa]1.5e-8675Show/hide
Query:  HAIVEMRGPSCSAFGWNEEFQCIIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMY
        H  +EMRGPSCS FGWNEEFQCIIAERDLFD         KGLLHKSFPYY+D+SYVFGKDR  GARSETF DVGSNVPNMFNDT+PLGDSHDEDIPTMY
Subjt:  HAIVEMRGPSCSAFGWNEEFQCIIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMY

Query:  SQGVHMSPDEMFRIRAGQASDRRNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFR
        SQGVH+SPDEMF IRA                         EVI  VMEFGNEQLKAIADW KEKRA+ +EMRAQVVKQLQDIPELRSQ R KLMQILFR
Subjt:  SQGVHMSPDEMFRIRAGQASDRRNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFR

Query:  SLEVIEGFLSIPTKLKLEYCNILLQNNV
        SLE I GFLSIPT+LKLEYCNILLQNNV
Subjt:  SLEVIEGFLSIPTKLKLEYCNILLQNNV

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo]7.1e-14585.29Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        MAS SRA KHTWTKEEE K V+CLVELVSSGGWRSDNGTF+PGYLAQLQRMMAEKL  TN+Q S TIDC VKSLKK YHAI EMRGPSCS FGWNEEFQC
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDR
        IIAERDLFD         KGLLHKSFPYY+D+SYVFGKDR  GARSETF +VGSNV NMFNDT+PLGDSHDEDIPTMYSQGVHMSPDEMF IRAGQAS+R
Subjt:  IIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDR

Query:  RNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNI
        RNC+S SKRKRGSERYET+EVI  VMEFGNEQLKAIADWPKEKRA+ VEMRAQVVKQLQDIP+LRSQDRAKLMQILFRSLE IEGFLSIPT+LKLEYCNI
Subjt:  RNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNI

Query:  LLQNNV
        LLQN V
Subjt:  LLQNNV

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859533.5e-14585.29Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        MAS SRA KHTWTKEEE K V+CLVELVSSGGWRSDNGTF+PGYLAQLQRMMAEKL  TN+Q S TIDC VKSLKK YHAI EMRGPSCS FGWNEEFQC
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDR
        IIAERDLFD         KGLLHKSFPYY+D+SYVFGKDR  GARSETF +VGSNV NMFNDT+PLGDSHDEDIPTMYSQGVHMSPDEMF IRAGQAS+R
Subjt:  IIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDR

Query:  RNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNI
        RNC+S SKRKRGSERYET+EVI  VMEFGNEQLKAIADWPKEKRA+ VEMRAQVVKQLQDIP+LRSQDRAKLMQILFRSLE IEGFLSIPT+LKLEYCNI
Subjt:  RNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNI

Query:  LLQNNV
        LLQN V
Subjt:  LLQNNV

A0A5A7U0H7 Retrotransposon protein3.5e-14585.29Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        MAS SRA KHTWTKEEE K V+CLVELVSSGGWRSDNGTF+PGYLAQLQRMMAEKL  TN+Q S TIDC VKSLKK YHAI EMRGPSCS FGWNEEFQC
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDR
        IIAERDLFD         KGLLHKSFPYY+D+SYVFGKDR  GARSETF +VGSNV NMFNDT+PLGDSHDEDIPTMYSQGVHMSPDEMF IRAGQAS+R
Subjt:  IIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDR

Query:  RNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNI
        RNC+S SKRKRGSERYET+EVI  VMEFGNEQLKAIADWPKEKRA+ VEMRAQVVKQLQDIP+LRSQDRAKLMQILFRSLE IEGFLSIPT+LKLEYCNI
Subjt:  RNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNI

Query:  LLQNNV
        LLQN V
Subjt:  LLQNNV

A0A5D3BZU3 Retrotransposon protein6.2e-11070.47Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        MASSSRA KH WTKEEEAKLV+CLVELVS+GGWRSDNGTFR GYL QLQRMMAEKL DTN+QGSPTIDCR                              
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFDIKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRNCTSGSK
                  KGLLHKSFPYY+D+SYVFG D   GA SETFAD GSNV NMFND VPLGDSH++DI T+YSQGVHMS DE+F IRAGQAS+RRN +SGSK
Subjt:  IIAERDLFDIKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRNCTSGSK

Query:  RKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILLQNNV
        RKRGS+ YE +EVI    EFGN+QLK I DWPKEKRA  VE+R +VVKQLQDIP+LRSQDRAKLMQILFRS+E IEGFLSIPTKLKLEYCNILL+NNV
Subjt:  RKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILLQNNV

A0A5D3DG22 Retrotransposon protein7.3e-8775Show/hide
Query:  HAIVEMRGPSCSAFGWNEEFQCIIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMY
        H  +EMRGPSCS FGWNEEFQCIIAERDLFD         KGLLHKSFPYY+D+SYVFGKDR  GARSETF DVGSNVPNMFNDT+PLGDSHDEDIPTMY
Subjt:  HAIVEMRGPSCSAFGWNEEFQCIIAERDLFD--------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMY

Query:  SQGVHMSPDEMFRIRAGQASDRRNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFR
        SQGVH+SPDEMF IRA                         EVI  VMEFGNEQLKAIADW KEKRA+ +EMRAQVVKQLQDIPELRSQ R KLMQILFR
Subjt:  SQGVHMSPDEMFRIRAGQASDRRNCTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFR

Query:  SLEVIEGFLSIPTKLKLEYCNILLQNNV
        SLE I GFLSIPT+LKLEYCNILLQNNV
Subjt:  SLEVIEGFLSIPTKLKLEYCNILLQNNV

A0A5D3DPR5 Retrotransposon protein9.6e-8752.15Show/hide
Query:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC
        M SSSR  KHTWTKEEEA LV+CLVELV++GGWRSDNGTFRPGYL QL RMMA K+  +N+  S TID R+K +K+++HA+ EMRGP+CS FGWN+E +C
Subjt:  MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQC

Query:  IIAERDLFD------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRN
        I+AE+++FD       KGLL+KSF +Y+++SYVFGKDR  G R+E+FAD+GSN P  + D V      D D P MYS G++MSPD++   R  + S+RRN
Subjt:  IIAERDLFD------IKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRN

Query:  CTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILL
         +SGSKRKR     ++ +++   +E+GNEQL  IA+WP  +R    + R ++V+ L+ IPEL   DR +LM+IL R+++ ++ FL +P  +K  YC+++L
Subjt:  CTSGSKRKRGSERYETIEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILL

Query:  QNN
        Q N
Subjt:  QNN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGTTCGTCACGAGCACTAAAACATACATGGACTAAAGAGGAAGAAGCAAAACTTGTTGACTGCTTGGTGGAATTGGTTTCATCTGGAGGATGGAGGTCC
GATAATGGGACATTTCGACCTGGGTACCTGGCCCAATTGCAACGAATGATGGCAGAGAAGTTGTCCGACACTAATGTCCAAGGATCACCAACAATAGATTGTCGT
GTGAAATCTCTTAAAAAAATCTACCATGCAATTGTAGAAATGAGGGGGCCATCATGTAGTGCCTTTGGATGGAATGAAGAATTCCAATGCATCATCGCAGAACGG
GATTTGTTTGATATCAAAGGCCTTCTACACAAATCATTTCCATATTATAATGATATGTCTTATGTCTTTGGCAAAGATCGGACAATGGGAGCACGTTCAGAGACC
TTTGCTGATGTAGGATCTAATGTGCCGAACATGTTTAATGACACAGTTCCCCTTGGTGATTCACATGATGAAGACATCCCGACGATGTATAGCCAAGGAGTGCAT
ATGTCACCAGATGAGATGTTTAGAATACGTGCAGGTCAAGCAAGTGATAGAAGAAATTGTACAAGTGGAAGTAAGAGGAAACGAGGTAGCGAACGTTATGAAACG
ATAGAGGTGATCATGCGTGTAATGGAATTTGGAAATGAGCAATTAAAGGCAATTGCAGATTGGCCAAAAGAGAAGCGTGCAATAGGGGTCGAAATGCGTGCTCAA
GTTGTGAAACAACTGCAAGATATCCCTGAACTACGAAGCCAAGATAGGGCAAAGCTTATGCAAATCCTATTCCGTAGTTTGGAGGTCATTGAGGGATTCTTGTCA
ATTCCAACAAAACTTAAATTGGAGTATTGCAATATTCTCTTGCAAAATAATGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGTTCGTCACGAGCACTAAAACATACATGGACTAAAGAGGAAGAAGCAAAACTTGTTGACTGCTTGGTGGAATTGGTTTCATCTGGAGGATGGAGGTCC
GATAATGGGACATTTCGACCTGGGTACCTGGCCCAATTGCAACGAATGATGGCAGAGAAGTTGTCCGACACTAATGTCCAAGGATCACCAACAATAGATTGTCGT
GTGAAATCTCTTAAAAAAATCTACCATGCAATTGTAGAAATGAGGGGGCCATCATGTAGTGCCTTTGGATGGAATGAAGAATTCCAATGCATCATCGCAGAACGG
GATTTGTTTGATATCAAAGGCCTTCTACACAAATCATTTCCATATTATAATGATATGTCTTATGTCTTTGGCAAAGATCGGACAATGGGAGCACGTTCAGAGACC
TTTGCTGATGTAGGATCTAATGTGCCGAACATGTTTAATGACACAGTTCCCCTTGGTGATTCACATGATGAAGACATCCCGACGATGTATAGCCAAGGAGTGCAT
ATGTCACCAGATGAGATGTTTAGAATACGTGCAGGTCAAGCAAGTGATAGAAGAAATTGTACAAGTGGAAGTAAGAGGAAACGAGGTAGCGAACGTTATGAAACG
ATAGAGGTGATCATGCGTGTAATGGAATTTGGAAATGAGCAATTAAAGGCAATTGCAGATTGGCCAAAAGAGAAGCGTGCAATAGGGGTCGAAATGCGTGCTCAA
GTTGTGAAACAACTGCAAGATATCCCTGAACTACGAAGCCAAGATAGGGCAAAGCTTATGCAAATCCTATTCCGTAGTTTGGAGGTCATTGAGGGATTCTTGTCA
ATTCCAACAAAACTTAAATTGGAGTATTGCAATATTCTCTTGCAAAATAATGTTTGA
Protein sequenceShow/hide protein sequence
MASSSRALKHTWTKEEEAKLVDCLVELVSSGGWRSDNGTFRPGYLAQLQRMMAEKLSDTNVQGSPTIDCRVKSLKKIYHAIVEMRGPSCSAFGWNEEFQCIIAER
DLFDIKGLLHKSFPYYNDMSYVFGKDRTMGARSETFADVGSNVPNMFNDTVPLGDSHDEDIPTMYSQGVHMSPDEMFRIRAGQASDRRNCTSGSKRKRGSERYET
IEVIMRVMEFGNEQLKAIADWPKEKRAIGVEMRAQVVKQLQDIPELRSQDRAKLMQILFRSLEVIEGFLSIPTKLKLEYCNILLQNNV