CuGenDBv2

MELO3C013302 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C013302
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: auxin-responsive protein SAUR68-like
Genome location: chr01:15145698..15146144
RNA-Seq Expression: MELO3C013302
Synteny: MELO3C013302
Gene Ontology terms: GO:0009733 - response to auxin (biological process)
InterPro domains: IPR003676 - Small auxin-up RNA
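
The Genome location field can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("chr01:15145698..15146144"))  # 447
```

Under that assumption the span is 447 bp, the same length as the CDS sequence listed in this record.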


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAE8645682.1 hypothetical protein Csa_020235 [Cucumis sativus] (e value: 3.0e-67, % identity: 91.03)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
        MVTP+TL+KLARKWQ VAVAGNG+RRISLPRTRSSSSV NKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL

Query:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMI
        I+RHVD+EVQQALVLSV PA+KC CDSS FSS APVAEN RPVMI
Subjt:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMI

KAG6589034.1 Auxin-responsive protein SAUR66, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.1e-58, % identity: 79.87)
Query:  MVT-PKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY
        MVT PKTL+K+ARKWQ VA A   +R+ISLPR+RSSS+V +KGHFVVYTVDQKRCVLP+RYLG+YVL+ELLKMSEEEFGLPADGPIKLPCEAAFMEY VY
Subjt:  MVT-PKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY

Query:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        LI+RHVDLEV++ALVLSVAPA K SC S+LF S AP AE+GRPVMI GF
Subjt:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

XP_004144931.3 auxin-responsive protein SAUR68 [Cucumis sativus] (e value: 1.1e-69, % identity: 91.22)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
        MVTP+TL+KLARKWQ VAVAGNG+RRISLPRTRSSSSV NKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL

Query:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        I+RHVD+EVQQALVLSV PA+KC CDSS FSS APVAEN RPVMICGF
Subjt:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

XP_016900470.1 PREDICTED: auxin-responsive protein SAUR68-like [Cucumis melo] (e value: 2.7e-76, % identity: 100)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
        MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL

Query:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
Subjt:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

XP_023529808.1 auxin-responsive protein SAUR68-like [Cucurbita pepo subsp. pepo] (e value: 1.1e-58, % identity: 79.87)
Query:  MVT-PKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY
        MVT PKTL+K+ARKWQ VA A   +R+ISLPR+RSSS+V +KGHFVVYTVDQKRCVLP+RYLG+YVL+ELLKMSEEEFGLPADGPIKLPCEAAFMEY VY
Subjt:  MVT-PKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY

Query:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        LI+RHVDLEV++ALVLSVAPA K SC S+LF S AP AE+GRPVMI GF
Subjt:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0K2P5 Uncharacterized protein (e value: 5.3e-70, % identity: 91.22)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
        MVTP+TL+KLARKWQ VAVAGNG+RRISLPRTRSSSSV NKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL

Query:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        I+RHVD+EVQQALVLSV PA+KC CDSS FSS APVAEN RPVMICGF
Subjt:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

A0A1S4DXM8 auxin-responsive protein SAUR68-like (e value: 1.3e-76, % identity: 100)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
        MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL

Query:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
Subjt:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

A0A5D3DI83 Auxin-responsive protein SAUR68-like (e value: 1.3e-76, % identity: 100)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
        MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYL

Query:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
Subjt:  IQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

A0A6J1ERS1 auxin-responsive protein SAUR68-like (e value: 3.5e-58, % identity: 80.82)
Query:  MVT-PKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY
        MVT PKTL+K+ARKWQ VA A   +R+ISLPR+RSSS+V +KGHFVVYTVDQKRCVLP+RYLG+YVL+ELLKMSEEEFGLPADGPIKLPCEAAFMEY VY
Subjt:  MVT-PKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY

Query:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMI
        LI+RHVDLEV++ALVLSVAPA K SC SSLF S AP AE+GRPVMI
Subjt:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMI

A0A6J1JK75 auxin-responsive protein SAUR68-like (e value: 2.7e-58, % identity: 79.19)
Query:  MVTP-KTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY
        MVTP KTL+K+ARKWQ VA A   +R+ISLPR+RSSS+V +KGHFVVYTVDQKRCVLP+ YLG+YVL+ELLKMSEEEFGLPADGPIKLPCEAAFMEY VY
Subjt:  MVTP-KTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVY

Query:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
        LI+RHVDLEV++ALVLSVAPA K SC S+LF S AP AE+GRPVMI GF
Subjt:  LIQRHVDLEVQQALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF

SwissProt top hits (columns: hit, e value, % identity, alignment)
F4I1H5 Auxin-responsive protein SAUR63 (e value: 4.5e-26, % identity: 51.97)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRT---RSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI
        M+  K L+K+A+KWQ  A     ++RIS  R+    SSSS V KG FVVYT D+ R   PI YL N V++ELLK+SEE+FGLP +GPI LP ++AF+EY+
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRT---RSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI

Query:  VYLIQRHVDLEVQQALVLSVAPAMKCS
        V LIQR +D + ++AL+LS++ A +CS
Subjt:  VYLIQRHVDLEVQQALVLSVAPAMKCS

F4I1I4 Auxin-responsive protein SAUR67 (e value: 1.5e-26, % identity: 50)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTR---SSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI
        M+  K L+K+A+KWQ  A     ++RIS  R+    SSSS V KG FVVYT D+ R   PI YL N +++ELLK+SEEEFGLP +GPI LP ++ F+EY+
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTR---SSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI

Query:  VYLIQRHVDLEVQQALVLSVAPAMKCSCDSSL
        + LIQR +D + ++AL++S++ A KCS   SL
Subjt:  VYLIQRHVDLEVQQALVLSVAPAMKCSCDSSL

Q0V7Z5 Auxin-responsive protein SAUR64 (e value: 1.9e-24, % identity: 46.56)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRS--SSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIV
        M+  K L+K+A+KWQ  A     ++RIS  R+ S  SS+   KG FVVYT D  R   P+ YL N V +ELLK+SEEEFGLP  GPI  P ++ F+EY++
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRS--SSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIV

Query:  YLIQRHVDLEVQQALVLSVAPAMKCSCDSSL
         L+QR +D + ++AL++S++ A +CS   SL
Subjt:  YLIQRHVDLEVQQALVLSVAPAMKCSCDSSL

Q6NMM0 Auxin-responsive protein SAUR61 (e value: 5.5e-24, % identity: 45.32)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR---TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI
        M+  K L+K+A+KWQ  A     ++RIS  R   T SSS+   KG FVVYT D+ R   PI YL N V++ELLK+SEEEFG+P +GPI LP ++ F+EY+
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR---TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI

Query:  VYLIQRHVDLEVQQALVLSVAP---AMKCSCDSSLFSST
        + L+QR +D + ++AL+ S++    ++ CS      SST
Subjt:  VYLIQRHVDLEVQQALVLSVAP---AMKCSCDSSLFSST

Q9C7Q8 Auxin-responsive protein SAUR62 (e value: 2.2e-25, % identity: 50.41)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR---TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI
        M+  K L+KLA+KWQ  A     ++RIS  R   T SS + V KG FVVYT D+ R   P+ YL N +++ELLK+SEEEFGLP +GPI LP ++AF+EY+
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR---TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI

Query:  VYLIQRHVDLEVQQALVLSVAPA
        + LIQR +D + ++AL+LS++ A
Subjt:  VYLIQRHVDLEVQQALVLSVAPA

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G29430.1 SAUR-like auxin-responsive protein family (e value: 1.6e-26, % identity: 50.41)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR---TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI
        M+  K L+KLA+KWQ  A     ++RIS  R   T SS + V KG FVVYT D+ R   P+ YL N +++ELLK+SEEEFGLP +GPI LP ++AF+EY+
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR---TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI

Query:  VYLIQRHVDLEVQQALVLSVAPA
        + LIQR +D + ++AL+LS++ A
Subjt:  VYLIQRHVDLEVQQALVLSVAPA

AT1G29440.1 SAUR-like auxin-responsive protein family (e value: 3.2e-27, % identity: 51.97)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRT---RSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI
        M+  K L+K+A+KWQ  A     ++RIS  R+    SSSS V KG FVVYT D+ R   PI YL N V++ELLK+SEE+FGLP +GPI LP ++AF+EY+
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRT---RSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI

Query:  VYLIQRHVDLEVQQALVLSVAPAMKCS
        V LIQR +D + ++AL+LS++ A +CS
Subjt:  VYLIQRHVDLEVQQALVLSVAPAMKCS

AT1G29450.1 SAUR-like auxin-responsive protein family (e value: 1.3e-25, % identity: 46.56)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRS--SSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIV
        M+  K L+K+A+KWQ  A     ++RIS  R+ S  SS+   KG FVVYT D  R   P+ YL N V +ELLK+SEEEFGLP  GPI  P ++ F+EY++
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRS--SSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIV

Query:  YLIQRHVDLEVQQALVLSVAPAMKCSCDSSL
         L+QR +D + ++AL++S++ A +CS   SL
Subjt:  YLIQRHVDLEVQQALVLSVAPAMKCSCDSSL

AT1G29510.1 SAUR-like auxin-responsive protein family (e value: 1.1e-27, % identity: 50)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTR---SSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI
        M+  K L+K+A+KWQ  A     ++RIS  R+    SSSS V KG FVVYT D+ R   PI YL N +++ELLK+SEEEFGLP +GPI LP ++ F+EY+
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTR---SSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYI

Query:  VYLIQRHVDLEVQQALVLSVAPAMKCSCDSSL
        + LIQR +D + ++AL++S++ A KCS   SL
Subjt:  VYLIQRHVDLEVQQALVLSVAPAMKCSCDSSL

AT5G27780.1 SAUR-like auxin-responsive protein family (e value: 1.8e-25, % identity: 48.44)
Query:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR----TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEY
        M+  KTL+KLA+ WQ  A     ++RIS  R    T SS + V KG FVVYT D+ R   P+ YL N +++ELLK+SEEEFGLP +GPI LP ++ F+EY
Subjt:  MVTPKTLVKLARKWQTVAVAGNGQRRISLPR----TRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEY

Query:  IVYLIQRHVDLEVQQALVLSVAPAMKCS
        ++ LIQR +D + ++AL+ S++ A +CS
Subjt:  IVYLIQRHVDLEVQQALVLSVAPAMKCS
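
The % identity values listed with each hit can be checked against the alignment midline (the row between Query and Subjt). By BLAST convention that row shows a letter where the two residues are identical, "+" for a conservative substitution, and a blank otherwise. The Python sketch below assumes the reported figure is identical positions divided by alignment length, gap columns included, and uses the two midline rows of the AT5G27780.1 hit copied from this record; `percent_identity` is a hypothetical helper name.

```python
# Midline rows of the AT5G27780.1 alignment, copied from the record
# (leading indentation removed; internal spacing must be preserved).
MIDLINE_ROWS = [
    "M+  KTL+KLA+ WQ  A     ++RIS  R    T SS + V KG FVVYT D+ R   P+ YL N +++ELLK+SEEEFGLP +GPI LP ++ F+EY",
    "++ LIQR +D + ++AL+ S++ A +CS",
]


def percent_identity(rows):
    """Identities / alignment length * 100, read from the midline.

    ASSUMPTION: each midline row is as long as its Query row, so the
    joined midline length equals the alignment length (gaps included).
    """
    midline = "".join(rows)
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


print(round(percent_identity(MIDLINE_ROWS), 2))  # 48.44
```

A midline row that ends in blanks would lose them when copied, so a more careful version would take the alignment length from the Query rows instead.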


Sequences
CDS sequence
ATGGTAACTCCAAAAACCCTAGTTAAGCTAGCTCGTAAATGGCAGACGGTTGCCGTGGCCGGGAACGGGCAGCGGAGGATCTCACTGCCAAGAACAAGAAGCTCCTCCTC
TGTGGTAAACAAAGGTCATTTTGTGGTCTACACGGTGGACCAAAAACGGTGCGTTCTGCCCATAAGGTATTTGGGAAACTATGTTTTAAAGGAATTGTTGAAGATGTCGG
AGGAGGAGTTTGGGCTGCCCGCGGATGGACCGATAAAGCTGCCGTGTGAGGCGGCGTTTATGGAGTATATCGTGTATTTGATCCAACGACATGTTGACCTTGAAGTCCAG
CAAGCTCTGGTTTTGTCGGTTGCTCCAGCGATGAAATGCTCTTGTGATTCTAGTTTGTTTTCTTCGACGGCGCCGGTGGCTGAAAATGGCCGGCCGGTGATGATTTGTGG
GTTTTGA
mRNA sequence
ATGGTAACTCCAAAAACCCTAGTTAAGCTAGCTCGTAAATGGCAGACGGTTGCCGTGGCCGGGAACGGGCAGCGGAGGATCTCACTGCCAAGAACAAGAAGCTCCTCCTC
TGTGGTAAACAAAGGTCATTTTGTGGTCTACACGGTGGACCAAAAACGGTGCGTTCTGCCCATAAGGTATTTGGGAAACTATGTTTTAAAGGAATTGTTGAAGATGTCGG
AGGAGGAGTTTGGGCTGCCCGCGGATGGACCGATAAAGCTGCCGTGTGAGGCGGCGTTTATGGAGTATATCGTGTATTTGATCCAACGACATGTTGACCTTGAAGTCCAG
CAAGCTCTGGTTTTGTCGGTTGCTCCAGCGATGAAATGCTCTTGTGATTCTAGTTTGTTTTCTTCGACGGCGCCGGTGGCTGAAAATGGCCGGCCGGTGATGATTTGTGG
GTTTTGA
Protein sequence
MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYLIQRHVDLEVQ
QALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF
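
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `CODON_TABLE` and `translate` are hypothetical names introduced for illustration, and the sequences are copied from this record with line breaks removed.

```python
from itertools import product

# Standard genetic code in NCBI table 1 order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS sequence from the record, joined across its wrapped lines.
CDS = (
    "ATGGTAACTCCAAAAACCCTAGTTAAGCTAGCTCGTAAATGGCAGACGGTTGCCGTGGCCGGGAACGGGCAGCGGAGGATCTCACTGCCAAGAACAAGAAGCTCCTCCTC"
    "TGTGGTAAACAAAGGTCATTTTGTGGTCTACACGGTGGACCAAAAACGGTGCGTTCTGCCCATAAGGTATTTGGGAAACTATGTTTTAAAGGAATTGTTGAAGATGTCGG"
    "AGGAGGAGTTTGGGCTGCCCGCGGATGGACCGATAAAGCTGCCGTGTGAGGCGGCGTTTATGGAGTATATCGTGTATTTGATCCAACGACATGTTGACCTTGAAGTCCAG"
    "CAAGCTCTGGTTTTGTCGGTTGCTCCAGCGATGAAATGCTCTTGTGATTCTAGTTTGTTTTCTTCGACGGCGCCGGTGGCTGAAAATGGCCGGCCGGTGATGATTTGTGG"
    "GTTTTGA"
)

# Protein sequence from the record, joined across its wrapped lines.
PROTEIN = (
    "MVTPKTLVKLARKWQTVAVAGNGQRRISLPRTRSSSSVVNKGHFVVYTVDQKRCVLPIRYLGNYVLKELLKMSEEEFGLPADGPIKLPCEAAFMEYIVYLIQRHVDLEVQ"
    "QALVLSVAPAMKCSCDSSLFSSTAPVAENGRPVMICGF"
)


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(len(CDS), len(PROTEIN))     # 447 148
print(translate(CDS) == PROTEIN)  # True
```

The 447 nt CDS corresponds to 148 codons plus the terminal TGA stop codon.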