CuGenDBv2

Sed0021485 (gene) of Chayote v1 genome

Gene ID: Sed0021485
Organism: Sechium edule (Chayote v1)
Description: Ribosomal RNA small subunit methyltransferase G
Genome location: LG11:4374362..4377163
RNA-Seq Expression: Sed0021485
Synteny: Sed0021485
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus] | e value: 6.6e-82 | % identity: 65.89
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK
        M KARKG+A +S   ALFE S +GIKH  LLQDYEEL NETEA K+KLL A RKK  LL EVRFLRHRYE LK QP    PKV  + PRNLE+ PP  KK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA
        EK SR REASLKPL QA D+NQR  IYNG+EA+S+KS                +V  ++ FP FDQK+RV+RAHE AANRN+TPVFDLNQISRE EE+QA
Subjt:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA

Query:  GFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        GFKP+R E+EP+N+F R+EHDA N +L +SS+CRN  NGSNRAGKRKISWQDQVALRA
Subjt:  GFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo] | e value: 1.5e-81 | % identity: 67.18
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK
        M KARKG+A DS   ALFE S +GIKH  LLQDY+EL NETEA K+KLL A RKK+ LL EVRFLRHRYE LKNQP    PKV  +  RNLE+ PPI KK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA
        EK SR REASLKPL QA DLNQR  IYNG+EA+S+KS                +V  +N FP FDQK+RV+RAHE AANRN+TPVFDLNQISRE EEMQA
Subjt:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA

Query:  GFKPLR-AEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        GF+PLR  E+E +N+F R+EHDA N DL +SS+CRN  NGSNRAGKRKISWQDQVALRA
Subjt:  GFKPLR-AEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia] | e value: 1.1e-73 | % identity: 63.08
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP--TLPKVSLEPPRNLEIGPPITKK
        M KARK  AMD +  ALFE   IG KH RLLQDYE+L+N TE  KE+LL A RKKS LLAEVRFLRHRYEFLKNQ   + PK  LEPP+N EI PP  KK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP--TLPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPL--DQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEM
        EK S+ REASLK L   QA+DLNQR  IY+GMEA S+KS  +            N+V+ HN  PIF+ K+ ++R HE AA+RN+TPVFDLNQISRE EE+
Subjt:  EKKSRNREASLKPL--DQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEM

Query:  QAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        QAGF+P R EE P+N F+R+E+D  N DL IS +CRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus] | e value: 2.5e-65 | % identity: 65.42
Query:  KEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKKEKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI----
        K+KLL A RKK  LL EVRFLRHRYE LK QP    PKV  + PRNLE+ PP  KKEK SR REASLKPL QA D+NQR  IYNG+EA+S+KS       
Subjt:  KEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKKEKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI----

Query:  --------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAG
                 +V  ++ FP FDQK+RV+RAHE AANRN+TPVFDLNQISRE EE+QAGFKP+R E+EP+N+F R+EHDA N +L +SS+CRN  NGSNRAG
Subjt:  --------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAG

Query:  KRKISWQDQVALRA
        KRKISWQDQVALRA
Subjt:  KRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida] | e value: 8.6e-82 | % identity: 68.6
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP--TLPKVSLEPPRNLEIGPPITKK
        M KARKGLA+DSE  ALFE S IG+KH  LLQDYEEL+NETEA KEKLL A RKK  LLAEVRFLRHRYE LKN+P  T PKV+ E P +LEIGPPITKK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP--TLPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSH---QINQ---------VTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA
         K SR  EASLKPL +A DLNQR  IYNGMEA S+KS     INQ         VT  +  PIFDQK+RV+R HE   +RN+TPVFDLNQISRE EE+QA
Subjt:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSH---QINQ---------VTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA

Query:  GFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        GF+PLR E+E +N+F R+EHDA N DL +SS+CRN  NGSN AGKRKISWQDQVALRA
Subjt:  GFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LH29 Uncharacterized protein | e value: 3.2e-82 | % identity: 65.89
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK
        M KARKG+A +S   ALFE S +GIKH  LLQDYEEL NETEA K+KLL A RKK  LL EVRFLRHRYE LK QP    PKV  + PRNLE+ PP  KK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA
        EK SR REASLKPL QA D+NQR  IYNG+EA+S+KS                +V  ++ FP FDQK+RV+RAHE AANRN+TPVFDLNQISRE EE+QA
Subjt:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA

Query:  GFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        GFKP+R E+EP+N+F R+EHDA N +L +SS+CRN  NGSNRAGKRKISWQDQVALRA
Subjt:  GFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC103493683 | e value: 7.1e-82 | % identity: 67.18
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK
        M KARKG+A DS   ALFE S +GIKH  LLQDY+EL NETEA K+KLL A RKK+ LL EVRFLRHRYE LKNQP    PKV  +  RNLE+ PPI KK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA
        EK SR REASLKPL QA DLNQR  IYNG+EA+S+KS                +V  +N FP FDQK+RV+RAHE AANRN+TPVFDLNQISRE EEMQA
Subjt:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA

Query:  GFKPLR-AEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        GF+PLR  E+E +N+F R+EHDA N DL +SS+CRN  NGSNRAGKRKISWQDQVALRA
Subjt:  GFKPLR-AEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein | e value: 7.1e-82 | % identity: 67.18
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK
        M KARKG+A DS   ALFE S +GIKH  LLQDY+EL NETEA K+KLL A RKK+ LL EVRFLRHRYE LKNQP    PKV  +  RNLE+ PPI KK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPT--LPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA
        EK SR REASLKPL QA DLNQR  IYNG+EA+S+KS                +V  +N FP FDQK+RV+RAHE AANRN+TPVFDLNQISRE EEMQA
Subjt:  EKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQA

Query:  GFKPLR-AEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        GF+PLR  E+E +N+F R+EHDA N DL +SS+CRN  NGSNRAGKRKISWQDQVALRA
Subjt:  GFKPLR-AEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC111024696 | e value: 5.5e-74 | % identity: 63.08
Query:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP--TLPKVSLEPPRNLEIGPPITKK
        M KARK  AMD +  ALFE   IG KH RLLQDYE+L+N TE  KE+LL A RKKS LLAEVRFLRHRYEFLKNQ   + PK  LEPP+N EI PP  KK
Subjt:  MNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP--TLPKVSLEPPRNLEIGPPITKK

Query:  EKKSRNREASLKPL--DQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEM
        EK S+ REASLK L   QA+DLNQR  IY+GMEA S+KS  +            N+V+ HN  PIF+ K+ ++R HE AA+RN+TPVFDLNQISRE EE+
Subjt:  EKKSRNREASLKPL--DQARDLNQRRAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEM

Query:  QAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        QAGF+P R EE P+N F+R+E+D  N DL IS +CRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC111494012 | e value: 5.3e-61 | % identity: 58.82
Query:  IGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP-TLPKVSLEPPRNLEIGPPITKKEKKSRNREASLKPLDQARDLNQR
        I I H  LLQDY ELQNETEA KEKLL   +KKS LLAEVRFLRH+YE LKN P T PKV  + P+NL+I PP++KKE +SR RE        AR+LNQR
Subjt:  IGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQP-TLPKVSLEPPRNLEIGPPITKKEKKSRNREASLKPLDQARDLNQR

Query:  RAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQAGFKPLRAEEEP---RNLFVRTEH
          I +GMEAT++K+  +             +++  +YFPI  QK+RV+RAHEVA N N+TPVFDLNQISRE EE+Q GF+P+R E+E    +N+  R+E 
Subjt:  RAIYNGMEATSQKSHQI------------NQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQISREGEEMQAGFKPLRAEEEP---RNLFVRTEH

Query:  DAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA
        DA N DL +SS+CRN  NGSNRAGKRKISWQD+VALRA
Subjt:  DAMNVDLTISSICRN-SNGSNRAGKRKISWQDQVALRA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G30630.1 unknown protein | e value: 1.3e-11 | % identity: 30.63
Query:  ELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPTLPKVSLEPPRNLEIGP-PITKKEKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQK
        EL+ E E K+++L    +K+  L +EVRFLR RYE LK   TL + S E  R  E G   + +K    R +++ ++      DL  +  I N  EA +  
Subjt:  ELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPTLPKVSLEPPRNLEIGP-PITKKEKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQK

Query:  SHQINQVTTHNYFPIFDQKDRVHRAHEV---------------AANRNITPVFDLNQISREGEEMQAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSIC
            N V + +     D+K +  R  +V                +  +  P FDLNQISRE EE +   + + A E  +N  +      ++V+  +    
Subjt:  SHQINQVTTHNYFPIFDQKDRVHRAHEV---------------AANRNITPVFDLNQISREGEEMQAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSIC

Query:  RNSNGSNRAGKRKISWQDQVAL
              NRA KRK++WQD VAL
Subjt:  RNSNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein | e value: 2.2e-14 | % identity: 28.82
Query:  FEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPTLPKVSLEPPRNLEIGPPITKKEKKSRNREASLKPLDQARD
        FE  ++  +H  L+QDY EL  ETEA +++L     +K+ L+AEVRFLR RY  L+                                       DQ ++
Subjt:  FEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRKKSILLAEVRFLRHRYEFLKNQPTLPKVSLEPPRNLEIGPPITKKEKKSRNREASLKPLDQARD

Query:  LNQRRAIYNG--MEATSQKSHQINQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQIS----REGEEMQAGFKPLRAEEEPRNLFVRTEHDAMNV
        + + R    G  +      S++    T H   P  +  ++ H   + +  R + P+FDLNQIS    +E E +    +  R EE      +      M  
Subjt:  LNQRRAIYNG--MEATSQKSHQINQVTTHNYFPIFDQKDRVHRAHEVAANRNITPVFDLNQIS----REGEEMQAGFKPLRAEEEPRNLFVRTEHDAMNV

Query:  DLTISSICRNSNGSNRAGKRKISWQDQVA
             S CR  NG N + KRKISWQD VA
Subjt:  DLTISSICRNSNGSNRAGKRKISWQDQVA


Sequences
CDS sequence
ATGGCTACATTTCTCTCTCTTCTCTTTGGCTTTTTGTTTTTGGGGTCTCTTCTCTTCATCCTCATTGCCCATATTCCAAAACCCCCCATTCTTTTCGCGCCTTTTTTCTT
CTCTTTTCTGGGTCTTCCCCTTTTTCTGCAATCGCCCCTTTCCCCCCCTTTTGATCGGATGAACAAGGCTCGAAAAGGGTTGGCGATGGATTCTGAAACGGGTGCTTTGT
TTGAGGGATCGAGGATCGGAATCAAGCATTTGCGGCTGTTGCAGGATTACGAGGAGCTGCAGAATGAAACAGAAGCCAAGAAGGAGAAATTACTTGGCGCAAACAGGAAG
AAGTCGATCCTTTTGGCTGAAGTTCGATTTTTGAGGCATAGATATGAGTTCTTGAAGAACCAGCCAACCCTACCAAAGGTTTCTCTCGAGCCACCACGAAACCTTGAAAT
TGGACCTCCCATCACGAAGAAAGAAAAGAAATCTCGAAACCGAGAAGCTTCTTTGAAACCCCTTGATCAGGCTCGTGATTTAAACCAAAGGAGAGCAATCTACAATGGGA
TGGAAGCCACCTCTCAAAAATCTCATCAAATAAACCAAGTCACTACCCACAATTATTTTCCTATTTTTGATCAGAAAGATAGAGTACACAGGGCACATGAAGTTGCTGCA
AACAGAAACATTACCCCGGTTTTCGACCTCAATCAGATCTCGAGAGAGGGGGAAGAAATGCAGGCTGGTTTCAAACCACTGAGAGCGGAAGAGGAGCCGAGGAATCTCTT
TGTAAGAACCGAACACGATGCAATGAACGTCGACTTGACGATATCATCAATATGTAGAAACAGCAATGGCTCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATC
AAGTGGCTTTAAGAGCATAG
mRNA sequence
CTTAACAACACTTAACAGAGAATGAAAAACAAAAAGGAAAAGCAACATCAATCCCCTTTCACTTCAAAACAATTTCTCCCACAATTCACTGAATCTTGCACTCAAAATCA
AATACACCAACATCAAGAATCCTCTTCAATGGCCCATTGTTATGCCCAACAAATCCCAAAACCAAACAGAGTAAAATGGCTACATTTCTCTCTCTTCTCTTTGGCTTTTT
GTTTTTGGGGTCTCTTCTCTTCATCCTCATTGCCCATATTCCAAAACCCCCCATTCTTTTCGCGCCTTTTTTCTTCTCTTTTCTGGGTCTTCCCCTTTTTCTGCAATCGC
CCCTTTCCCCCCCTTTTGATCGGATGAACAAGGCTCGAAAAGGGTTGGCGATGGATTCTGAAACGGGTGCTTTGTTTGAGGGATCGAGGATCGGAATCAAGCATTTGCGG
CTGTTGCAGGATTACGAGGAGCTGCAGAATGAAACAGAAGCCAAGAAGGAGAAATTACTTGGCGCAAACAGGAAGAAGTCGATCCTTTTGGCTGAAGTTCGATTTTTGAG
GCATAGATATGAGTTCTTGAAGAACCAGCCAACCCTACCAAAGGTTTCTCTCGAGCCACCACGAAACCTTGAAATTGGACCTCCCATCACGAAGAAAGAAAAGAAATCTC
GAAACCGAGAAGCTTCTTTGAAACCCCTTGATCAGGCTCGTGATTTAAACCAAAGGAGAGCAATCTACAATGGGATGGAAGCCACCTCTCAAAAATCTCATCAAATAAAC
CAAGTCACTACCCACAATTATTTTCCTATTTTTGATCAGAAAGATAGAGTACACAGGGCACATGAAGTTGCTGCAAACAGAAACATTACCCCGGTTTTCGACCTCAATCA
GATCTCGAGAGAGGGGGAAGAAATGCAGGCTGGTTTCAAACCACTGAGAGCGGAAGAGGAGCCGAGGAATCTCTTTGTAAGAACCGAACACGATGCAATGAACGTCGACT
TGACGATATCATCAATATGTAGAAACAGCAATGGCTCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAGAGCATAGTTCTCTCTTATTGAA
TGAAAGAGCCAATGAGGATCCTTCATTTGCATAGGTGGGAAAACTTGGCTTACTTGAAATTTTATAATTATTATTATTATTTTATAGTTCACACTTGTGTGAAAATTGTG
ACCCTAAAACATTTGGCTATGAAAAGTTTAAAGAACGTTTTAGGTAAGCGATCATCATGTTGGAAACTTGTAAATATTTGGCTATAACTTTTTTTTTTTGTTTATATAGA
TTTCCA
Protein sequence
MATFLSLLFGFLFLGSLLFILIAHIPKPPILFAPFFFSFLGLPLFLQSPLSPPFDRMNKARKGLAMDSETGALFEGSRIGIKHLRLLQDYEELQNETEAKKEKLLGANRK
KSILLAEVRFLRHRYEFLKNQPTLPKVSLEPPRNLEIGPPITKKEKKSRNREASLKPLDQARDLNQRRAIYNGMEATSQKSHQINQVTTHNYFPIFDQKDRVHRAHEVAA
NRNITPVFDLNQISREGEEMQAGFKPLRAEEEPRNLFVRTEHDAMNVDLTISSICRNSNGSNRAGKRKISWQDQVALRA