CuGenDBv2

Moc07g09740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g09740
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr7:7484143..7485129
RNA-Seq Expression: Moc07g09740
Synteny: Moc07g09740
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
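
The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are illustrative helper names, not part of any CuGenDB tooling.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:7484143..7485129")
print(chrom, start, end, span_length(start, end))  # chr7 7484143 7485129 987
```

Under that assumption the locus spans 987 bp, the same length as the 328-residue protein listed under Sequences plus a stop codon (329 × 3 = 987).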


Homology
GenBank top hits (e value, % identity, alignment)
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia] (e value: 5.3e-93; % identity: 55.62)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N   +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA

Query:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV
           ++ +S + N   A A FE KPMM QML  IG F G EHEDP  +LKSFI++ N FRLPGI+DDALRLTLFPFSL  QA AWL+AFP+ +I T   +V
Subjt:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E ++ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQK
        WCS++ R   K+ DPA VLALD  TSMQK
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQK

XP_022155016.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 [Momordica charantia] (e value: 1.1e-117; % identity: 70.82)
Query:  EQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKP
        E+R   Q  ++K ++ E   ESE EST T M +IPP++  DPP VNGNM                            AFQN DS I+NPI   ANFELKP
Subjt:  EQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKP

Query:  MMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR
        +MFQMLQT+G FGG+EH+DPHD+LK+F QI  AFR P ITDDALRLTLFPFSL+D+AR WL+ FP GSITTW SLVEKFLTK+FPPTRHADI EEI++FR
Subjt:  MMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR

Query:  QYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA
        QYDREP+HEAWERFKEL+RKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDI 
Subjt:  QYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA

Query:  TSMQK
        TSMQK
Subjt:  TSMQK

XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia] (e value: 2.2e-91; % identity: 64.43)
Query:  ARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFS
        A N   N   +AD +D AMR+YAAT  ++ +S ++NP+ A A FE KPMM QML  I  FGG EHEDP  +LKSFI++ N  RLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFS

Query:  LEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        L  QA AWL+AFP+G+ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E ++ AWE FK+LIR CPN G+PAC+QIEHFFRG D PTKMMLN AAN
Subjt:  LEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQK
        G FT K+FNEIV+IL+ L+ HN+ W S+RSR   K+ DPAGVLALD  TSMQK
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQK

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] (e value: 2.1e-174; % identity: 93.9)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTST MADIPPRD VDPPAVNGNMRDHARNDEFN+ QMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAAT

Query:  AFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVE
        AFQNFDS IVNPI AH NFELKPMMFQMLQTIGHFGGQEHEDPHD+LKSFIQI NAFRLPGITDDA  LTLFPFSL+DQAR  L+AFP GSITTWGSLVE
Subjt:  AFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREP+HEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQK
        CSQRSRAAPKKQDPAGVLALDIATSMQK
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQK

XP_022159074.1 uncharacterized protein LOC111025516 [Momordica charantia] (e value: 1.5e-95; % identity: 56.84)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA
        MST SFLLP +PEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N   MAD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA

Query:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV
        T  ++ +S ++NP+   A FE KPMM QML TIG FGG EHEDP  +LKSFI++ N FRLPGI+DDALRLTLF FSL  QA AWL+AF + +ITTW  +V
Subjt:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ   E ++ AWERFK+LIR CPN  + AC+QIEHFFRG D PTKMMLN AANG FT K+FN+IV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQK
        WCS+RSR   K+ DP GV ALD  TSMQK
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CPJ3 uncharacterized protein LOC111012947 (e value: 2.6e-93; % identity: 55.62)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N   +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA

Query:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV
           ++ +S + N   A A FE KPMM QML  IG F G EHEDP  +LKSFI++ N FRLPGI+DDALRLTLFPFSL  QA AWL+AFP+ +I T   +V
Subjt:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E ++ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQK
        WCS++ R   K+ DPA VLALD  TSMQK
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQK

A0A6J1DQF5 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 (e value: 5.2e-118; % identity: 70.82)
Query:  EQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKP
        E+R   Q  ++K ++ E   ESE EST T M +IPP++  DPP VNGNM                            AFQN DS I+NPI   ANFELKP
Subjt:  EQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKP

Query:  MMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR
        +MFQMLQT+G FGG+EH+DPHD+LK+F QI  AFR P ITDDALRLTLFPFSL+D+AR WL+ FP GSITTW SLVEKFLTK+FPPTRHADI EEI++FR
Subjt:  MMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR

Query:  QYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA
        QYDREP+HEAWERFKEL+RKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDI 
Subjt:  QYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA

Query:  TSMQK
        TSMQK
Subjt:  TSMQK

A0A6J1DSZ5 uncharacterized protein LOC111024107 (e value: 1.1e-91; % identity: 64.43)
Query:  ARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFS
        A N   N   +AD +D AMR+YAAT  ++ +S ++NP+ A A FE KPMM QML  I  FGG EHEDP  +LKSFI++ N  RLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFS

Query:  LEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        L  QA AWL+AFP+G+ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E ++ AWE FK+LIR CPN G+PAC+QIEHFFRG D PTKMMLN AAN
Subjt:  LEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQK
        G FT K+FNEIV+IL+ L+ HN+ W S+RSR   K+ DPAGVLALD  TSMQK
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQK

A0A6J1DW02 uncharacterized protein LOC111024897 (e value: 1.0e-174; % identity: 93.9)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTST MADIPPRD VDPPAVNGNMRDHARNDEFN+ QMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAAT

Query:  AFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVE
        AFQNFDS IVNPI AH NFELKPMMFQMLQTIGHFGGQEHEDPHD+LKSFIQI NAFRLPGITDDA  LTLFPFSL+DQAR  L+AFP GSITTWGSLVE
Subjt:  AFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREP+HEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQK
        CSQRSRAAPKKQDPAGVLALDIATSMQK
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQK

A0A6J1E1A9 uncharacterized protein LOC111025516 (e value: 7.3e-96; % identity: 56.84)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA
        MST SFLLP +PEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N   MAD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAA

Query:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV
        T  ++ +S ++NP+   A FE KPMM QML TIG FGG EHEDP  +LKSFI++ N FRLPGI+DDALRLTLF FSL  QA AWL+AF + +ITTW  +V
Subjt:  TAFQNFDSRIVNPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ   E ++ AWERFK+LIR CPN  + AC+QIEHFFRG D PTKMMLN AANG FT K+FN+IV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQK
        WCS+RSR   K+ DP GV ALD  TSMQK
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
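
The % identity reported for each hit can be approximated from an alignment block by counting midline positions that repeat the query residue and dividing by the number of aligned columns. This is a rough sketch under that assumption, not the search tool's own calculation; `percent_identity` is a hypothetical helper, and the example uses only the first nine columns of the first GenBank alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Share of alignment columns whose midline repeats the query residue.

    ASSUMPTION: a letter in the midline marks an identical residue, while
    '+' (similar residue) and ' ' (mismatch or gap) do not count.
    """
    # Trailing spaces of the midline are often lost in copied text, so pad it.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# First nine columns of the XP_022142953.1 alignment.
print(round(percent_identity("MSTRSFLLP", "MST S+LLP"), 2))  # 77.78
```

For a full hit, the query and midline segments of all blocks would need to be concatenated first, and the result may still differ slightly from the reported value depending on how gaps are counted.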

Sequences
CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTAGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATTAATGGCAGATATTCCACCTCGTGATCTGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCACGCAAGAAATGATGAATTCAACCATACCCAAATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCATTTCAGAACTTTGATTCAAGGATAGTC
AACCCTATTCTAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATTATCT
GAAATCATTCATTCAAATTACAAATGCATTTCGATTACCTGGTATAACAGACGATGCTCTTAGACTAACACTTTTTCCATTTTCTTTGGAGGACCAAGCTAGAGCATGGC
TCGATGCATTTCCAACTGGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATTAGAGAGGAGATCATC
TCCTTTAGACAGTATGATCGTGAACCTATTCACGAGGCGTGGGAAAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACA
TTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAG
CTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCACTGGACATTGCGACCTCGATGCAAAAATAG
mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTAGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATTAATGGCAGATATTCCACCTCGTGATCTGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCACGCAAGAAATGATGAATTCAACCATACCCAAATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCATTTCAGAACTTTGATTCAAGGATAGTC
AACCCTATTCTAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATTATCT
GAAATCATTCATTCAAATTACAAATGCATTTCGATTACCTGGTATAACAGACGATGCTCTTAGACTAACACTTTTTCCATTTTCTTTGGAGGACCAAGCTAGAGCATGGC
TCGATGCATTTCCAACTGGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATTAGAGAGGAGATCATC
TCCTTTAGACAGTATGATCGTGAACCTATTCACGAGGCGTGGGAAAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACA
TTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAG
CTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCACTGGACATTGCGACCTCGATGCAAAAATAG
Protein sequence
MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTLMADIPPRDLVDPPAVNGNMRDHARNDEFNHTQMADNRDVAMREYAATAFQNFDSRIV
NPILAHANFELKPMMFQMLQTIGHFGGQEHEDPHDYLKSFIQITNAFRLPGITDDALRLTLFPFSLEDQARAWLDAFPTGSITTWGSLVEKFLTKFFPPTRHADIREEII
SFRQYDREPIHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQK
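
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper rather than a database function, and the example input is only the first 39 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1 layout).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a wrapped sequence can be
    pasted in directly. Codons with letters other than A/C/G/T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 13 codons of the CDS sequence listed above.
print(translate("ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAG"))  # MSTRSFLLPLDPE
```

Passing the full CDS should reproduce the protein sequence, since the CDS ends in a TAG stop codon.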