CuGenDBv2

Moc01g04250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g04250
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr1:2762828..2775663
RNA-Seq Expression: Moc01g04250
Synteny: Moc01g04250
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
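
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr1:2762828..2775663")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene region spans 12836 bp on chr1.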


Homology
GenBank top hits    e value    % identity    Alignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]    8.4e-54    53.28
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI+D+ALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDRE
        +KFL K+FPPTR+AD+REEIISFRQ + E
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDRE

XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]    4.7e-12    36.81
Query:  MQMGQLANELKSRPQGTFPGHTENLKREQDGKEQCKAVITRSGLSYDRPTIPDEGI--DVATLILASISN-----PHPEEKAKHVSSDEKGK-----QVV
        MQ+GQLANE+++RPQG+ P  TE  +R          V        D   +PD+ +  +V+  +   +SN     P P+   +    +   K     + +
Subjt:  MQMGQLANELKSRPQGTFPGHTENLKREQDGKEQCKAVITRSGLSYDRPTIPDEGI--DVATLILASISN-----PHPEEKAKHVSSDEKGK-----QVV

Query:  PCTTPHVDDLKQMPNYTKLLKDVIYRHKKLGEHETVALTKCSSD
            P V+ L+QMP Y K LKD+I R KKLGE+ETVALT+CSS+
Subjt:  PCTTPHVDDLKQMPNYTKLLKDVIYRHKKLGEHETVALTKCSSD

XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]    2.5e-50    64.71
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFS
        A N   N I +AD +D AMR+YAAT  ++ +S ++NP+PA A FE KPMM QML  I  FGG EHEDP  HLKSFI++AN  RLPGI+D+ALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDRE
        L  QA AWLNAFP G+ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDRE

XP_022155016.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 [Momordica charantia]    5.8e-63    60.83
Query:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKP
        E+R   Q  ++K ++ E   ESE EST T M +IPP++P DPP VNGNM                            AFQN DS I+NPIP  ANFELKP
Subjt:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKP

Query:  MMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR
        +MFQMLQT+G FGG+EH+DPHDHLK+F QIA AFR P ITD+ALRLTLFPFSLKD+AR WLN FPPGSITTW SLVEKFLTK+FPPTRHADI EEI++FR
Subjt:  MMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR

Query:  QYDREPLMQMGQLANEL
        QYDREP+ +  +   EL
Subjt:  QYDREPLMQMGQLANEL

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]    8.8e-120    92.5
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITD+A  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPLMQMGQLANEL
        KFLTKFFPPTRHADIREEIISFRQYDREP+ +  +   EL
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPLMQMGQLANEL

XP_022159074.1 uncharacterized protein LOC111025516 [Momordica charantia]    1.1e-56    55.02
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST SFLLP +PEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I MAD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
        T  ++ +S ++NP+P  A FE KPMM QML TIG FGG EHEDP  HLKSFI++AN FRLPGI+D+ALRLTLF FSL  QA AWLNAF   +ITTW  +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDRE
        +KFL K+FPPTR+AD+REEIISFRQ   E
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDRE

TrEMBL top hits    e value    % identity    Alignment
A0A6J1CPJ3 uncharacterized protein LOC111012947    4.1e-54    53.28
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI+D+ALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDRE
        +KFL K+FPPTR+AD+REEIISFRQ + E
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDRE

A0A6J1CPJ3 uncharacterized protein LOC111012947    2.3e-12    36.81
Query:  MQMGQLANELKSRPQGTFPGHTENLKREQDGKEQCKAVITRSGLSYDRPTIPDEGI--DVATLILASISN-----PHPEEKAKHVSSDEKGK-----QVV
        MQ+GQLANE+++RPQG+ P  TE  +R          V        D   +PD+ +  +V+  +   +SN     P P+   +    +   K     + +
Subjt:  MQMGQLANELKSRPQGTFPGHTENLKREQDGKEQCKAVITRSGLSYDRPTIPDEGI--DVATLILASISN-----PHPEEKAKHVSSDEKGK-----QVV

Query:  PCTTPHVDDLKQMPNYTKLLKDVIYRHKKLGEHETVALTKCSSD
            P V+ L+QMP Y K LKD+I R KKLGE+ETVALT+CSS+
Subjt:  PCTTPHVDDLKQMPNYTKLLKDVIYRHKKLGEHETVALTKCSSD

A0A6J1CPJ3 uncharacterized protein LOC111012947    1.2e-50    64.71
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFS
        A N   N I +AD +D AMR+YAAT  ++ +S ++NP+PA A FE KPMM QML  I  FGG EHEDP  HLKSFI++AN  RLPGI+D+ALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDRE
        L  QA AWLNAFP G+ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDRE

A0A6J1DQF5 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160    2.8e-63    60.83
Query:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKP
        E+R   Q  ++K ++ E   ESE EST T M +IPP++P DPP VNGNM                            AFQN DS I+NPIP  ANFELKP
Subjt:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKP

Query:  MMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR
        +MFQMLQT+G FGG+EH+DPHDHLK+F QIA AFR P ITD+ALRLTLFPFSLKD+AR WLN FPPGSITTW SLVEKFLTK+FPPTRHADI EEI++FR
Subjt:  MMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFR

Query:  QYDREPLMQMGQLANEL
        QYDREP+ +  +   EL
Subjt:  QYDREPLMQMGQLANEL

A0A6J1DW02 uncharacterized protein LOC111024897    4.3e-120    92.5
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITD+A  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPLMQMGQLANEL
        KFLTKFFPPTRHADIREEIISFRQYDREP+ +  +   EL
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPLMQMGQLANEL

A0A6J1E1A9 uncharacterized protein LOC111025516    5.2e-57    55.02
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST SFLLP +PEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I MAD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
        T  ++ +S ++NP+P  A FE KPMM QML TIG FGG EHEDP  HLKSFI++AN FRLPGI+D+ALRLTLF FSL  QA AWLNAF   +ITTW  +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDNALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDRE
        +KFL K+FPPTR+AD+REEIISFRQ   E
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDRE

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACAAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAA
TGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGTATAACAGAC
AATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAA
GTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCACTTATGCAAATGGGGCAATTAGCGA
ATGAACTGAAGTCTAGACCACAAGGTACATTTCCAGGACACACCGAGAATCTTAAGCGAGAACAAGATGGTAAAGAACAATGTAAGGCAGTCATCACGAGAAGCGGACTG
AGTTACGACAGACCCACAATTCCAGATGAAGGAATCGATGTAGCTACACTTATTCTTGCATCCATTTCTAATCCACATCCAGAAGAGAAAGCCAAACATGTGAGTTCAGA
TGAGAAAGGTAAGCAAGTAGTGCCTTGCACTACTCCGCATGTAGATGATTTAAAGCAGATGCCTAATTACACCAAGCTTTTAAAAGATGTCATTTATAGGCATAAGAAAT
TAGGCGAGCATGAGACGGTAGCCTTAACAAAGTGTAGTAGTGATGCTTTAGGGAAGCCATTGTCTGTCAAATATGAAACCTTATGCTATAGGGAGGTACCCATTAAGATC
TTAGCAAGAGAGACCAAAGTGTTGCGGAATCGGGCAATCGACTTGGTGAAGGTCTTGTGGATGAATCACCAAGTGGCGGAAGCTACCTGGGAAAGGGAAGACGAGATCAG
AGCCCGATATCCTGAGTTGTTCGATCAACGAACTTTCGAGGACGAAACAAGCCCAACGGCCGGCGACCTCGACAGCGGCTCTCAACTCCGGCGAACAGCAGCAGGTGGCG
CGGCTCGGACAGCAGCAAGCGTGTGGCTCCGGGCAGCAGCGGCGCGTGGTGACGAACTGCATCGGACGCGCGACTTTTGGCAGCAACGAGTGTGCACGCGGTATCGGGCA
GCAGTGCGGCAGCGGCAGTGCGGCGGCGAAGCGGTTTTGGGGCGTTTCCGGCGGCGCATCGTCCCCCTCGCGAGTGGGTTCGATCTTCCTTGGTGGTTAGAAATCGCGGC
GCAGGTCCTTGCTGTACATAGCATAGTGGGTATCGATTTGGTGGTCTTGAAGGGGTTAGTGGGAGTTGTCAAAATGAATGGGACGGTGGATGTACCGGTGTTGTGGCCTC
GTGGGCAACGAACGAGGCTGCTAGGTCCATACCGGATGTCGAGGCTCCGAGGGTCGTGTAGTCGAGGTCCTGGGAGTCCTGGTGTATGGATGCTTATATCAGAATGTGCG
AATAGGGCCCCCGACCGTGGTGATGACGTGGAGGAGCTCGGAAGGCCGAGATGGCCGCCACCAACGCCGACAAATGCCAAGTGTGAGGGCCGAGGTGAGCCTGGCCAGGT
CCGACCCACCGGGGAGCTCGGTCCGCCCAGGTGGTCAGGTCGGTCCGGAGGCCGGGTTCGAGCTACAACCAGGAACACACTGTTGTGCAAATCTTTGCATAAACATTTGG
CGCCGTCTGTGGGGAACGACTGA
mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACAAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAA
TGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGTATAACAGAC
AATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAA
GTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCACTTATGCAAATGGGGCAATTAGCGA
ATGAACTGAAGTCTAGACCACAAGGTACATTTCCAGGACACACCGAGAATCTTAAGCGAGAACAAGATGGTAAAGAACAATGTAAGGCAGTCATCACGAGAAGCGGACTG
AGTTACGACAGACCCACAATTCCAGATGAAGGAATCGATGTAGCTACACTTATTCTTGCATCCATTTCTAATCCACATCCAGAAGAGAAAGCCAAACATGTGAGTTCAGA
TGAGAAAGGTAAGCAAGTAGTGCCTTGCACTACTCCGCATGTAGATGATTTAAAGCAGATGCCTAATTACACCAAGCTTTTAAAAGATGTCATTTATAGGCATAAGAAAT
TAGGCGAGCATGAGACGGTAGCCTTAACAAAGTGTAGTAGTGATGCTTTAGGGAAGCCATTGTCTGTCAAATATGAAACCTTATGCTATAGGGAGGTACCCATTAAGATC
TTAGCAAGAGAGACCAAAGTGTTGCGGAATCGGGCAATCGACTTGGTGAAGGTCTTGTGGATGAATCACCAAGTGGCGGAAGCTACCTGGGAAAGGGAAGACGAGATCAG
AGCCCGATATCCTGAGTTGTTCGATCAACGAACTTTCGAGGACGAAACAAGCCCAACGGCCGGCGACCTCGACAGCGGCTCTCAACTCCGGCGAACAGCAGCAGGTGGCG
CGGCTCGGACAGCAGCAAGCGTGTGGCTCCGGGCAGCAGCGGCGCGTGGTGACGAACTGCATCGGACGCGCGACTTTTGGCAGCAACGAGTGTGCACGCGGTATCGGGCA
GCAGTGCGGCAGCGGCAGTGCGGCGGCGAAGCGGTTTTGGGGCGTTTCCGGCGGCGCATCGTCCCCCTCGCGAGTGGGTTCGATCTTCCTTGGTGGTTAGAAATCGCGGC
GCAGGTCCTTGCTGTACATAGCATAGTGGGTATCGATTTGGTGGTCTTGAAGGGGTTAGTGGGAGTTGTCAAAATGAATGGGACGGTGGATGTACCGGTGTTGTGGCCTC
GTGGGCAACGAACGAGGCTGCTAGGTCCATACCGGATGTCGAGGCTCCGAGGGTCGTGTAGTCGAGGTCCTGGGAGTCCTGGTGTATGGATGCTTATATCAGAATGTGCG
AATAGGGCCCCCGACCGTGGTGATGACGTGGAGGAGCTCGGAAGGCCGAGATGGCCGCCACCAACGCCGACAAATGCCAAGTGTGAGGGCCGAGGTGAGCCTGGCCAGGT
CCGACCCACCGGGGAGCTCGGTCCGCCCAGGTGGTCAGGTCGGTCCGGAGGCCGGGTTCGAGCTACAACCAGGAACACACTGTTGTGCAAATCTTTGCATAAACATTTGG
CGCCGTCTGTGGGGAACGACTGA
Protein sequence
MGGARRLGSLQKNRFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSM
ADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITD
NALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPLMQMGQLANELKSRPQGTFPGHTENLKREQDGKEQCKAVITRSGL
SYDRPTIPDEGIDVATLILASISNPHPEEKAKHVSSDEKGKQVVPCTTPHVDDLKQMPNYTKLLKDVIYRHKKLGEHETVALTKCSSDALGKPLSVKYETLCYREVPIKI
LARETKVLRNRAIDLVKVLWMNHQVAEATWEREDEIRARYPELFDQRTFEDETSPTAGDLDSGSQLRRTAAGGAARTAASVWLRAAAARGDELHRTRDFWQQRVCTRYRA
AVRQRQCGGEAVLGRFRRRIVPLASGFDLPWWLEIAAQVLAVHSIVGIDLVVLKGLVGVVKMNGTVDVPVLWPRGQRTRLLGPYRMSRLRGSCSRGPGSPGVWMLISECA
NRAPDRGDDVEELGRPRWPPPTPTNAKCEGRGEPGQVRPTGELGPPRWSGRSGGRVRATTRNTLLCKSLHKHLAPSVGND
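
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch of that relationship, the snippet below translates a nucleotide string with the standard genetic code and stops at the first stop codon. The helper name `translate` is hypothetical, and using the standard (nuclear) codon table is an assumption, since the record does not name the table. Only the first 30 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGGAGGCGCCAGGCGCCTGGGAAGCCTG"  # first 30 nt of the CDS
cds_end = "GGGAACGACTGA"                      # last 12 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGGARRLGSL` and `GND`, matching the first ten and last three residues of the listed protein; the final `TGA` is a stop codon and yields no residue.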