; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g12960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g12960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:11013575..11019409
RNA-Seq ExpressionMoc09g12960
SyntenyMoc09g12960
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON60333.1 Zinc finger, CCHC-type [Parasponia andersonii]3.3e-6651.5Show/hide
Query:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG
        KM E  SVA HIN F+T+V++L +V++ F +E+ A++LL SLP SWEPM+AA+ NS GKEKL+F DV+D  L +E+RR D SG  ++S + LN+   DR 
Subjt:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG

Query:  RNNSRDYGK-HGKSRNNISKSRN-IRLECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE
           + + G+   KSRN  +KSR+  R+ECWNCGKT H+++NC+AP+K + +   AN IT+E+  AL L V+   ++WV++SGASFHTT  RD+LENY++ 
Subjt:  RNNSRDYGK-HGKSRNNISKSRN-IRLECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE

Query:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW
        NYGKVY ADGEPLDI+G+GD+ LK +NG +WKI+KVRHV  +M+NL              FT GSW
Subjt:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW

PON63051.1 Zinc finger, CCHC-type [Trema orientale]1.8e-6450.75Show/hide
Query:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG
        KM E  SVA HIN F+T+V++L +V++ F DE+ A++LL SLP SWE M+AA+ NS GK KL+F DV+D  L +E+RR D SG  ++S + LN+   DR 
Subjt:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG

Query:  RNNSRDYGK-HGKSRNNISKSRNIRL-ECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE
           + ++G+   KSRN   KSR+ R  ECWNCGKT H+++NC+AP+K + +   AN +T+E+  AL L V+   ++WV++SGASFHTT  R++LENYV+ 
Subjt:  RNNSRDYGK-HGKSRNNISKSRNIRL-ECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE

Query:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW
        NYGKVY ADGEPLDI+G+GD+ LK +NG +WKI+KVRHV  +M+NL              FT GSW
Subjt:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW

RVW67125.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.6e-6043.21Show/hide
Query:  GKISHYLKNRRSNLPYCRARVP---YCRVLHSKELEFSLDKKPADMEEAKWKKLDR-----------------KMDEDTSVAVHINGFDTLVNKLVAVDL
        GK S   K   ++  Y R ++    Y R LH       L  KP  M+  +W  LDR                 KM E+ SVA H+N F+T+ N+L +V++
Subjt:  GKISHYLKNRRSNLPYCRARVP---YCRVLHSKELEFSLDKKPADMEEAKWKKLDR-----------------KMDEDTSVAVHINGFDTLVNKLVAVDL

Query:  TFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKSRN---NISKSRN-IR
         F DE+ A+++L SLP+SWE M+ A+ NS GKEKLK+ D++D  L +EIRR+D +G  S SG+ LN++ RGR N+R+  +    SRN   N SKSR+  +
Subjt:  TFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKSRN---NISKSRN-IR

Query:  LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKA
        ++CWNCGKT H +R CK+PKK + ++  AN +TEE+  AL L V+   + WV++SGASFHTT  R+I++NYV+ ++GKVY ADG  LD++G+GDV +   
Subjt:  LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKA

Query:  NGPIWKIRKVRHVQSMMKNLFTWG
        NG +W + KVRH+  + +NL + G
Subjt:  NGPIWKIRKVRHVQSMMKNLFTWG

RVW92338.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.1e-6044.41Show/hide
Query:  KISHYLKNRRSNLPYCRARVPYCRVLHSKELEFSLDKKPADMEEAKWKKL-DRKMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWE
        +I  YL  R+ +LP     +   +    K L    +K  A+ +    KKL + KM E+ SVA H+N F+T+ N+L +V++ F DE+ A+++L SLP+SWE
Subjt:  KISHYLKNRRSNLPYCRARVPYCRVLHSKELEFSLDKKPADMEEAKWKKL-DRKMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWE

Query:  PMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKSRN---NISKSRN-IRLECWNCGKTRHLRRNCKAPK
         M+ A+ NS GKEKLK+ D++D  L +EIRR+D +G  S SG+ LN++ RGR N+R+  +    SRN   N SKSR+  +++CWNCGKT H +R CK+PK
Subjt:  PMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKSRN---NISKSRN-IRLECWNCGKTRHLRRNCKAPK

Query:  KAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL
        K + ++  AN +TEE+  AL L V+   + WV++SGASFHTT  R+I++NYV+ ++GKVY ADG  LD++G+GDV +   NG +W + KVRH+  + +NL
Subjt:  KAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL

Query:  FTWG
         + G
Subjt:  FTWG

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]3.8e-7855.16Show/hide
Query:  LHSKELEFSLDKKPADMEEAKWKKLDR----------------------------------------------------KMDEDTSVAVHINGFDTLVNK
        LHSKELEF L+ KP DM E +WKKLDR                                                    KM E T +  H+N FD L+NK
Subjt:  LHSKELEFSLDKKPADMEEAKWKKLDR----------------------------------------------------KMDEDTSVAVHINGFDTLVNK

Query:  LVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVDRGRNNSRDYGKHGKSRNNISKSRNIR
        LVAVDL F+ E+ AILLLRSLPDSWEPM+AAI NSC KEKLKF DV+DAAL +EIRRKD SGIA TSG+ LNVDRGRNN+R YG  GKS+NN S+SRN R
Subjt:  LVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVDRGRNNSRDYGKHGKSRNNISKSRNIR

Query:  LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKA
         ECWNCGK  HL+ NCKAPKK EG EA AN + E+IH AL + VE AH+ WV++SG                  N+GKVY ADGEPLDIIGIG+VNLK A
Subjt:  LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKA

Query:  NGPIWKIRKV
        NG +WKIRK+
Subjt:  NGPIWKIRKV

TrEMBL top hitse value%identityAlignment
A0A2N9GTI4 Uncharacterized protein3.1e-6242.26Show/hide
Query:  GKISHYLKNRRSNLPYCRARVPYCRVLHSKELEFS-LDKKPADMEEAKWKKLDR----------------------------------KMDEDTSVAVHI
        GK+S   K   ++  Y R ++     L+ K+L    L +KP DME+A+W  LDR                                  KM E T+VA H+
Subjt:  GKISHYLKNRRSNLPYCRARVPYCRVLHSKELEFS-LDKKPADMEEAKWKKLDR----------------------------------KMDEDTSVAVHI

Query:  NGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKS
        N F+T+ N+L +V++ F DE+ A+++L SLP+SWE M+ A+ NS GK KLK+ D++D  L +E+RR+D +G  S+SG+ LN++ RGR   R+Y +   KS
Subjt:  NGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKS

Query:  RNNISKSRNIR-LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLD
        R   SKS+  R LECWNCGKT H+R+NC   KK + +   AN++TEE+H AL L V+    +WV++SGASFHTT  R+I++NYV+ ++GKVY AD E LD
Subjt:  RNNISKSRNIR-LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLD

Query:  IIGIGDVNLKKANGPIWKIRKVRHVQSMMKNLFTWG
        ++G+GDV +   NG +W ++KVRHV  + +NL + G
Subjt:  IIGIGDVNLKKANGPIWKIRKVRHVQSMMKNLFTWG

A0A2N9HHD8 Uncharacterized protein3.1e-6242.26Show/hide
Query:  GKISHYLKNRRSNLPYCRARVPYCRVLHSKELEFS-LDKKPADMEEAKWKKLDR----------------------------------KMDEDTSVAVHI
        GK+S   K   ++  Y R ++     L+ K+L    L +KP DME+A+W  LDR                                  KM E T+VA H+
Subjt:  GKISHYLKNRRSNLPYCRARVPYCRVLHSKELEFS-LDKKPADMEEAKWKKLDR----------------------------------KMDEDTSVAVHI

Query:  NGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKS
        N F+T+ N+L +V++ F DE+ A+++L SLP+SWE M+ A+ NS GK KLK+ D++D  L +E+RR+D +G  S+SG+ LN++ RGR   R+Y +   KS
Subjt:  NGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVD-RGRNNSRDYGK-HGKS

Query:  RNNISKSRNIR-LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLD
        R   SKS+  R LECWNCGKT H+R+NC   KK + +   AN++TEE+H AL L V+    +WV++SGASFHTT  R+I++NYV+ ++GKVY AD E LD
Subjt:  RNNISKSRNIR-LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLD

Query:  IIGIGDVNLKKANGPIWKIRKVRHVQSMMKNLFTWG
        ++G+GDV +   NG +W ++KVRHV  + +NL + G
Subjt:  IIGIGDVNLKKANGPIWKIRKVRHVQSMMKNLFTWG

A0A2P5CH01 Zinc finger, CCHC-type1.6e-6651.5Show/hide
Query:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG
        KM E  SVA HIN F+T+V++L +V++ F +E+ A++LL SLP SWEPM+AA+ NS GKEKL+F DV+D  L +E+RR D SG  ++S + LN+   DR 
Subjt:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG

Query:  RNNSRDYGK-HGKSRNNISKSRN-IRLECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE
           + + G+   KSRN  +KSR+  R+ECWNCGKT H+++NC+AP+K + +   AN IT+E+  AL L V+   ++WV++SGASFHTT  RD+LENY++ 
Subjt:  RNNSRDYGK-HGKSRNNISKSRN-IRLECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE

Query:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW
        NYGKVY ADGEPLDI+G+GD+ LK +NG +WKI+KVRHV  +M+NL              FT GSW
Subjt:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW

A0A2P5CPV0 Zinc finger, CCHC-type8.8e-6550.75Show/hide
Query:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG
        KM E  SVA HIN F+T+V++L +V++ F DE+ A++LL SLP SWE M+AA+ NS GK KL+F DV+D  L +E+RR D SG  ++S + LN+   DR 
Subjt:  KMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNV---DRG

Query:  RNNSRDYGK-HGKSRNNISKSRNIRL-ECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE
           + ++G+   KSRN   KSR+ R  ECWNCGKT H+++NC+AP+K + +   AN +T+E+  AL L V+   ++WV++SGASFHTT  R++LENYV+ 
Subjt:  RNNSRDYGK-HGKSRNNISKSRNIRL-ECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSE

Query:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW
        NYGKVY ADGEPLDI+G+GD+ LK +NG +WKI+KVRHV  +M+NL              FT GSW
Subjt:  NYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNL--------------FTWGSW

A0A6J1DF43 uncharacterized protein LOC1110204691.8e-7855.16Show/hide
Query:  LHSKELEFSLDKKPADMEEAKWKKLDR----------------------------------------------------KMDEDTSVAVHINGFDTLVNK
        LHSKELEF L+ KP DM E +WKKLDR                                                    KM E T +  H+N FD L+NK
Subjt:  LHSKELEFSLDKKPADMEEAKWKKLDR----------------------------------------------------KMDEDTSVAVHINGFDTLVNK

Query:  LVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVDRGRNNSRDYGKHGKSRNNISKSRNIR
        LVAVDL F+ E+ AILLLRSLPDSWEPM+AAI NSC KEKLKF DV+DAAL +EIRRKD SGIA TSG+ LNVDRGRNN+R YG  GKS+NN S+SRN R
Subjt:  LVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVDRGRNNSRDYGKHGKSRNNISKSRNIR

Query:  LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKA
         ECWNCGK  HL+ NCKAPKK EG EA AN + E+IH AL + VE AH+ WV++SG                  N+GKVY ADGEPLDIIGIG+VNLK A
Subjt:  LECWNCGKTRHLRRNCKAPKKAEGKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKA

Query:  NGPIWKIRKV
        NG +WKIRK+
Subjt:  NGPIWKIRKV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.6e-2533.2Show/hide
Query:  MDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRK---DYSGIASTSGTTLNVDRGR
        M E T+   H+N F+ L+ +L  + +   +E  AILLL SLP S++ +   IL+  GK  ++  DV  A L  E  RK   +      T G   +  R  
Subjt:  MDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLLRSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRK---DYSGIASTSGTTLNVDRGR

Query:  NNSRDYGKHGKSRNNISKSRNIRLECWNCGKTRHLRRNCKAPKKAEGKEAG-------ANIITEEIHVALFLIVE-------GAHNAWVVNSGASFHTTG
        NN    G  GKS+N  SKSR +R  C+NC +  H +R+C  P+K +G+ +G       A ++    +V LF+  E       G  + WVV++ AS H T 
Subjt:  NNSRDYGKHGKSRNNISKSRNIRLECWNCGKTRHLRRNCKAPKKAEGKEAG-------ANIITEEIHVALFLIVE-------GAHNAWVVNSGASFHTTG

Query:  QRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNLFT
         RD+   YV+ ++G V   +     I GIGD+ +K   G    ++ VRHV  +  NL +
Subjt:  QRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNLFT

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCTCTGTGAAAGTCAGACTGCACTTTTCGGTTGGCAAAATCAGCCACTACTTGAAGAACCGACGCTCCAATTTACCTTACTGTCGTGCTCGTGTACCT
TACTGCAGAGTGCTACATTCCAAGGAATTGGAATTTTCATTAGATAAGAAGCCAGCTGACATGGAAGAAGCCAAATGGAAAAAGTTAGATAGGAAGATGGATGAA
GATACATCCGTAGCTGTCCATATAAATGGATTTGATACGTTGGTTAACAAACTGGTTGCTGTCGATTTAACATTTACGGATGAATTAAATGCTATCTTATTGTTA
AGATCCTTACCTGACAGTTGGGAGCCTATGAAGGCAGCTATTTTAAATTCTTGTGGAAAAGAGAAATTGAAATTTGCAGATGTCAAAGATGCAGCTCTTGAAAAG
GAGATTCGCAGAAAGGATTATTCTGGTATTGCGTCTACTTCTGGTACAACATTGAATGTGGACAGAGGAAGAAATAATAGCAGAGACTACGGAAAACATGGAAAG
TCAAGAAACAACATAAGCAAGTCTAGAAACATCAGACTAGAATGTTGGAATTGTGGTAAGACAAGACATCTGAGGAGGAACTGTAAAGCCCCAAAGAAAGCTGAG
GGGAAAGAAGCTGGTGCAAATATTATTACTGAAGAAATACATGTTGCTCTATTTCTTATAGTTGAAGGCGCTCATAACGCATGGGTGGTGAATTCAGGTGCGTCT
TTTCACACTACAGGACAACGTGATATTCTTGAGAACTATGTTTCAGAAAATTATGGAAAGGTGTATTTTGCTGATGGAGAACCTTTGGACATCATTGGGATTGGT
GACGTTAATTTAAAAAAGGCAAACGGTCCAATCTGGAAGATTCGCAAGGTACGTCACGTTCAGAGTATGATGAAGAACTTGTTTACGTGGGGCAGCTGGATAATG
AAGGATGTCAAATATTCTTCGGTCAAGGAAACTGGAAAGTTACAAAGGGTTCCATGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCGCTCTGTGAAAGTCAGACTGCACTTTTCGGTTGGCAAAATCAGCCACTACTTGAAGAACCGACGCTCCAATTTACCTTACTGTCGTGCTCGTGTACCT
TACTGCAGAGTGCTACATTCCAAGGAATTGGAATTTTCATTAGATAAGAAGCCAGCTGACATGGAAGAAGCCAAATGGAAAAAGTTAGATAGGAAGATGGATGAA
GATACATCCGTAGCTGTCCATATAAATGGATTTGATACGTTGGTTAACAAACTGGTTGCTGTCGATTTAACATTTACGGATGAATTAAATGCTATCTTATTGTTA
AGATCCTTACCTGACAGTTGGGAGCCTATGAAGGCAGCTATTTTAAATTCTTGTGGAAAAGAGAAATTGAAATTTGCAGATGTCAAAGATGCAGCTCTTGAAAAG
GAGATTCGCAGAAAGGATTATTCTGGTATTGCGTCTACTTCTGGTACAACATTGAATGTGGACAGAGGAAGAAATAATAGCAGAGACTACGGAAAACATGGAAAG
TCAAGAAACAACATAAGCAAGTCTAGAAACATCAGACTAGAATGTTGGAATTGTGGTAAGACAAGACATCTGAGGAGGAACTGTAAAGCCCCAAAGAAAGCTGAG
GGGAAAGAAGCTGGTGCAAATATTATTACTGAAGAAATACATGTTGCTCTATTTCTTATAGTTGAAGGCGCTCATAACGCATGGGTGGTGAATTCAGGTGCGTCT
TTTCACACTACAGGACAACGTGATATTCTTGAGAACTATGTTTCAGAAAATTATGGAAAGGTGTATTTTGCTGATGGAGAACCTTTGGACATCATTGGGATTGGT
GACGTTAATTTAAAAAAGGCAAACGGTCCAATCTGGAAGATTCGCAAGGTACGTCACGTTCAGAGTATGATGAAGAACTTGTTTACGTGGGGCAGCTGGATAATG
AAGGATGTCAAATATTCTTCGGTCAAGGAAACTGGAAAGTTACAAAGGGTTCCATGGTGA
Protein sequenceShow/hide protein sequence
MSRSVKVRLHFSVGKISHYLKNRRSNLPYCRARVPYCRVLHSKELEFSLDKKPADMEEAKWKKLDRKMDEDTSVAVHINGFDTLVNKLVAVDLTFTDELNAILLL
RSLPDSWEPMKAAILNSCGKEKLKFADVKDAALEKEIRRKDYSGIASTSGTTLNVDRGRNNSRDYGKHGKSRNNISKSRNIRLECWNCGKTRHLRRNCKAPKKAE
GKEAGANIITEEIHVALFLIVEGAHNAWVVNSGASFHTTGQRDILENYVSENYGKVYFADGEPLDIIGIGDVNLKKANGPIWKIRKVRHVQSMMKNLFTWGSWIM
KDVKYSSVKETGKLQRVPW