; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:13619691..13620518
RNA-Seq ExpressionMoc02g18210
SyntenyMoc02g18210
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81130.1 hypothetical protein VITISV_003944 [Vitis vinifera]1.1e-8055.36Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TT  LM AL+ MYEKP  NNKV+L TK FNLKMA+   +  HLNEF+ + N+L +V+++F DE+ A+++L SLP+SWE MR A+SNS GKEKLK+ D+RD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNVD-RGRNNNR-SYRNRGKSKN---NRSRSRN-SMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV
          LAEEIRR+D+G  S  GS LN++ RGR NNR S + R  S+N   NRS+SR+    +CWNCGKTGH KR CK+PKK   + +   V E++ DAL+LAV
Subjt:  AALAEEIRRKDSGIASTFGSVLNVD-RGRNNNR-SYRNRGKSKN---NRSRSRN-SMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV

Query:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG
        +S  D WV+DSG SFHTT  R+I++NY+ G+ GKVYLADG  LD++G+GDV + + NGS+W + KVRH+ ++ +NLIS+G
Subjt:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]8.8e-8155.59Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TT GLM  L++MYEKP  NNKV+L  K F+LKM +G P+  H+NEF+ ++N+L +V++EF DEV A++L+ SLP+SWEPMRAA+SNS G +KLKF DVRD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNVD-RGRNNNRSYRNRGKSKN----NRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNK-AGAN-VAEQIHDALVLA
          L EE+RR D+G  ST  S  NV+ RGR+ NR  +NRG+SK+     +S+SR  + ECWNCGKTGH K NC AP K E NK  GAN V ++I DAL+++
Subjt:  AALAEEIRRKDSGIASTFGSVLNVD-RGRNNNRSYRNRGKSKN----NRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNK-AGAN-VAEQIHDALVLA

Query:  VES-----AHDT------------------WVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNL
        V+S      HDT                  WV+DSG SFHTT  R+I+ENY+VGN+GKVYLA+G PLDI+GIGD+NLKM++G +WKI KVRHV  +M+NL
Subjt:  VES-----AHDT------------------WVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNL

Query:  ISMG
        IS+G
Subjt:  ISMG

PON60333.1 Zinc finger, CCHC-type [Parasponia andersonii]5.0e-8456.07Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TT  +M+ L++MYEKP  NNKV+L  K F LKM +G  +  H+NEF+ ++++L +V++ F +EV A++LL SLP SWEPMRAA+SNS GKEKL+F DVRD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNV---DRG--RNNNRSYRNRGKSKNNRSRSRNS-MFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV
          LAEE+RR DSG  ++  S LN+   DR   +N+NR  R++ KS+N R++SR+    ECWNCGKTGH+K+NC+AP+K++       + +++ DAL+L+V
Subjt:  AALAEEIRRKDSGIASTFGSVLNV---DRG--RNNNRSYRNRGKSKNNRSRSRNS-MFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV

Query:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG
        ++  D+WV+DSG SFHTT  RD+LENYI GN+GKVYLADGEPLDI+G+GD+ LKM+NGS+WKI+KVRHV  +M+NLIS+G
Subjt:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG

PON63051.1 Zinc finger, CCHC-type [Trema orientale]6.7e-8154.48Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TT  +M+ L+++YEK   NNKVYL  K F LKM +G  +  H+NEF+ ++++L +V++ F DEV A++LL SLP SWE MRAA+SNS GK KL+F DVRD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNV---DRGRNNNRSY-RNRGKSKNNRSRSRNSMF-ECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVE
          LAEE+RR DSG  ++  S LN+   DR    N ++ R R KS+N R +SR+    ECWNCGKTGH+K+NC+AP+K++       V +++ DAL+L+V+
Subjt:  AALAEEIRRKDSGIASTFGSVLNV---DRGRNNNRSY-RNRGKSKNNRSRSRNSMF-ECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVE

Query:  SAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG
        +  D+WV+DSG SFHTT  R++LENY+ GN+GKVYLADGEPLDI+G+GD+ LKM+NGS+WKI+KVRHV  +M+NL+S+G
Subjt:  SAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]3.2e-11585.38Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TTMGLMNALANMYEK  VNNKVYLATKFFNLKMA+ TPITAHLNEFD LINKLVAVDLEFS EVYAILLLRSLPDSWEPMRAAISNSC KEKLKFEDVRD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRNRGKSKNNRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVESAHDT
        AALAEEIRRKDSGIA T GSVLNVDRGRNNNR Y NRGKSKNNRSRSRNS FECWNCGK GHLK NCKAPKKNEGN+A ANVAEQIHDALV+AVESAHDT
Subjt:  AALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRNRGKSKNNRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVESAHDT

Query:  WVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKV
        WVMDS                  GNHGKVYLADGEPLDIIGIG+VNLKMANGS+WKIRK+
Subjt:  WVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKV

TrEMBL top hitse value%identityAlignment
A0A0D3CS45 Uncharacterized protein3.2e-8154.97Show/hide
Query:  TMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDA
        T GLM  L++MYEKP  NNKV+L  K F+LKM +G  + AH+NEF+ ++N+L +V++EF DEV A++LL SLP+SWEPMRAA+SNS G +KLKF DVRD 
Subjt:  TMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDA

Query:  ALAEEIRRKDSGIASTFGSVLNVD-RGRNNNRSYRNRGKSKNNRSRSRNSM---FECWNCGKTGHLKRNCKAPKKNEGN-KAGAN-VAEQIHDALVLAVE
         LAEE+RR DSG AST  S  NV+ RGRN +R+ R+ G+SK+   R ++      ECWNCGKTGH+K+NC+AP K E N + GAN V  +I DALV++V+
Subjt:  ALAEEIRRKDSGIASTFGSVLNVD-RGRNNNRSYRNRGKSKNNRSRSRNSM---FECWNCGKTGHLKRNCKAPKKNEGN-KAGAN-VAEQIHDALVLAVE

Query:  SAH-----------------------DTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLIS
        S                         D+WV+DSG SFHTT   +I+ENY+ GN+GKVYLADG PLDI+GIGD+NLKM++G +WKI KVRHV  +M+NLIS
Subjt:  SAH-----------------------DTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLIS

Query:  MG
        +G
Subjt:  MG

A0A2P5CH01 Zinc finger, CCHC-type2.4e-8456.07Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TT  +M+ L++MYEKP  NNKV+L  K F LKM +G  +  H+NEF+ ++++L +V++ F +EV A++LL SLP SWEPMRAA+SNS GKEKL+F DVRD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNV---DRG--RNNNRSYRNRGKSKNNRSRSRNS-MFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV
          LAEE+RR DSG  ++  S LN+   DR   +N+NR  R++ KS+N R++SR+    ECWNCGKTGH+K+NC+AP+K++       + +++ DAL+L+V
Subjt:  AALAEEIRRKDSGIASTFGSVLNV---DRG--RNNNRSYRNRGKSKNNRSRSRNS-MFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV

Query:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG
        ++  D+WV+DSG SFHTT  RD+LENYI GN+GKVYLADGEPLDI+G+GD+ LKM+NGS+WKI+KVRHV  +M+NLIS+G
Subjt:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG

A0A2P5CPV0 Zinc finger, CCHC-type3.2e-8154.48Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TT  +M+ L+++YEK   NNKVYL  K F LKM +G  +  H+NEF+ ++++L +V++ F DEV A++LL SLP SWE MRAA+SNS GK KL+F DVRD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNV---DRGRNNNRSY-RNRGKSKNNRSRSRNSMF-ECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVE
          LAEE+RR DSG  ++  S LN+   DR    N ++ R R KS+N R +SR+    ECWNCGKTGH+K+NC+AP+K++       V +++ DAL+L+V+
Subjt:  AALAEEIRRKDSGIASTFGSVLNV---DRGRNNNRSY-RNRGKSKNNRSRSRNSMF-ECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVE

Query:  SAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG
        +  D+WV+DSG SFHTT  R++LENY+ GN+GKVYLADGEPLDI+G+GD+ LKM+NGS+WKI+KVRHV  +M+NL+S+G
Subjt:  SAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG

A0A6J1DF43 uncharacterized protein LOC1110204691.5e-11585.38Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TTMGLMNALANMYEK  VNNKVYLATKFFNLKMA+ TPITAHLNEFD LINKLVAVDLEFS EVYAILLLRSLPDSWEPMRAAISNSC KEKLKFEDVRD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRNRGKSKNNRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVESAHDT
        AALAEEIRRKDSGIA T GSVLNVDRGRNNNR Y NRGKSKNNRSRSRNS FECWNCGK GHLK NCKAPKKNEGN+A ANVAEQIHDALV+AVESAHDT
Subjt:  AALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRNRGKSKNNRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVESAHDT

Query:  WVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKV
        WVMDS                  GNHGKVYLADGEPLDIIGIG+VNLKMANGS+WKIRK+
Subjt:  WVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKV

A0A7N2N811 Uncharacterized protein2.5e-8155Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        TT  LM AL+ MYEKP VNNKV+L  K FNLKMA+   +  HLNEF+ + N+L +V+++F DE++A+++L SLP+SWE MR A+SNS GKEKLK+ D+RD
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNVD-RGRNNNR-SYRNRGKSKN---NRSRSRN-SMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV
          L EEIRR+D+G  S  GS LN++ RGR NNR S R R KS+N   NRS+SR+    +CWNCGKTGH +  CK+PKK   + +   V E++ DAL+LAV
Subjt:  AALAEEIRRKDSGIASTFGSVLNVD-RGRNNNR-SYRNRGKSKN---NRSRSRN-SMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAV

Query:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG
        +S  D WV+DSGTSFHTT  R+I++NY++G+ GKVYLADG  LD++G+GDV + + NGS+W + K+RH+ ++ +NLIS+G
Subjt:  ESAHDTWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-0625.33Show/hide
Query:  MTTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVR
        +T   ++  L  +YE+  + +++ L  +  +LK++    + +H + FD LI++L+A   +  +      LL +LP  ++ +  AI  +  +E L    V+
Subjt:  MTTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVR

Query:  DAALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRNRGKSKNNRSR---SRNSMF--ECWNCGKTGHLKRNCKAPKKNEGNKAGAN---VAEQIHDALV
        +  L +EI+ K+     T   V+N     NNN    N  K++  + +     NS +  +C +CG+ GH+K++C   K+   NK   N   V       + 
Subjt:  DAALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRNRGKSKNNRSR---SRNSMF--ECWNCGKTGHLKRNCKAPKKNEGNKAGAN---VAEQIHDALV

Query:  LAVESAHDTWVMDSGTSFHTTGQRDILEN
          V+  ++T VMD+      +G  D L N
Subjt:  LAVESAHDTWVMDSGTSFHTTGQRDILEN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.8e-3233.11Show/hide
Query:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD
        T  G+   L ++Y    + NK+YL  + + L M++GT   +HLN F+ LI +L  + ++  +E  AILLL SLP S++ +   I +  GK  ++ +DV  
Subjt:  TTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRD

Query:  AALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRN------RGKSKNNRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNKAGA----NVAEQI--HD
        A L  E  RK     +   +++   RGR+  RS  N      RGKSKN   RS++ +  C+NC + GH KR+C  P+K +G  +G     N A  +  +D
Subjt:  AALAEEIRRKDSGIASTFGSVLNVDRGRNNNRSYRN------RGKSKNNRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNKAGA----NVAEQI--HD

Query:  ALVLAVESAHD---------TWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLIS
         +VL +    +          WV+D+  S H T  RD+   Y+ G+ G V + +     I GIGD+ +K   G    ++ VRHV ++  NLIS
Subjt:  ALVLAVESAHD---------TWVMDSGTSFHTTGQRDILENYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLIS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACAATGGGGTTGATGAATGCCCTGGCCAACATGTATGAAAAACCTTTGGTAAATAATAAGGTGTATCTTGCAACTAAATTTTTTAATTTGAAAATGGCTAAAGG
TACACCTATTACTGCCCATTTAAATGAATTTGACGCATTGATTAATAAACTGGTAGCTGTTGATTTAGAATTCAGTGATGAAGTTTATGCTATTTTGTTATTAAGATCTT
TGCCTGATAGTTGGGAACCCATGCGAGCTGCTATTTCGAATTCTTGTGGGAAAGAGAAATTGAAATTTGAAGATGTTAGAGATGCAGCTCTTGCAGAAGAAATTCGCAGG
AAGGACTCTGGTATCGCTTCTACTTTTGGTTCAGTATTGAATGTGGACAGAGGAAGAAATAATAATAGAAGTTATAGGAATCGAGGCAAGTCGAAAAACAACAGAAGCAG
GTCGAGAAACAGCATGTTTGAGTGTTGGAATTGTGGTAAGACTGGACACTTGAAGAGGAATTGCAAGGCCCCAAAGAAAAATGAAGGGAACAAAGCCGGTGCTAATGTTG
CTGAGCAGATACATGATGCTTTGGTTCTTGCAGTTGAGAGCGCTCATGACACATGGGTGATGGATTCAGGTACGTCTTTTCATACTACAGGACAACGTGACATTCTTGAA
AATTATATTGTAGGAAATCATGGAAAGGTCTATCTTGCCGATGGAGAGCCTTTAGATATCATTGGTATCGGTGACGTGAATTTAAAAATGGCAAATGGTTCAATTTGGAA
AATTCGCAAGGTACGTCACGTTCAGAATATGATGAAAAACTTGATTTCTATGGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCACAATGGGGTTGATGAATGCCCTGGCCAACATGTATGAAAAACCTTTGGTAAATAATAAGGTGTATCTTGCAACTAAATTTTTTAATTTGAAAATGGCTAAAGG
TACACCTATTACTGCCCATTTAAATGAATTTGACGCATTGATTAATAAACTGGTAGCTGTTGATTTAGAATTCAGTGATGAAGTTTATGCTATTTTGTTATTAAGATCTT
TGCCTGATAGTTGGGAACCCATGCGAGCTGCTATTTCGAATTCTTGTGGGAAAGAGAAATTGAAATTTGAAGATGTTAGAGATGCAGCTCTTGCAGAAGAAATTCGCAGG
AAGGACTCTGGTATCGCTTCTACTTTTGGTTCAGTATTGAATGTGGACAGAGGAAGAAATAATAATAGAAGTTATAGGAATCGAGGCAAGTCGAAAAACAACAGAAGCAG
GTCGAGAAACAGCATGTTTGAGTGTTGGAATTGTGGTAAGACTGGACACTTGAAGAGGAATTGCAAGGCCCCAAAGAAAAATGAAGGGAACAAAGCCGGTGCTAATGTTG
CTGAGCAGATACATGATGCTTTGGTTCTTGCAGTTGAGAGCGCTCATGACACATGGGTGATGGATTCAGGTACGTCTTTTCATACTACAGGACAACGTGACATTCTTGAA
AATTATATTGTAGGAAATCATGGAAAGGTCTATCTTGCCGATGGAGAGCCTTTAGATATCATTGGTATCGGTGACGTGAATTTAAAAATGGCAAATGGTTCAATTTGGAA
AATTCGCAAGGTACGTCACGTTCAGAATATGATGAAAAACTTGATTTCTATGGGGTAG
Protein sequenceShow/hide protein sequence
MTTMGLMNALANMYEKPLVNNKVYLATKFFNLKMAKGTPITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALAEEIRR
KDSGIASTFGSVLNVDRGRNNNRSYRNRGKSKNNRSRSRNSMFECWNCGKTGHLKRNCKAPKKNEGNKAGANVAEQIHDALVLAVESAHDTWVMDSGTSFHTTGQRDILE
NYIVGNHGKVYLADGEPLDIIGIGDVNLKMANGSIWKIRKVRHVQNMMKNLISMG