; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g11690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g11690
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:9943457..9957671
RNA-Seq ExpressionMoc09g11690
SyntenyMoc09g11690
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]4.5e-6746.04Show/hide
Query:  RIVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEF
        +I DYL+ K+L  PL  KP+ M + +W  L+R+VLG IRLTL+KNV  +VAKE TT GLM  LS++YEKP  NNKV L  K F+LKM+EG  VA H+NEF
Subjt:  RIVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEF

Query:  DTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRNNNKDHGKRG--KSRNNI
        +T++N+L +V++ F DE+  ++L+ SLP+SWEPM+AA+SNS G +KLKF DVRD  LGEE+RR D+G  STS      +RGR+ N+ +  RG  KSRN  
Subjt:  DTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRNNNKDHGKRG--KSRNNI

Query:  SKSRNSK-LECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEVHDALVL-------------------------------------------------
         +S++ K +ECWNCGKTGH + NC A PKK   K  GAN V +E+ DAL++                                                 
Subjt:  SKSRNSK-LECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEVHDALVL-------------------------------------------------

Query:  ---------VVYLADGEPLDINEIGDVNLKMANGSVWNIRK
                  VYLA+G PLDI  IGD+NLKM++G VW I K
Subjt:  ---------VVYLADGEPLDINEIGDVNLKMANGSVWNIRK

PON60333.1 Zinc finger, CCHC-type [Parasponia andersonii]3.3e-7049.06Show/hide
Query:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE
        +I DYLH+K+L  PL  KKP+ ME+  W+ L+R+VLG IRLTLTKNV  +VA+  TT  +MS LS++YEKP  NNKV L  K F LKM EG  VA HINE
Subjt:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE

Query:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR
        F+T++++L +V++ F +E+  ++LL SLP SWEPM+AA+SNS GKEKL+F DVRD  L EE+RR DSG  ++S + L++   DR    N + G+ + KSR
Subjt:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR

Query:  NNISKSRNS-KLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI
        N  +KSR+  ++ECWNCGKTGH+++NC+AP+K + +   AN + +EV DAL+L                                    VYLADGEPLDI
Subjt:  NNISKSRNS-KLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI

Query:  NEIGDVNLKMANGSVWNIRK
          +GD+ LKM+NGSVW I+K
Subjt:  NEIGDVNLKMANGSVWNIRK

PON63051.1 Zinc finger, CCHC-type [Trema orientale]1.2e-6749.38Show/hide
Query:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE
        +I DYLHSK+L  PL  KKP+ ME+  W+  +R+VLG IRLTLTKNV  +VA+  TT  +MS LS+IYEK   NNKV L  K F LKM EG  VA HINE
Subjt:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE

Query:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR
        F+T++++L +V++ F DE+  ++LL SLP SWE M+AA+SNS GK KL+F DVRD  L EE+RR DSG  ++S + L++   DR    N + G+ R KSR
Subjt:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR

Query:  NNISKSRNSKL-ECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI
        N   KSR+ +  ECWNCGKTGH+++NC+AP+K + +   AN V +EV DAL+L                                    VYLADGEPLDI
Subjt:  NNISKSRNSKL-ECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI

Query:  NEIGDVNLKMANGSVWNIRK
          +GD+ LKM+NGSVW I+K
Subjt:  NEIGDVNLKMANGSVWNIRK

RVX12493.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.7e-6750Show/hide
Query:  RIVDYLHSKELELP-LDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE
        +I DYL+ ++L LP L  KP+ ++  +W  L+R+VLG IRLTL+++V  +V KE TT  LM ALS +YEKP  NNKV L  K FNLKM E   VA H+NE
Subjt:  RIVDYLHSKELELP-LDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE

Query:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVD---RGRNNNKDHGKRGKSRN
        F+T+ N+L +V++ F DE+  +++L SLP+SWE M+ A+SNS GKEKLK+ D+RD  L EEIRR+D+G  S SG+ L+++   RG N N + G R  SRN
Subjt:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVD---RGRNNNKDHGKRGKSRN

Query:  ---NISKSRN-SKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLV-------VYLADGEPLDINEIGDVNLKMANGSVWNIRKNTQYIS
           N SKSR+  +++CWNCGKTGH +R CK+PKK + ++  A+ V EEV DAL+L        VYLADG  LD+  +GDV + + NGSVW + K  ++I 
Subjt:  ---NISKSRN-SKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLV-------VYLADGEPLDINEIGDVNLKMANGSVWNIRKNTQYIS

Query:  PPEVET
         PE E+
Subjt:  PPEVET

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]9.0e-10873.22Show/hide
Query:  IVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEFD
        ++DYLHSKELE PL+ KPDDM E +WKKL+RKVLGTIRLTLTKNVQSSVAK  TTMGLM+AL+N+YEK  VNNKV LATKFFNLKM E T + AH+NEFD
Subjt:  IVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEFD

Query:  TLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRNNNKDHGKRGKSRNNISKS
         LINKLVAVDL F+ E+  ILLLRSLPDSWEPM+AAISNSC KEKLKF DVRDAAL EEIRRKDSGIA TSG+ L+VDRGRNNN+ +G RGKS+NN S+S
Subjt:  TLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRNNNKDHGKRGKSRNNISKS

Query:  RNSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLV-----------------VYLADGEPLDINEIGDVNLKMANGSVWNIRK
        RNS+ ECWNCGK GHL+ NCKAPKK EG EA AN VAE++HDALV+                  VYLADGEPLDI  IG+VNLKMANGSVW IRK
Subjt:  RNSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLV-----------------VYLADGEPLDINEIGDVNLKMANGSVWNIRK

TrEMBL top hitse value%identityAlignment
A0A0D3CS45 Uncharacterized protein8.0e-7047.37Show/hide
Query:  RIVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEF
        +I DYL+ K+L  PL KKP+ M++ +W+ L+R+VLG IRLTL+KNV  +VAKE  T GLM  LS++YEKP  NNKV L  K F+LKM+EG  VAAH+NEF
Subjt:  RIVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEF

Query:  DTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRN---NNKDHGKRGKSRNN
        +T++N+L +V++ F DE+  ++LL SLP+SWEPM+AA+SNS G +KLKF DVRD  L EE+RR DSG ASTS      +RGRN   NN+ +G R KSRN 
Subjt:  DTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRN---NNKDHGKRGKSRNN

Query:  ISKSR-NSKLECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEVHDALVL------------------------------------------------
          +S+     ECWNCGKTGH+++NC+A PKK +    GAN V  E+ DALV+                                                
Subjt:  ISKSR-NSKLECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEVHDALVL------------------------------------------------

Query:  ----------VVYLADGEPLDINEIGDVNLKMANGSVWNIRK
                   VYLADG PLDI  IGD+NLKM++G VW I K
Subjt:  ----------VVYLADGEPLDINEIGDVNLKMANGSVWNIRK

A0A2N9IKI1 Uncharacterized protein9.8e-6848.43Show/hide
Query:  RIVDYLHSKELELP-LDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE
        +I DYL+ K+L LP L +KP+DME+A+W  L+R+VLG IRLTL++ V  +V KE TT  LM+AL  +YEKP  NNKV L  K FNLKM EGT VA H+NE
Subjt:  RIVDYLHSKELELP-LDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE

Query:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVD-RGRNNNKDHGK-RGKSRNN
        F+T+ N+L +V++ F DE+  +++L SLP+SWE M+ A+SNS GK KLK+ D+RD  LGEE+RR+D+G  S+SG+ L+++ RGR  ++++ + R KSR  
Subjt:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVD-RGRNNNKDHGK-RGKSRNN

Query:  ISKSR-NSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDINE
         SKS+   +LECWNCGKTGH+R+NC   KK + +   ANVV EEVHDAL+L                                    VYLAD E LD+  
Subjt:  ISKSR-NSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDINE

Query:  IGDVNLKMANGSVWNIRK
        +GDV + + NGSVW ++K
Subjt:  IGDVNLKMANGSVWNIRK

A0A2P5CH01 Zinc finger, CCHC-type1.6e-7049.06Show/hide
Query:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE
        +I DYLH+K+L  PL  KKP+ ME+  W+ L+R+VLG IRLTLTKNV  +VA+  TT  +MS LS++YEKP  NNKV L  K F LKM EG  VA HINE
Subjt:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE

Query:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR
        F+T++++L +V++ F +E+  ++LL SLP SWEPM+AA+SNS GKEKL+F DVRD  L EE+RR DSG  ++S + L++   DR    N + G+ + KSR
Subjt:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR

Query:  NNISKSRNS-KLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI
        N  +KSR+  ++ECWNCGKTGH+++NC+AP+K + +   AN + +EV DAL+L                                    VYLADGEPLDI
Subjt:  NNISKSRNS-KLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI

Query:  NEIGDVNLKMANGSVWNIRK
          +GD+ LKM+NGSVW I+K
Subjt:  NEIGDVNLKMANGSVWNIRK

A0A2P5CPV0 Zinc finger, CCHC-type5.8e-6849.38Show/hide
Query:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE
        +I DYLHSK+L  PL  KKP+ ME+  W+  +R+VLG IRLTLTKNV  +VA+  TT  +MS LS+IYEK   NNKV L  K F LKM EG  VA HINE
Subjt:  RIVDYLHSKELELPLD-KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINE

Query:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR
        F+T++++L +V++ F DE+  ++LL SLP SWE M+AA+SNS GK KL+F DVRD  L EE+RR DSG  ++S + L++   DR    N + G+ R KSR
Subjt:  FDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHV---DRGRNNNKDHGK-RGKSR

Query:  NNISKSRNSKL-ECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI
        N   KSR+ +  ECWNCGKTGH+++NC+AP+K + +   AN V +EV DAL+L                                    VYLADGEPLDI
Subjt:  NNISKSRNSKL-ECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVL-----------------------------------VVYLADGEPLDI

Query:  NEIGDVNLKMANGSVWNIRK
          +GD+ LKM+NGSVW I+K
Subjt:  NEIGDVNLKMANGSVWNIRK

A0A6J1DF43 uncharacterized protein LOC1110204694.4e-10873.22Show/hide
Query:  IVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEFD
        ++DYLHSKELE PL+ KPDDM E +WKKL+RKVLGTIRLTLTKNVQSSVAK  TTMGLM+AL+N+YEK  VNNKV LATKFFNLKM E T + AH+NEFD
Subjt:  IVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEFD

Query:  TLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRNNNKDHGKRGKSRNNISKS
         LINKLVAVDL F+ E+  ILLLRSLPDSWEPM+AAISNSC KEKLKF DVRDAAL EEIRRKDSGIA TSG+ L+VDRGRNNN+ +G RGKS+NN S+S
Subjt:  TLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRNNNKDHGKRGKSRNNISKS

Query:  RNSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLV-----------------VYLADGEPLDINEIGDVNLKMANGSVWNIRK
        RNS+ ECWNCGK GHL+ NCKAPKK EG EA AN VAE++HDALV+                  VYLADGEPLDI  IG+VNLKMANGSVW IRK
Subjt:  RNSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLV-----------------VYLADGEPLDINEIGDVNLKMANGSVWNIRK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-2332.09Show/hide
Query:  RIVDYLHSKELELPLD---KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHI
        R+ D L  + L   LD   KKPD M+   W  L+ +    IRL L+ +V +++  E T  G+ + L ++Y    + NK+ L  + + L M EGT   +H+
Subjt:  RIVDYLHSKELELPLD---KKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHI

Query:  NEFDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDA-ALGEEIRRKDSGIAS---TSGTTLHVDRGRNNNKDHGKRGK
        N F+ LI +L  + +   +E   ILLL SLP S++ +   I +  GK  ++  DV  A  L E++R+K         T G      R  NN    G RGK
Subjt:  NEFDTLINKLVAVDLTFTDELNVILLLRSLPDSWEPMKAAISNSCGKEKLKFADVRDA-ALGEEIRRKDSGIAS---TSGTTLHVDRGRNNNKDHGKRGK

Query:  SRNNISKSRNSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLVVYLADGEPLDINE
        S+N   +S++    C+NC + GH +R+C  P+K +G+ +G     ++  D    +V   D   L INE
Subjt:  SRNNISKSRNSKLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEVHDALVLVVYLADGEPLDINE

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein2.0e-1250Show/hide
Query:  RIVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKV
        +I DYL+ K+L  PL KK + M +  W  L R+VL  IRLT++KN+  +VAKE +  GLM  LS+IY+KP  NN V
Subjt:  RIVDYLHSKELELPLDKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTGGTGATGATAGACGATGGCACAACCAATGATGGCAGCAATGGTGTGTGGACGGCGGATGGTGACAGTGACAAACGACGGTGGAGGCACAGTTCCGCTATGGT
TGTTGCCTTCTTCCTCGCCTCCTTAGCTTCTTTATATTCAGTTGAGGTCCACATTCTCGAGTTTGATCGTCGGCTTCCTGCAATCCTTCGTCACTACTTCCGAATGCTCT
TCGCCGTCGCTTCTTCCTTCCTTGATCTCTCTTTTTCCTCTCTAACATTCGGCAAGGATGAGCTAAGCAGTGAGCTCCAGATGCCGTACAGAGGACCTGCAAAACAGAGT
AAGAACTTCGATGCTAAAGTGACTCTTCCGACAACCGTCCCGACAGTGCCTCAACGACTCTTTGAGGTTGTCGTGAATCACGACTTGCTGTCGTGGAATTTGTCGGAGTC
TGAGGCTGAGTCGAGATCAGTGCTCTCAGCATCCTCACTAACCCAACTTTGCAAGCACGCGAGAGTCTTGACTAAATTTGGAGCCAATGTTAAGCACATAGAATCAATCG
CTTTCCCACTAATAGTAAATGCAGGCTTTAAAGCAATAGTGGACACTGAGATTGCTAATACATCACGGATAGTAGATTATCTACATTCCAAGGAATTGGAATTGCCATTA
GACAAGAAGCCGGATGACATGGAAGAAGCCAAATGGAAAAAGTTGAACAGGAAGGTGTTGGGTACGATTCGCCTAACATTAACAAAAAATGTGCAGAGCAGCGTAGCTAA
GGAGATTACCACAATGGGGCTGATGAGTGCACTGTCCAACATATATGAGAAGCCCTTAGTAAATAATAAGGTGGATCTTGCAACGAAATTTTTTAATTTGAAGATGGATG
AAGGTACATTTGTGGCTGCCCATATAAATGAATTTGATACGTTGATTAACAAACTGGTTGCTGTAGATTTAACATTTACGGATGAATTAAATGTTATCTTGTTGTTGAGA
TCTTTACCTGACAGTTGGGAGCCTATGAAGGCAGCTATTTCAAATTCTTGTGGAAAAGAGAAATTGAAATTCGCAGATGTAAGAGATGCAGCTCTTGGAGAGGAGATTCG
CAGAAAGGATTCTGGTATTGCGTCTACTTCTGGTACAACATTGCATGTGGATAGAGGAAGAAATAATAACAAAGACCACGGAAAACGTGGAAAGTCAAGAAACAACATAA
GTAAGTCTAGAAACAGCAAACTAGAATGTTGGAATTGTGGTAAGACAGGACATCTGAGGAGGAACTGCAAAGCTCCAAAGAAAGCTGAGGGGAAAGAAGCTGGTGCAAAT
GTTGTTGCCGAAGAAGTACATGATGCTCTAGTTCTTGTAGTGTATCTTGCTGATGGAGAACCTTTGGACATCAATGAGATTGGTGACGTTAATTTAAAAATGGCGAACGG
TTCAGTCTGGAATATTCGCAAGAATACTCAATATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAGAGATTGAAGATCAGAATACAATTACTCCTGAAGAAACAGCTG
TGGGATCTGATGAACAAGTTGAGGAATTTGATGAACCAGTTGTGGAAACTGATCAGTTGGACTTCTCCACGACTAGAGAGGAGTCGCTACCGTTGCAGAGGAGACATACT
GCTTGCTTTATCTTCAAGTGGGAGATTGTTGGGATTATGGAGCCAAACCAAGGAGAAAATCTTTTTCATATAAGGAAAAGAACTCTTTTGCAATTTGGATTGCTCGATAA
AACTATCTCCCTTTCTCACAATATCTGCAAGAAATTTCCTGTTTCAAAAGACAATTCCCACAATACTGGTTCTCATCCAGAGGATAGTGAGGAAGACCTTGTGGTGGTGT
CGATTGGTCGTCGTCTGGACCTTCGTAGCGTCGAGACGCTGGTCCAACACGCGCGTGCACGTCGAGATGCTGTCGCTGGCTTCTCAAAGCTGCCCCGGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGTGGTGATGATAGACGATGGCACAACCAATGATGGCAGCAATGGTGTGTGGACGGCGGATGGTGACAGTGACAAACGACGGTGGAGGCACAGTTCCGCTATGGT
TGTTGCCTTCTTCCTCGCCTCCTTAGCTTCTTTATATTCAGTTGAGGTCCACATTCTCGAGTTTGATCGTCGGCTTCCTGCAATCCTTCGTCACTACTTCCGAATGCTCT
TCGCCGTCGCTTCTTCCTTCCTTGATCTCTCTTTTTCCTCTCTAACATTCGGCAAGGATGAGCTAAGCAGTGAGCTCCAGATGCCGTACAGAGGACCTGCAAAACAGAGT
AAGAACTTCGATGCTAAAGTGACTCTTCCGACAACCGTCCCGACAGTGCCTCAACGACTCTTTGAGGTTGTCGTGAATCACGACTTGCTGTCGTGGAATTTGTCGGAGTC
TGAGGCTGAGTCGAGATCAGTGCTCTCAGCATCCTCACTAACCCAACTTTGCAAGCACGCGAGAGTCTTGACTAAATTTGGAGCCAATGTTAAGCACATAGAATCAATCG
CTTTCCCACTAATAGTAAATGCAGGCTTTAAAGCAATAGTGGACACTGAGATTGCTAATACATCACGGATAGTAGATTATCTACATTCCAAGGAATTGGAATTGCCATTA
GACAAGAAGCCGGATGACATGGAAGAAGCCAAATGGAAAAAGTTGAACAGGAAGGTGTTGGGTACGATTCGCCTAACATTAACAAAAAATGTGCAGAGCAGCGTAGCTAA
GGAGATTACCACAATGGGGCTGATGAGTGCACTGTCCAACATATATGAGAAGCCCTTAGTAAATAATAAGGTGGATCTTGCAACGAAATTTTTTAATTTGAAGATGGATG
AAGGTACATTTGTGGCTGCCCATATAAATGAATTTGATACGTTGATTAACAAACTGGTTGCTGTAGATTTAACATTTACGGATGAATTAAATGTTATCTTGTTGTTGAGA
TCTTTACCTGACAGTTGGGAGCCTATGAAGGCAGCTATTTCAAATTCTTGTGGAAAAGAGAAATTGAAATTCGCAGATGTAAGAGATGCAGCTCTTGGAGAGGAGATTCG
CAGAAAGGATTCTGGTATTGCGTCTACTTCTGGTACAACATTGCATGTGGATAGAGGAAGAAATAATAACAAAGACCACGGAAAACGTGGAAAGTCAAGAAACAACATAA
GTAAGTCTAGAAACAGCAAACTAGAATGTTGGAATTGTGGTAAGACAGGACATCTGAGGAGGAACTGCAAAGCTCCAAAGAAAGCTGAGGGGAAAGAAGCTGGTGCAAAT
GTTGTTGCCGAAGAAGTACATGATGCTCTAGTTCTTGTAGTGTATCTTGCTGATGGAGAACCTTTGGACATCAATGAGATTGGTGACGTTAATTTAAAAATGGCGAACGG
TTCAGTCTGGAATATTCGCAAGAATACTCAATATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAGAGATTGAAGATCAGAATACAATTACTCCTGAAGAAACAGCTG
TGGGATCTGATGAACAAGTTGAGGAATTTGATGAACCAGTTGTGGAAACTGATCAGTTGGACTTCTCCACGACTAGAGAGGAGTCGCTACCGTTGCAGAGGAGACATACT
GCTTGCTTTATCTTCAAGTGGGAGATTGTTGGGATTATGGAGCCAAACCAAGGAGAAAATCTTTTTCATATAAGGAAAAGAACTCTTTTGCAATTTGGATTGCTCGATAA
AACTATCTCCCTTTCTCACAATATCTGCAAGAAATTTCCTGTTTCAAAAGACAATTCCCACAATACTGGTTCTCATCCAGAGGATAGTGAGGAAGACCTTGTGGTGGTGT
CGATTGGTCGTCGTCTGGACCTTCGTAGCGTCGAGACGCTGGTCCAACACGCGCGTGCACGTCGAGATGCTGTCGCTGGCTTCTCAAAGCTGCCCCGGTGGTGA
Protein sequenceShow/hide protein sequence
MVVVMIDDGTTNDGSNGVWTADGDSDKRRWRHSSAMVVAFFLASLASLYSVEVHILEFDRRLPAILRHYFRMLFAVASSFLDLSFSSLTFGKDELSSELQMPYRGPAKQS
KNFDAKVTLPTTVPTVPQRLFEVVVNHDLLSWNLSESEAESRSVLSASSLTQLCKHARVLTKFGANVKHIESIAFPLIVNAGFKAIVDTEIANTSRIVDYLHSKELELPL
DKKPDDMEEAKWKKLNRKVLGTIRLTLTKNVQSSVAKEITTMGLMSALSNIYEKPLVNNKVDLATKFFNLKMDEGTFVAAHINEFDTLINKLVAVDLTFTDELNVILLLR
SLPDSWEPMKAAISNSCGKEKLKFADVRDAALGEEIRRKDSGIASTSGTTLHVDRGRNNNKDHGKRGKSRNNISKSRNSKLECWNCGKTGHLRRNCKAPKKAEGKEAGAN
VVAEEVHDALVLVVYLADGEPLDINEIGDVNLKMANGSVWNIRKNTQYISPPEVETKTTEIEDQNTITPEETAVGSDEQVEEFDEPVVETDQLDFSTTREESLPLQRRHT
ACFIFKWEIVGIMEPNQGENLFHIRKRTLLQFGLLDKTISLSHNICKKFPVSKDNSHNTGSHPEDSEEDLVVVSIGRRLDLRSVETLVQHARARRDAVAGFSKLPRW