CuGenDBv2

Moc06g33230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g33230
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr6:25178013..25180578
RNA-Seq Expression: Moc06g33230
Synteny: Moc06g33230
Gene Ontology terms:
    GO:0006313 - transposition, DNA-mediated (biological process)
    GO:0003677 - DNA binding (molecular function)
    GO:0004803 - transposase activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR006564 - Zinc finger, PMZ-type
    IPR007527 - Zinc finger, SWIM-type


Homology
GenBank top hits (e value, % identity, alignment)
XP_022145141.1 uncharacterized protein LOC111014656 [Momordica charantia] | e value: 9.2e-100 | % identity: 92.93
Query:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA
        HARKLPVTALLDHIRGVLQRWFYE +TLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV DGNLN DVDLQ+QTCT R+F+YFKV CSHAIA 
Subjt:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA

Query:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        A+SRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQP KKVVRVGRRQTVRIPSTG+VRPPRKCSRCGTSGHNRKTCREPLN V
Subjt:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] | e value: 3.5e-91 | % identity: 37.63
Query:  EDNVVHETASNRLTGQLEADRLQVMVQSAGTNDVKEGDVFDSKKEL------------------------------------------------------
        ++  V +   + LTGQ   + LQ++VQS+GTNDVKEG+VFD+KKEL                                                      
Subjt:  EDNVVHETASNRLTGQLEADRLQVMVQSAGTNDVKEGDVFDSKKEL------------------------------------------------------

Query:  -------------NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS----------------------
                     +HRQAKSWVVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN+SYDKAWRSSEEALRLIRGDPASS                      
Subjt:  -------------NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAI
          HARKLPVTALLDHIRG+LQ WFY+ +TLASSR +TLS YAE  +AE SDNARRH+V+NIDQF+ +V DGNL+G VD  S+TC CR+F+YFK+ CSHAI
Subjt:  --HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAI

Query:  AAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLN
        A A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+    V RVGRR+TVRIPSTG+VR  RKC RCGTSGHN KTC EPLN
Subjt:  AAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLN

XP_022154963.1 uncharacterized protein LOC111022107 [Momordica charantia] | e value: 1.1e-97 | % identity: 91.41
Query:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA
        HARKLPVT LLDHIRGVLQRWFYE +TLASSRQSTLSDYAEEMI EASDNAR HIVMNID FNFEV DGNLNGDVDLQSQTCTCR+F+YFKV CSHAIAA
Subjt:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA

Query:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        ANSRSINPYTLCDEAYTVNSWML  AEPIFPVGSSSTWKSSPGFVNIDVQP KKVVRVGRRQTVRIP TG+VRPPRKCSRCGTSGHN KTCREPLN V
Subjt:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

XP_022158655.1 uncharacterized protein LOC111025117 [Momordica charantia] | e value: 3.9e-90 | % identity: 91.71
Query:  LQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYT
        + RWFYE QTLASSRQSTLSDYAEEMIAEA+DN+RRHIVMNIDQFNFEV DGNLNGDVDLQSQTCTCR+F+YFKV CSHAIAAANSRSINPYTLCDEAYT
Subjt:  LQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYT

Query:  VNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        VNSWMLA+AEPIF VGSS+TWKSSPGFVNIDVQP KKVVRVGRRQTVRIPSTG+VRPPRKCSRCGTSGHNRKTCREPLN V
Subjt:  VNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia] | e value: 2.1e-88 | % identity: 44.18
Query:  NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS-----------------------------------
        +HRQ KSWVVGHLVQ KFTDVSRTYRPKDIIQD+R EYGVN+SYD+AWRSSEEALRLIRGDPASS                                   
Subjt:  NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEM
                                                                 H RKLPVTALLDHIRG LQ WFY+ +TLA+SR +TLSDYAE M
Subjt:  ---------------------------------------------------------HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEM

Query:  IAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
         AE SD+ARRH+V NIDQF+F+V DGNL+G VDL +  C+CR+F+YFK+ CSHAIAAA  R+INPY+LCDEAYT NSW+LAYAEPIFPVG  STW SSP 
Subjt:  IAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

Query:  FVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        FVNI V+P K V RVGRR+TVRIPSTG+VR  RKC RCG  GHNRKTC EPL  +
Subjt:  FVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CVG8 uncharacterized protein LOC111014656 | e value: 4.5e-100 | % identity: 92.93
Query:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA
        HARKLPVTALLDHIRGVLQRWFYE +TLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV DGNLN DVDLQ+QTCT R+F+YFKV CSHAIA 
Subjt:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA

Query:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        A+SRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQP KKVVRVGRRQTVRIPSTG+VRPPRKCSRCGTSGHNRKTCREPLN V
Subjt:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

A0A6J1DJT1 uncharacterized protein LOC111020715 | e value: 1.7e-91 | % identity: 37.63
Query:  EDNVVHETASNRLTGQLEADRLQVMVQSAGTNDVKEGDVFDSKKEL------------------------------------------------------
        ++  V +   + LTGQ   + LQ++VQS+GTNDVKEG+VFD+KKEL                                                      
Subjt:  EDNVVHETASNRLTGQLEADRLQVMVQSAGTNDVKEGDVFDSKKEL------------------------------------------------------

Query:  -------------NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS----------------------
                     +HRQAKSWVVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN+SYDKAWRSSEEALRLIRGDPASS                      
Subjt:  -------------NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAI
          HARKLPVTALLDHIRG+LQ WFY+ +TLASSR +TLS YAE  +AE SDNARRH+V+NIDQF+ +V DGNL+G VD  S+TC CR+F+YFK+ CSHAI
Subjt:  --HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAI

Query:  AAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLN
        A A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+    V RVGRR+TVRIPSTG+VR  RKC RCGTSGHN KTC EPLN
Subjt:  AAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLN

A0A6J1DLQ2 uncharacterized protein LOC111022107 | e value: 5.5e-98 | % identity: 91.41
Query:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA
        HARKLPVT LLDHIRGVLQRWFYE +TLASSRQSTLSDYAEEMI EASDNAR HIVMNID FNFEV DGNLNGDVDLQSQTCTCR+F+YFKV CSHAIAA
Subjt:  HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAA

Query:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        ANSRSINPYTLCDEAYTVNSWML  AEPIFPVGSSSTWKSSPGFVNIDVQP KKVVRVGRRQTVRIP TG+VRPPRKCSRCGTSGHN KTCREPLN V
Subjt:  ANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

A0A6J1DWF8 uncharacterized protein LOC111025117 | e value: 1.9e-90 | % identity: 91.71
Query:  LQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYT
        + RWFYE QTLASSRQSTLSDYAEEMIAEA+DN+RRHIVMNIDQFNFEV DGNLNGDVDLQSQTCTCR+F+YFKV CSHAIAAANSRSINPYTLCDEAYT
Subjt:  LQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYT

Query:  VNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        VNSWMLA+AEPIF VGSS+TWKSSPGFVNIDVQP KKVVRVGRRQTVRIPSTG+VRPPRKCSRCGTSGHNRKTCREPLN V
Subjt:  VNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

A0A6J1DYC4 uncharacterized protein LOC111025678 | e value: 1.0e-88 | % identity: 44.18
Query:  NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS-----------------------------------
        +HRQ KSWVVGHLVQ KFTDVSRTYRPKDIIQD+R EYGVN+SYD+AWRSSEEALRLIRGDPASS                                   
Subjt:  NHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASS-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEM
                                                                 H RKLPVTALLDHIRG LQ WFY+ +TLA+SR +TLSDYAE M
Subjt:  ---------------------------------------------------------HARKLPVTALLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEM

Query:  IAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
         AE SD+ARRH+V NIDQF+F+V DGNL+G VDL +  C+CR+F+YFK+ CSHAIAAA  R+INPY+LCDEAYT NSW+LAYAEPIFPVG  STW SSP 
Subjt:  IAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

Query:  FVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
        FVNI V+P K V RVGRR+TVRIPSTG+VR  RKC RCG  GHNRKTC EPL  +
Subjt:  FVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase | e value: 4.4e-07 | % identity: 36.11
Query:  NGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
        +G V L   TCTC +F   K  C HA+A  +   INP    D+ YTV  +   Y+    PV   S W  + G
Subjt:  NGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

AT1G64260.1 MuDR family transposase | e value: 5.5e-10 | % identity: 23.81
Query:  LQVMVQSAGTNDVKEGDVFDS-KKELNHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSHARKLPVTA
        L+ +V+ AG+ + KE   FDS   ++  +  ++W        K+ D    ++   +  D    YG+ +  D+     E    + RG P  + A    V  
Subjt:  LQVMVQSAGTNDVKEGDVFDS-KKELNHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSHARKLPVTA

Query:  LLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGD--VDLQSQTCTCRKFNYFKVSCSHAIAAANSRSIN
        + D +R    +      + + +R    ++   + + E   ++  +++  +++ +F+V + +   +  V L   TCTCRKF  +K  C HA+A      IN
Subjt:  LLDHIRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGD--VDLQSQTCTCRKFNYFKVSCSHAIAAANSRSIN

Query:  PYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW
        P    DE YTV  +   YA    PV   + W
Subjt:  PYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW


Sequences
CDS sequence
ATGCCTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACGTGGATGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTATGCTGA
ATTTCTAGGATATGTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAATGGAAATAACTG
ACGACGATGACCTGACTTTCTTCTTCACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAATCGTACATGCCTTCT
TTCCCATATTATTTAGGCCAACACATGTCCAATGTTCCTATTCCCTCAGCTTATGCCCCCCCATTTGCAAGACCCTTATTTCCGAGACCCTCATTTTCAAGTCCGTCAGT
TCCGTCCTCATCGTCGAACCCCTCTTCTTCTAACCGACCCCCCTACTTTGGTCATATTGGTCCTGATATAGCATCTCTCACACAGTTAGGGTCAGATGTTGTTCCTTATA
ATTTGGGAGATGATAGGGCATATGATTGGGATGTGCCTGGATTGTGGAATGGAAGTGAAAATGTGGATGAAGATAATGTCGTCCATGAAACGGCAAGCAATAGACTCACC
GGTCAGTTAGAAGCTGATAGATTGCAAGTCATGGTCCAATCGGCTGGGACCAATGATGTTAAGGAGGGTGACGTATTCGACTCGAAGAAGGAACTAAATCATCGTCAGGC
GAAGAGTTGGGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCCAAAGGACATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATA
TGAGTTACGACAAGGCCTGGCGTTCGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCAT
ATCAGAGGTGTGTTGCAGAGGTGGTTCTACGAATGTCAGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCGGATAA
TGCACGGAGACACATTGTGATGAACATCGACCAGTTTAATTTTGAGGTACTCGACGGGAACCTGAATGGGGACGTTGACTTGCAATCACAGACGTGTACTTGTCGGAAGT
TCAATTATTTTAAAGTCTCGTGCTCCCATGCTATTGCTGCAGCCAATTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACAGTCAACAGTTGGATGTTG
GCATATGCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCATCGAAGAAGGTCGTTAGGGTTGGACG
GCGACAGACGGTGAGGATTCCTTCCACAGGCAAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGTACATCAGGACACAATCGTAAAACTTGTCGCGAACCACTAAATA
ATGTGTAG
mRNA sequence
ATGCCTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACGTGGATGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTATGCTGA
ATTTCTAGGATATGTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAATGGAAATAACTG
ACGACGATGACCTGACTTTCTTCTTCACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAATCGTACATGCCTTCT
TTCCCATATTATTTAGGCCAACACATGTCCAATGTTCCTATTCCCTCAGCTTATGCCCCCCCATTTGCAAGACCCTTATTTCCGAGACCCTCATTTTCAAGTCCGTCAGT
TCCGTCCTCATCGTCGAACCCCTCTTCTTCTAACCGACCCCCCTACTTTGGTCATATTGGTCCTGATATAGCATCTCTCACACAGTTAGGGTCAGATGTTGTTCCTTATA
ATTTGGGAGATGATAGGGCATATGATTGGGATGTGCCTGGATTGTGGAATGGAAGTGAAAATGTGGATGAAGATAATGTCGTCCATGAAACGGCAAGCAATAGACTCACC
GGTCAGTTAGAAGCTGATAGATTGCAAGTCATGGTCCAATCGGCTGGGACCAATGATGTTAAGGAGGGTGACGTATTCGACTCGAAGAAGGAACTAAATCATCGTCAGGC
GAAGAGTTGGGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCCAAAGGACATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATA
TGAGTTACGACAAGGCCTGGCGTTCGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCAT
ATCAGAGGTGTGTTGCAGAGGTGGTTCTACGAATGTCAGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCGGATAA
TGCACGGAGACACATTGTGATGAACATCGACCAGTTTAATTTTGAGGTACTCGACGGGAACCTGAATGGGGACGTTGACTTGCAATCACAGACGTGTACTTGTCGGAAGT
TCAATTATTTTAAAGTCTCGTGCTCCCATGCTATTGCTGCAGCCAATTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACAGTCAACAGTTGGATGTTG
GCATATGCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCATCGAAGAAGGTCGTTAGGGTTGGACG
GCGACAGACGGTGAGGATTCCTTCCACAGGCAAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGTACATCAGGACACAATCGTAAAACTTGTCGCGAACCACTAAATA
ATGTGTAG
Protein sequence
MPRVFISFGGEWKDIEKDYVDGRTRGLTVDSKITYAEFLGYVCRLSSINPLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFFTGEDVSELPLYISTVPKKVHQNESYMPS
FPYYLGQHMSNVPIPSAYAPPFARPLFPRPSFSSPSVPSSSSNPSSSNRPPYFGHIGPDIASLTQLGSDVVPYNLGDDRAYDWDVPGLWNGSENVDEDNVVHETASNRLT
GQLEADRLQVMVQSAGTNDVKEGDVFDSKKELNHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSHARKLPVTALLDH
IRGVLQRWFYECQTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVLDGNLNGDVDLQSQTCTCRKFNYFKVSCSHAIAAANSRSINPYTLCDEAYTVNSWML
AYAEPIFPVGSSSTWKSSPGFVNIDVQPSKKVVRVGRRQTVRIPSTGKVRPPRKCSRCGTSGHNRKTCREPLNNV
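
The CDS and protein sequences listed for Moc06g33230 can be cross-checked by translating the CDS in frame. The following is a minimal sketch, assuming the standard nuclear genetic code applies; the names `translate`, `BASES`, `AMINO_ACIDS` and `CODON_TABLE` are illustrative and not part of CuGenDB. It is shown on the first and last codons of the CDS rather than the full record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (an assumption: nuclear code, table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Drop line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as "X".
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS above.
print(translate("ATGCCTCGTGTTTTTATA"))  # MPRVFI
# Last four codons of the CDS above, including the TAG stop.
print(translate("AATAATGTGTAG"))        # NNV
```

Both fragments agree with the start (MPRVFI) and end (NNV) of the listed protein sequence. Passing the full CDS to `translate` and comparing the result with the protein sequence would extend the same check to the whole record.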