; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g23050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g23050
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:16697305..16703582
RNA-Seq ExpressionMoc04g23050
SyntenyMoc04g23050
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW24095.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.3e-4734.46Show/hide
Query:  NKSRN-SRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGI
        +KSR+  +++CWNCGK GH +   ++PKK K ++   N V EE+HDAL LAV+   + WV+DSGASFHTT   +I+ NYV  + GKVYLADG  LDI+G+
Subjt:  NKSRN-SRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGI

Query:  GDVNLKMANGLIWKIRKKVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKGKTEL----------------------------
        GD+ + + NG +W + +KVTK + V+A G+K  T Y+    +D IAV D S+ T LWH RL HMSEKG   L                            
Subjt:  GDVNLKMANGLIWKIRKKVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKGKTEL----------------------------

Query:  ----DGSTEKTQDI------------------SPPEVETKTTKIED--------------QNIITPEETVVGFDEQ------------------------
             G T K + +                  S    E+K  + ++                 I  E+T+ G  +Q                        
Subjt:  ----DGSTEKTQDI------------------SPPEVETKTTKIED--------------QNIITPEETVVGFDEQ------------------------

Query:  ----------------------------------VVESDEPVVETDQVQRTPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWE
                                            E D+  V +     TP   VRR S+ IRPPQRY   LNY+LLTD G+P  Y+EA+Q E+S KWE
Subjt:  ----------------------------------VVESDEPVVETDQVQRTPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWE

Query:  LAMKDEMNYLMINQT
        LAMKDEM+ L+ NQT
Subjt:  LAMKDEMNYLMINQT

RVW34552.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.4e-5233.88Show/hide
Query:  VSDVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNY
        + + + ++RG    ++ N  +     + +++CWNCGKIGH R   ++PKK K ++   NVV EE+HDAL LAV+     WV+DSGASFHTT   +I+ N+
Subjt:  VSDVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNY

Query:  VVENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVV
         + + GKVYLADG  LD++G+GDV + + NG +W + K                               KVTKG+ V+A G+K AT Y+    +D IAV 
Subjt:  VVENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVV

Query:  DHSSQTQLWHSRLAHMSEKGKTEL--DGSTEKTQDISPPEVET---------------KTTKIEDQNII-------TPEETVVGFDEQVVESDEPVVETD
        D S+ T LWH RL HMSEKG   L   G   + + I     E+               +T K E   ++       +P  ++ G    +   D+   +  
Subjt:  DHSSQTQLWHSRLAHMSEKGKTEL--DGSTEKTQDISPPEVET---------------KTTKIEDQNII-------TPEETVVGFDEQVVESDEPVVETD

Query:  QVQR-----------------------------------------------------------TPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYY
          +R                                                           TP   VRR SR IRPPQRY   LNY+LLTD GEP  Y
Subjt:  QVQR-----------------------------------------------------------TPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYY

Query:  EEAVQLEDSVKWELAMKDEMNYLMINQT
        +EA+Q E+S KWELAMKDEM+ L+ NQT
Subjt:  EEAVQLEDSVKWELAMKDEMNYLMINQT

RVW43552.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.4e-4635.43Show/hide
Query:  LFVSDVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILV
        +   +++R D G+         L  + R +          GH +   ++PKK K ++   N V +E+HDAL LAV+   + WV+DSGASFHTT   +I+ 
Subjt:  LFVSDVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILV

Query:  NYVVENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIA
        NYV  + GKV LADG  LD++G+GDV + + NG +W + K                               KVTKG+ V+A G+K  T Y+    +D IA
Subjt:  NYVVENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIA

Query:  VVDHSSQTQLWHSRLAHMSEKG-----------------------------KTELDGSTEKTQDISPPEVETKTTKIEDQNI--------ITPEETVVGF
        VVD S+ T LWH RL HMSEKG                             K E+ G+   T  +   +VE    +  D           I  E+T+ G 
Subjt:  VVDHSSQTQLWHSRLAHMSEKG-----------------------------KTELDGSTEKTQDISPPEVETKTTKIEDQNI--------ITPEETVVGF

Query:  DEQVV-------------ESDEPVVETDQVQ-RTPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWELAMKDEMNYLMINQT
                             E V    +V   TP   VRR SR IRPPQRY   LNY+LLTD GEP  Y+EA+Q E+S KWELAMKDEM+ L+ NQT
Subjt:  DEQVV-------------ESDEPVVETDQVQ-RTPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWELAMKDEMNYLMINQT

RVX19178.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.4e-4535.21Show/hide
Query:  KSRN-SRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGIG
        KSR+  +++CWNCGKIGH R   + PKK K ++   N   EE+HDAL LAV+   + WV+DSG  ++TT   +I+ NY V + GKVYLADG  LD++G+G
Subjt:  KSRN-SRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGIG

Query:  DVNLKMANGLIWKIRKKVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKGKTEL-------DGSTEKTQDIS-----------
        DV + + NG +W + +KVTKG+ V+A G+K  T Y+    +D IAV D ++ T LWH RL HMSEK    L       +  T K + +            
Subjt:  DVNLKMANGLIWKIRKKVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKGKTEL-------DGSTEKTQDIS-----------

Query:  --PPEVETKT-------------------------------------------------------TKIEDQNIITPEE-----------TVVGFDEQVVE
          P EV+  T                                                       T + +Q    P E            V+  D   + 
Subjt:  --PPEVETKT-------------------------------------------------------TKIEDQNIITPEE-----------TVVGFDEQVVE

Query:  SDEPVVETDQ---------------VQR----------------TPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWELAMKDE
        SD  V+E DQ               VQ+                TP   VRR SR IRPP+ Y   LNY+LL D GEP  Y+EA+Q E+S KWELAMKDE
Subjt:  SDEPVVETDQ---------------VQR----------------TPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWELAMKDE

Query:  MNYLMINQT
        M+ L+ NQT
Subjt:  MNYLMINQT

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]1.9e-5262.09Show/hide
Query:  NKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGIG
        ++SRNSR ECWNCGKIGHL+ N +APKK +G EA  N VAE+IHDAL +AVE A++TWV+DSG                  NHGKVYLADGEPLDIIGIG
Subjt:  NKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGIG

Query:  DVNLKMANGLIWKIRK---------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKG
        +VNLKMANG +WKIRK               KVTKG+MV+A G K  T YVN+NDKDM+AVVDHSS TQLWH+ L HMSEKG
Subjt:  DVNLKMANGLIWKIRK---------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKG

TrEMBL top hitse value%identityAlignment
A0A2N9ESU1 Uncharacterized protein8.7e-4831.84Show/hide
Query:  DVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVV
        +V+R D G+       S L  ++R  +LECWNCGK GH+R N    KK K +    NVV EE+HDAL L+V+    +WV+DSGASFHTT   +I+ NYV 
Subjt:  DVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVV

Query:  ENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDH
         + GKVYLAD E LD++G+GDV + + NG +W ++K                               K+TKG+MVVA G+K  T Y+  + +D IAV + 
Subjt:  ENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDH

Query:  SSQTQLWHSRLAHMSEKG--------------KTELD------------GS----------TEKTQDISP------------------------------
         + T LWH RL HMSEKG                E D            GS           EKT   +P                              
Subjt:  SSQTQLWHSRLAHMSEKG--------------KTELD------------GS----------TEKTQDISP------------------------------

Query:  ------------------------PE-------------------------------VETKTTKI---------------EDQNIITPEETVVGFDEQVV
                                PE                               ++ K+ K                +DQN        V F+EQV+
Subjt:  ------------------------PE-------------------------------VETKTTKI---------------EDQNIITPEETVVGFDEQVV

Query:  ESDEPVVETDQV------------------------------------QRTPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWE
          D    + D V                                    Q TP  TVRR SR IRPPQR+  +L YILLTD GEP  Y+EA+Q+EDS+KWE
Subjt:  ESDEPVVETDQV------------------------------------QRTPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWE

Query:  LAMKDEMNYLMINQT
        LAMKDEMN LM +QT
Subjt:  LAMKDEMNYLMINQT

A0A2N9EY29 Uncharacterized protein2.7e-4933.85Show/hide
Query:  KRADRGDDVEELRNSGLYNKSRNSR-LECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVE
        +  DR  + +  ++    +KS+  R LECWNCGK GH+R N    KK K +    NVV EE+HDAL L+V+    +WV+DSGASFHT    +I+ NYV  
Subjt:  KRADRGDDVEELRNSGLYNKSRNSR-LECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVE

Query:  NHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHS
        +  KVYLAD + LD++G+GDV + + NG IW ++K                               K+TKG+MVVA G+K +T Y+  + +D IAV +  
Subjt:  NHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHS

Query:  SQTQLWHSRLAHMSEKG--------------KTELD------------------GSTEKTQDI------------------SPPEVETKTTKIEDQNIIT
        + T LWH RL HMSEKG                E D                  G T K++++                  S   ++    +    N I 
Subjt:  SQTQLWHSRLAHMSEKG--------------KTELD------------------GSTEKTQDI------------------SPPEVETKTTKIEDQNIIT

Query:  PEETVV------GFDEQV------------------------------------------------------VESDEPVVETDQVQRTPPPTVRRCSRTI
         E+T+       G DE++                                                       E+ +P V+    Q TP  TVRR SR I
Subjt:  PEETVV------GFDEQV------------------------------------------------------VESDEPVVETDQVQRTPPPTVRRCSRTI

Query:  RPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWELAMKDEMNYLMINQT
        RPPQR+  +L YILLTD GEP  ++EA+Q+EDS+KWELAMKDEMN LM NQT
Subjt:  RPPQRYFLTLNYILLTDRGEPGYYEEAVQLEDSVKWELAMKDEMNYLMINQT

A0A2N9IKQ5 Uncharacterized protein4.8e-5436.07Show/hide
Query:  RGDDVEELRNSGLYNKSRNS-----RLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVE
        RG D    R      K R+      +LECWNCGK GH+R N    KK K +    NVV EE+HDAL L+V+    +WV+DSGASFHTT   +I+ NYV  
Subjt:  RGDDVEELRNSGLYNKSRNS-----RLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVE

Query:  NHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHS
        + GKVYLAD E LD++G+GDV + + NG +W ++K                               K+TKG+MVVA G+K  T Y+  + +D IAV +  
Subjt:  NHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHS

Query:  SQTQLWHSRLAHMSEKGKTEL----------DGS-----------TEKTQDISP--------------------------PEV-----------------
        + T LWH RL HMSEKG   L          +G             EKT   +P                          PE                  
Subjt:  SQTQLWHSRLAHMSEKGKTEL----------DGS-----------TEKTQDISP--------------------------PEV-----------------

Query:  ---------------------ETKTTKIEDQNIITPEETVVGFDE-----------QVVESDEPVVETDQVQRTPPPTVRRCSRTIRPPQRYFLTLNYIL
                             +  +TK++D  +   +   V  DE           +  E+ +P VE    Q TP  TVRR SR IRPPQR+  +L YIL
Subjt:  ---------------------ETKTTKIEDQNIITPEETVVGFDE-----------QVVESDEPVVETDQVQRTPPPTVRRCSRTIRPPQRYFLTLNYIL

Query:  LTDRGEPGYYEEAVQLEDSVKWELAMKDEMNYLMINQT
        LTD GEP  Y+EA+Q+EDS+KWELAMKDEMN LM +QT
Subjt:  LTDRGEPGYYEEAVQLEDSVKWELAMKDEMNYLMINQT

A0A438DGG6 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-5233.88Show/hide
Query:  VSDVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNY
        + + + ++RG    ++ N  +     + +++CWNCGKIGH R   ++PKK K ++   NVV EE+HDAL LAV+     WV+DSGASFHTT   +I+ N+
Subjt:  VSDVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNY

Query:  VVENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVV
         + + GKVYLADG  LD++G+GDV + + NG +W + K                               KVTKG+ V+A G+K AT Y+    +D IAV 
Subjt:  VVENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRK-------------------------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVV

Query:  DHSSQTQLWHSRLAHMSEKGKTEL--DGSTEKTQDISPPEVET---------------KTTKIEDQNII-------TPEETVVGFDEQVVESDEPVVETD
        D S+ T LWH RL HMSEKG   L   G   + + I     E+               +T K E   ++       +P  ++ G    +   D+   +  
Subjt:  DHSSQTQLWHSRLAHMSEKGKTEL--DGSTEKTQDISPPEVET---------------KTTKIEDQNII-------TPEETVVGFDEQVVESDEPVVETD

Query:  QVQR-----------------------------------------------------------TPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYY
          +R                                                           TP   VRR SR IRPPQRY   LNY+LLTD GEP  Y
Subjt:  QVQR-----------------------------------------------------------TPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPGYY

Query:  EEAVQLEDSVKWELAMKDEMNYLMINQT
        +EA+Q E+S KWELAMKDEM+ L+ NQT
Subjt:  EEAVQLEDSVKWELAMKDEMNYLMINQT

A0A6J1DF43 uncharacterized protein LOC1110204699.0e-5362.09Show/hide
Query:  NKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGIG
        ++SRNSR ECWNCGKIGHL+ N +APKK +G EA  N VAE+IHDAL +AVE A++TWV+DSG                  NHGKVYLADGEPLDIIGIG
Subjt:  NKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVEGAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGIG

Query:  DVNLKMANGLIWKIRK---------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKG
        +VNLKMANG +WKIRK               KVTKG+MV+A G K  T YVN+NDKDM+AVVDHSS TQLWH+ L HMSEKG
Subjt:  DVNLKMANGLIWKIRK---------------KVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWHSRLAHMSEKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACCTTCGACAGTGCCTCGTAGACCAGTAGTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCGGTGGAAGATGTCTCGGCAGTGTCTTGTGGAGGAATC
AACAAGTGGAGGAAGCTACCTGAGAAAGGGAAGACGAGATTAGAGCCCGATACCCTGAATTATTCGAACAACGAACTTTCGAGGACGAAAGTTTATAAAGGGGGA
AGTCTTGTTCGTATTTTCTTCTATCGGCACCTCAACCATCCCCGAAGCGCTCAACGAACGGCAACCTCCGACGTAGCGGCTTACCTTCTCCGTTTGACAGCAGCA
GACGGCGCGACAGGCAGTGGCAGCGTGCGACTCCGAGCAGCAGCGGTGCGCGGCGTCGGGCAGCAACGGGTGCACGGTTTCCAGCAGCAGCAGGCATTCGCGAGC
GTCGGGCAGCAGCGGTGCGCGGGTGTCGGGCAGCATCGGACGCGCGGATTCCGGCAGCAGCAGCCGTGCGCGGGCGTCGGGCAAAGGCAGGCGGCCGCGGTGCAC
GTGGGTGCTATCTTCGGGGTGTTTTCCGGCTGCGCAGCCCTTCCCTCGCGAGCAGGTCCGGTCTCCCTTGGTGGTCGAAAGTCGTGGCGCGGGTCCTTGCTGCAC
ACAGTGTCGTGGGTATCGATTTGGGTGAAAGACGTGCTTATGGGTTGTCTTGGAAGGATTAGTAGCAATTGTCAGTTCAAGGCCTTGGGGATAAATGGCAAGGCC
GAACGTCAAGCTTCTGTAGAGGAGTATTGGTATCTTGGGGATAAATGCATTGGGGATAAATGGCAAGGTCGAGCACCAAGCGTGGTAGAGGAAGATCGGTATCTT
GGGGATAAATGTCAAGCGCCGATACGCCTCGAAGAGTTAGACCTGGAGTACCAACATAGGGGGGGTAGAGCAAGTGCTAGCTCAAGTGGGGCTGGGGCTAGTGAG
TCGTGTGGTTGGAGTCTCGGAAATCCTGACGTTATGCCGCTTGGAGGGCTACTTACGGAGTATGTTATGTACTCACCCCCCTCTACCTTATTTGTTTCAGATGTG
AAGAGGGCCGACCGTGGTGATGACGTGGAGGAGCTTAGAAATTCAGGTCTTTACAACAAGTCTAGAAACAGTAGACTAGAATGTTGGAATTGTGGTAAGATAGGA
CATCTGAGAATGAACTCCGAAGCCCCAAAGAAAGCTAAGGGGAAAGAAGCTGGTACAAATGTTGTTGCAGAAGAAATACATGACGCTCTAACTCTCGCAGTTGAA
GGCGCTTATAACACATGGGTGGTGGATTCAGGTGCGTCTTTTCACACTACAGGACAATGTGACATTCTTGTGAATTATGTTGTAGAAAATCATGGAAAGGTGTAT
CTTGCTGATGGAGAGCCTTTGGACATCATTGGGATTGGTGACGTTAATTTAAAAATGGCGAACGGTTTAATCTGGAAGATTCGCAAGAAAGTTACAAAGGGTTCC
ATGGTGGTCGCTGGAGGAAGAAAGTTAGCAACTTTTTATGTCAACGACAACGATAAAGATATGATAGCTGTTGTAGATCATTCGAGTCAGACCCAATTATGGCAC
AGTAGGCTGGCACATATGAGTGAAAAAGGAAAGACAGAGTTAGATGGAAGCACAGAGAAGACTCAGGATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAAAG
ATTGAAGATCAGAATATAATTACTCCTGAAGAAACAGTTGTGGGATTTGATGAACAGGTTGTGGAATCTGATGAACCAGTTGTGGAAACTGATCAGGTTCAAAGA
ACACCACCACCAACAGTTAGAAGATGTAGTAGAACAATCAGACCACCACAGAGGTATTTCCTTACATTGAATTATATTTTACTAACTGACAGAGGTGAACCTGGA
TATTATGAAGAGGCGGTTCAACTGGAAGACTCTGTTAAGTGGGAGTTAGCCATGAAAGATGAAATGAATTATCTAATGATCAATCAGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCACCTTCGACAGTGCCTCGTAGACCAGTAGTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCGGTGGAAGATGTCTCGGCAGTGTCTTGTGGAGGAATC
AACAAGTGGAGGAAGCTACCTGAGAAAGGGAAGACGAGATTAGAGCCCGATACCCTGAATTATTCGAACAACGAACTTTCGAGGACGAAAGTTTATAAAGGGGGA
AGTCTTGTTCGTATTTTCTTCTATCGGCACCTCAACCATCCCCGAAGCGCTCAACGAACGGCAACCTCCGACGTAGCGGCTTACCTTCTCCGTTTGACAGCAGCA
GACGGCGCGACAGGCAGTGGCAGCGTGCGACTCCGAGCAGCAGCGGTGCGCGGCGTCGGGCAGCAACGGGTGCACGGTTTCCAGCAGCAGCAGGCATTCGCGAGC
GTCGGGCAGCAGCGGTGCGCGGGTGTCGGGCAGCATCGGACGCGCGGATTCCGGCAGCAGCAGCCGTGCGCGGGCGTCGGGCAAAGGCAGGCGGCCGCGGTGCAC
GTGGGTGCTATCTTCGGGGTGTTTTCCGGCTGCGCAGCCCTTCCCTCGCGAGCAGGTCCGGTCTCCCTTGGTGGTCGAAAGTCGTGGCGCGGGTCCTTGCTGCAC
ACAGTGTCGTGGGTATCGATTTGGGTGAAAGACGTGCTTATGGGTTGTCTTGGAAGGATTAGTAGCAATTGTCAGTTCAAGGCCTTGGGGATAAATGGCAAGGCC
GAACGTCAAGCTTCTGTAGAGGAGTATTGGTATCTTGGGGATAAATGCATTGGGGATAAATGGCAAGGTCGAGCACCAAGCGTGGTAGAGGAAGATCGGTATCTT
GGGGATAAATGTCAAGCGCCGATACGCCTCGAAGAGTTAGACCTGGAGTACCAACATAGGGGGGGTAGAGCAAGTGCTAGCTCAAGTGGGGCTGGGGCTAGTGAG
TCGTGTGGTTGGAGTCTCGGAAATCCTGACGTTATGCCGCTTGGAGGGCTACTTACGGAGTATGTTATGTACTCACCCCCCTCTACCTTATTTGTTTCAGATGTG
AAGAGGGCCGACCGTGGTGATGACGTGGAGGAGCTTAGAAATTCAGGTCTTTACAACAAGTCTAGAAACAGTAGACTAGAATGTTGGAATTGTGGTAAGATAGGA
CATCTGAGAATGAACTCCGAAGCCCCAAAGAAAGCTAAGGGGAAAGAAGCTGGTACAAATGTTGTTGCAGAAGAAATACATGACGCTCTAACTCTCGCAGTTGAA
GGCGCTTATAACACATGGGTGGTGGATTCAGGTGCGTCTTTTCACACTACAGGACAATGTGACATTCTTGTGAATTATGTTGTAGAAAATCATGGAAAGGTGTAT
CTTGCTGATGGAGAGCCTTTGGACATCATTGGGATTGGTGACGTTAATTTAAAAATGGCGAACGGTTTAATCTGGAAGATTCGCAAGAAAGTTACAAAGGGTTCC
ATGGTGGTCGCTGGAGGAAGAAAGTTAGCAACTTTTTATGTCAACGACAACGATAAAGATATGATAGCTGTTGTAGATCATTCGAGTCAGACCCAATTATGGCAC
AGTAGGCTGGCACATATGAGTGAAAAAGGAAAGACAGAGTTAGATGGAAGCACAGAGAAGACTCAGGATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAAAG
ATTGAAGATCAGAATATAATTACTCCTGAAGAAACAGTTGTGGGATTTGATGAACAGGTTGTGGAATCTGATGAACCAGTTGTGGAAACTGATCAGGTTCAAAGA
ACACCACCACCAACAGTTAGAAGATGTAGTAGAACAATCAGACCACCACAGAGGTATTTCCTTACATTGAATTATATTTTACTAACTGACAGAGGTGAACCTGGA
TATTATGAAGAGGCGGTTCAACTGGAAGACTCTGTTAAGTGGGAGTTAGCCATGAAAGATGAAATGAATTATCTAATGATCAATCAGACATGA
Protein sequenceShow/hide protein sequence
MAPSTVPRRPVVDELDRSEVELAVEDVSAVSCGGINKWRKLPEKGKTRLEPDTLNYSNNELSRTKVYKGGSLVRIFFYRHLNHPRSAQRTATSDVAAYLLRLTAA
DGATGSGSVRLRAAAVRGVGQQRVHGFQQQQAFASVGQQRCAGVGQHRTRGFRQQQPCAGVGQRQAAAVHVGAIFGVFSGCAALPSRAGPVSLGGRKSWRGSLLH
TVSWVSIWVKDVLMGCLGRISSNCQFKALGINGKAERQASVEEYWYLGDKCIGDKWQGRAPSVVEEDRYLGDKCQAPIRLEELDLEYQHRGGRASASSSGAGASE
SCGWSLGNPDVMPLGGLLTEYVMYSPPSTLFVSDVKRADRGDDVEELRNSGLYNKSRNSRLECWNCGKIGHLRMNSEAPKKAKGKEAGTNVVAEEIHDALTLAVE
GAYNTWVVDSGASFHTTGQCDILVNYVVENHGKVYLADGEPLDIIGIGDVNLKMANGLIWKIRKKVTKGSMVVAGGRKLATFYVNDNDKDMIAVVDHSSQTQLWH
SRLAHMSEKGKTELDGSTEKTQDISPPEVETKTTKIEDQNIITPEETVVGFDEQVVESDEPVVETDQVQRTPPPTVRRCSRTIRPPQRYFLTLNYILLTDRGEPG
YYEEAVQLEDSVKWELAMKDEMNYLMINQT