; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g13460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g13460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:10222916..10224165
RNA-Seq ExpressionMoc08g13460
SyntenyMoc08g13460
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAY61101.1 hypothetical protein CUMW_207140, partial [Citrus unshiu]5.2e-2324.58Show/hide
Query:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESN-----------------------CRRLWWSGVNLV
        +E ++   FDK +A++I +I +    + D  IWH+   G ++V+SGY+   +     +PS SE +                        + L  S  NL 
Subjt:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESN-----------------------CRRLWWSGVNLV

Query:  DRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNARALSNE------------------------------------
         R +V    C LC    E++ H+   CK A +VW+ S F      A G  +++ L   + + +                                     
Subjt:  DRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNARALSNE------------------------------------

Query:  ------------SANSLLSDRWNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELD
                    SA+ +    WNPP EG  KIN DAA +   +  GLG V+RD + Q+  +A    +F  SV  A+A A +WG+++ +D  ++  ++E D
Subjt:  ------------SANSLLSDRWNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELD

Query:  SYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGLSK
        S  V  L  + +   S++ W    ++    S    S  +  R+ N +   LA+L L K
Subjt:  SYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGLSK

XP_018805736.2 uncharacterized protein LOC108979499 [Juglans regia]1.6e-2427.42Show/hide
Query:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSES-NCRRLW----------------WSGV--------N
        N  +V   F K+EA++I++I +    + D  IW   S G F+V+S Y + Q  I +    PS S N + +W                W           N
Subjt:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSES-NCRRLW----------------WSGV--------N

Query:  LVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKA----------SRFRFL---FPPASGL-------FVM---------------------NTLT-
        L  R +V + +C +C    E + H+ W C+ A  VW            S   FL      A G+       FVM                     N +T 
Subjt:  LVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKA----------SRFRFL---FPPASGL-------FVM---------------------NTLT-

Query:  ----NARAL------SNESANSLLSDR--WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL
            N R +       NES+ S+ S+   W PPP   YK N+DAA D   + VG+G+VVRDC  +++     +   +    LA+A  A   V    D+GL
Subjt:  ----NARAL------SNESANSLLSDR--WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL

Query:  RPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILAR
           + E DS +  Q  + EE+ LS  G +    K+  ++  S + + +KR+GN +  +LA+
Subjt:  RPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILAR

XP_022149385.1 uncharacterized protein LOC111017816 [Momordica charantia]3.5e-3540Show/hide
Query:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESNCRRLWWSGVNLVDRGVVGSNRCCLCGRTGEDLLHS
        NEGI+R AFD+ EA+LILRI VGVD  +DMRIWHYDSKGLF+++S Y VDQS++L R+ S S S+ R                           EDLLH+
Subjt:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESNCRRLWWSGVNLVDRGVVGSNRCCLCGRTGEDLLHS

Query:  FWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNAR-------------------ALSNESANSLLSDRWNPPPEGCYKINYDAAFDLASS---------
         W CK+  RVWKASR R+  P      +M  L + +                   +L N+   + +S R  P   G +   +   F  A++         
Subjt:  FWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNAR-------------------ALSNESANSLLSDRWNPPPEGCYKINYDAAFDLASS---------

Query:  -SVGLGVVV-----RDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL
         S     ++     RDC  Q+MLSAATR+  MA+VDLA+ASAAKWG++L RDMGL
Subjt:  -SVGLGVVV-----RDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]2.1e-3230.75Show/hide
Query:  IVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILN----RLPSPSESNCRRLWW-------------------------SGV
        +VR  F  DEA  IL I +G   + D  IW+Y+  G++SVRSGY+V    +LN    + PS S S   R WW                         +G 
Subjt:  IVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILN----RLPSPSESNCRRLWW-------------------------SGV

Query:  NLVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFP------------------PASGLFVMNTLTNARALSN----------------
        NL  RGV  +N C  CGR GED +H FW CK A  +W  S+F  L P                      ++ +    NARA ++                
Subjt:  NLVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFP------------------PASGLFVMNTLTNARALSN----------------

Query:  --------ESANSLLSDR--------WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPC
                E+ ++ ++ R        W PP EG YKIN DA+F  +    GLG+++ +   Q+M +A   +E + SVD+A+A AA  G++L  ++G+ P 
Subjt:  --------ESANSLLSDR--------WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPC

Query:  VLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGL
        +                E LS+ G    + K   T    +SF+FVKR GNK   +LAR  L
Subjt:  VLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGL

XP_024046732.1 uncharacterized protein LOC112101057 [Citrus clementina]1.4e-2326.39Show/hide
Query:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESNCRRLWWSGVNLVDRGVVGSNRCCLCGRTGEDLLHS
        N  ++   F K+ A +I  I +    KND R+WHYD KG ++V+SGY+V  +  L  + +P+ S          +L  R ++    C +C    ED  H+
Subjt:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESNCRRLWWSGVNLVDRGVVGSNRCCLCGRTGEDLLHS

Query:  FWRCKIAARVWKAS-------------------------------------------RFRFLFPPASGLFVMNTLTNARA-------------LSNESAN
        F  CK+A +VWK S                                           R + +F     +  M+ +  A+A              S  S +
Subjt:  FWRCKIAARVWKAS-------------------------------------------RFRFLFPPASGLFVMNTLTNARA-------------LSNESAN

Query:  SLLSDRWNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSYRVFQLFQSEEEVL
            D+W PPP G YK+N DAA        GLGVV+R+   Q++++A    +F  +V  A+A A KWG+E+  +  L   ++E D   V  L  S+    
Subjt:  SLLSDRWNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSYRVFQLFQSEEEVL

Query:  SDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGL
         ++ W    +        S S   V R  N     LA+  L
Subjt:  SDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGL

TrEMBL top hitse value%identityAlignment
A0A2I4DF11 uncharacterized protein LOC1089794997.9e-2527.42Show/hide
Query:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSES-NCRRLW----------------WSGV--------N
        N  +V   F K+EA++I++I +    + D  IW   S G F+V+S Y + Q  I +    PS S N + +W                W           N
Subjt:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSES-NCRRLW----------------WSGV--------N

Query:  LVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKA----------SRFRFL---FPPASGL-------FVM---------------------NTLT-
        L  R +V + +C +C    E + H+ W C+ A  VW            S   FL      A G+       FVM                     N +T 
Subjt:  LVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKA----------SRFRFL---FPPASGL-------FVM---------------------NTLT-

Query:  ----NARAL------SNESANSLLSDR--WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL
            N R +       NES+ S+ S+   W PPP   YK N+DAA D   + VG+G+VVRDC  +++     +   +    LA+A  A   V    D+GL
Subjt:  ----NARAL------SNESANSLLSDR--WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL

Query:  RPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILAR
           + E DS +  Q  + EE+ LS  G +    K+  ++  S + + +KR+GN +  +LA+
Subjt:  RPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILAR

A0A2N9GIC4 Reverse transcriptase domain-containing protein7.9e-2527.36Show/hide
Query:  IVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYR-----VDQSQILNRLPSPSESNCRRLW------------WSG--------VNLVD
        ++ + F   +A  I +I +      D  IW  ++ G ++VRSGYR      D+S   + +P+P +   + +W            W          +NL  
Subjt:  IVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYR-----VDQSQILNRLPSPSESNCRRLW------------WSG--------VNLVD

Query:  RGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNARALSNESANSLLSDRWNPPPEGCYKINYDAAFDLASSSVGLGV
        R +     C +CG   ED LH+ W CK    VW                ++ ++      SN +    L+ RW P  +  YK+NYD A    +++ G+GV
Subjt:  RGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNARALSNESANSLLSDRWNPPPEGCYKINYDAAFDLASSSVGLGV

Query:  VVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVR
        +VRD    +M S   ++ F  S+   +A A K  ++ V ++GL     E DS  +       +  L+  G      KV A+ + S SFS VKR GN++  
Subjt:  VVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVR

Query:  ILARLGL
         LAR  L
Subjt:  ILARLGL

A0A2N9HEC7 Reverse transcriptase domain-containing protein1.8e-2427.41Show/hide
Query:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRV-----DQSQILNRLPSPSESNCRRLW------------WSG--------VN
        +  ++ S F   +A  I +I +      D  IW  ++ G ++VRSGYR      D++   + +P+P +   + +W            W          +N
Subjt:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRV-----DQSQILNRLPSPSESNCRRLW------------WSG--------VN

Query:  LVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNARAL----------SNESANSLLSDRWNPPPEGCYKINYDA
        L  R +     C +CG   ED +H+ W CK    VW   R   L        + +    A++           SN      L+ RW P  +  YK+NYD 
Subjt:  LVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNARAL----------SNESANSLLSDRWNPPPEGCYKINYDA

Query:  AFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSS
        A    +++ G+GV+VRD    +M S + ++ F  SV   +A A K  ++ V ++GL     E DS  +       +  L+  G      KV A+ + S S
Subjt:  AFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSS

Query:  FSFVKRAGNKVVRILARLGLS
        FS VKR GN++   LAR  LS
Subjt:  FSFVKRAGNKVVRILARLGLS

A0A6J1D5K1 uncharacterized protein LOC1110178161.7e-3540Show/hide
Query:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESNCRRLWWSGVNLVDRGVVGSNRCCLCGRTGEDLLHS
        NEGI+R AFD+ EA+LILRI VGVD  +DMRIWHYDSKGLF+++S Y VDQS++L R+ S S S+ R                           EDLLH+
Subjt:  NEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESNCRRLWWSGVNLVDRGVVGSNRCCLCGRTGEDLLHS

Query:  FWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNAR-------------------ALSNESANSLLSDRWNPPPEGCYKINYDAAFDLASS---------
         W CK+  RVWKASR R+  P      +M  L + +                   +L N+   + +S R  P   G +   +   F  A++         
Subjt:  FWRCKIAARVWKASRFRFLFPPASGLFVMNTLTNAR-------------------ALSNESANSLLSDRWNPPPEGCYKINYDAAFDLASS---------

Query:  -SVGLGVVV-----RDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL
         S     ++     RDC  Q+MLSAATR+  MA+VDLA+ASAAKWG++L RDMGL
Subjt:  -SVGLGVVV-----RDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGL

A0A6J1DAR4 uncharacterized protein LOC1110189541.0e-3230.75Show/hide
Query:  IVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILN----RLPSPSESNCRRLWW-------------------------SGV
        +VR  F  DEA  IL I +G   + D  IW+Y+  G++SVRSGY+V    +LN    + PS S S   R WW                         +G 
Subjt:  IVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILN----RLPSPSESNCRRLWW-------------------------SGV

Query:  NLVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFP------------------PASGLFVMNTLTNARALSN----------------
        NL  RGV  +N C  CGR GED +H FW CK A  +W  S+F  L P                      ++ +    NARA ++                
Subjt:  NLVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAARVWKASRFRFLFP------------------PASGLFVMNTLTNARALSN----------------

Query:  --------ESANSLLSDR--------WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPC
                E+ ++ ++ R        W PP EG YKIN DA+F  +    GLG+++ +   Q+M +A   +E + SVD+A+A AA  G++L  ++G+ P 
Subjt:  --------ESANSLLSDR--------WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPC

Query:  VLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGL
        +                E LS+ G    + K   T    +SF+FVKR GNK   +LAR  L
Subjt:  VLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein2.8e-0626Show/hide
Query:  TNARALSNESANSLL-SDRWNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSY
        TN    +N    S   S +WNPPPEG  K N+D+ +   S     G  +R+C+  ++L    +++       A+A      ++++   GLR    E DS 
Subjt:  TNARALSNESANSLL-SDRWNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLRPCVLELDSY

Query:  RVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILA
         +  L  + E+  S +G     ++     +   S  FV R  N     LA
Subjt:  RVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILA

AT3G09510.1 Ribonuclease H-like superfamily protein5.4e-1019.68Show/hide
Query:  DKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPS-----------------PSESNCRRLWWSGVN--------LVDRGVVGS
        D+ +   I RI +   +K D  IW+Y++ G ++VRSGY +        +P+                 P     +   W  ++        L  RG+   
Subjt:  DKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPS-----------------PSESNCRRLWWSGVN--------LVDRGVVGS

Query:  NRCCLCGRTGEDLLHSFWRCKIAARVWK-------------------------------ASRFRFLFP--------PASGLFVMN--------TLTNARA
          C  C R  E + H+ + C  A   W+                                S F  L P         A    V N        T+ +A+A
Subjt:  NRCCLCGRTGEDLLHSFWRCKIAARVWK-------------------------------ASRFRFLFP--------PASGLFVMN--------TLTNARA

Query:  LSNESANSLLSDR---------------WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLR
         +++  N+  S +               W  PP    K N+DA FD+       G ++R+ +   +   + ++   ++   A+  A    ++     G  
Subjt:  LSNESANSLLSDR---------------WNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDMGLR

Query:  PCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGLSKGVFQFG
           +E D   +  L        S +    E +   A    S  F F++R GNK+  +LA+ G +   F  G
Subjt:  PCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGLSKGVFQFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGAGGGGATTGTTCGGAGTGCATTTGATAAAGATGAGGCCAATTTGATTTTACGTATCCAAGTTGGTGTTGATAGGAAGAATGATATGAGAATCTGGCACTATGA
TTCCAAGGGCTTGTTCTCGGTTAGAAGCGGATATCGAGTTGATCAGTCTCAAATTCTTAATCGGCTTCCTTCTCCATCCGAATCAAATTGTCGGCGATTATGGTGGTCGG
GTGTTAATTTGGTGGATCGTGGTGTTGTTGGGTCCAATCGATGCTGTTTGTGTGGGCGGACTGGTGAAGATCTCCTTCACTCTTTCTGGCGCTGTAAAATTGCTGCTCGG
GTATGGAAGGCTTCACGGTTCCGTTTCCTTTTCCCGCCAGCATCTGGATTATTTGTTATGAATACTTTAACTAATGCTCGGGCCCTATCAAACGAATCTGCTAATAGCCT
TTTGTCTGATCGTTGGAATCCTCCTCCTGAGGGTTGTTACAAAATTAATTATGATGCTGCTTTTGATTTGGCGTCGTCATCGGTTGGTTTGGGTGTGGTTGTTAGGGATT
GCCACGATCAGATGATGCTTTCGGCTGCGACTAGAATTGAGTTTATGGCAAGTGTTGATTTGGCTAAAGCTTCGGCGGCGAAGTGGGGTGTTGAGCTTGTGAGGGATATG
GGTCTCAGACCTTGCGTGTTAGAATTGGATTCCTATCGAGTTTTTCAGCTGTTTCAATCGGAGGAGGAAGTTCTGTCAGATGTCGGGTGGTTTGCTGAGAGGTTGAAGGT
GGCGGCAACTAGTATCTGTTCTTCTTCCTTTAGTTTTGTGAAGAGAGCTGGAAATAAAGTGGTTCGTATTCTTGCTAGATTGGGTTTGTCTAAGGGGGTATTTCAATTTG
GTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGAGGGGATTGTTCGGAGTGCATTTGATAAAGATGAGGCCAATTTGATTTTACGTATCCAAGTTGGTGTTGATAGGAAGAATGATATGAGAATCTGGCACTATGA
TTCCAAGGGCTTGTTCTCGGTTAGAAGCGGATATCGAGTTGATCAGTCTCAAATTCTTAATCGGCTTCCTTCTCCATCCGAATCAAATTGTCGGCGATTATGGTGGTCGG
GTGTTAATTTGGTGGATCGTGGTGTTGTTGGGTCCAATCGATGCTGTTTGTGTGGGCGGACTGGTGAAGATCTCCTTCACTCTTTCTGGCGCTGTAAAATTGCTGCTCGG
GTATGGAAGGCTTCACGGTTCCGTTTCCTTTTCCCGCCAGCATCTGGATTATTTGTTATGAATACTTTAACTAATGCTCGGGCCCTATCAAACGAATCTGCTAATAGCCT
TTTGTCTGATCGTTGGAATCCTCCTCCTGAGGGTTGTTACAAAATTAATTATGATGCTGCTTTTGATTTGGCGTCGTCATCGGTTGGTTTGGGTGTGGTTGTTAGGGATT
GCCACGATCAGATGATGCTTTCGGCTGCGACTAGAATTGAGTTTATGGCAAGTGTTGATTTGGCTAAAGCTTCGGCGGCGAAGTGGGGTGTTGAGCTTGTGAGGGATATG
GGTCTCAGACCTTGCGTGTTAGAATTGGATTCCTATCGAGTTTTTCAGCTGTTTCAATCGGAGGAGGAAGTTCTGTCAGATGTCGGGTGGTTTGCTGAGAGGTTGAAGGT
GGCGGCAACTAGTATCTGTTCTTCTTCCTTTAGTTTTGTGAAGAGAGCTGGAAATAAAGTGGTTCGTATTCTTGCTAGATTGGGTTTGTCTAAGGGGGTATTTCAATTTG
GTTAG
Protein sequenceShow/hide protein sequence
MNEGIVRSAFDKDEANLILRIQVGVDRKNDMRIWHYDSKGLFSVRSGYRVDQSQILNRLPSPSESNCRRLWWSGVNLVDRGVVGSNRCCLCGRTGEDLLHSFWRCKIAAR
VWKASRFRFLFPPASGLFVMNTLTNARALSNESANSLLSDRWNPPPEGCYKINYDAAFDLASSSVGLGVVVRDCHDQMMLSAATRIEFMASVDLAKASAAKWGVELVRDM
GLRPCVLELDSYRVFQLFQSEEEVLSDVGWFAERLKVAATSICSSSFSFVKRAGNKVVRILARLGLSKGVFQFG