CuGenDBv2

Spg030119 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030119
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ribonuclease H-like superfamily protein
Genome location: scaffold6:11309778..11313231
RNA-Seq Expression: Spg030119
Synteny: Spg030119
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
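
The genome location field above packs a sequence name and two coordinates into one string. As a minimal sketch, the following Python splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re


def parse_location(loc):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumption: start and end are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    # Location string copied from the record above.
    print(parse_location("scaffold6:11309778..11313231"))
    # → ('scaffold6', 11309778, 11313231, 3454)
```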


Homology
GenBank top hits (e value, % identity, alignment)
XP_015385738.1 uncharacterized protein LOC107177034 [Citrus sinensis] (e value: 7.0e-18, % identity: 29.37)
Query:  ENPVHMLWRCKSIRDIWISYFPNLKDSLLSCRENEEAVLVWDRLSEGMSKEEKERAAIIIWAIWNFRNK-ISVGKNHPNFQSLSRQIRRQIEDHFKTKDP
        E   H L  CK+ + +W  Y P   ++      N++ + +   +++ ++K + E    I W IW  RNK +  GK      ++SR     +E + + K P
Subjt:  ENPVHMLWRCKSIRDIWISYFPNLKDSLLSCRENEEAVLVWDRLSEGMSKEEKERAAIIIWAIWNFRNK-ISVGKNHPNFQSLSRQIRRQIEDHFKTKDP

Query:  NLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLE
         L      S  N   WSPPP    K+NVDA+ N    + G+G + RDS     VA           +  EA A+  GL    +++ +    I+VESD  E
Subjt:  NLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLE

Query:  VVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLARVA
        VVK +N +    +E  +++ +I  L+      ++   PR+ NT AHSLA++A
Subjt:  VVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLARVA

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 1.3e-19, % identity: 33.49)
Query:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG
        +EE+ R+ II W IW  RNK      HP  + +   I R I          +     KD +LI  I ++      W PP S   K+N +A+W      GG
Subjt:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG

Query:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRA
        +GWI RD  G  + A C  ++ +    YLE +AI EGL+A     ++    I +ESDSLE +  L++  ++++E  +LLE+I Q+     + +     R 
Subjt:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRA

Query:  SNTVAHSLARVA
        +N VAH LAR A
Subjt:  SNTVAHSLARVA

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 5.8e-20, % identity: 32.88)
Query:  KLENPVHMLWRCKSIRDIWISYFPNLKDSLLSCRENEEAVLVWDRLSEGMSKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHF----
        K E   H+LW CK I+DIWI+  P   +     R N      W+ L +   +EE+ R+ II   IW  RNK      H   + +   I R I +      
Subjt:  KLENPVHMLWRCKSIRDIWISYFPNLKDSLLSCRENEEAVLVWDRLSEGMSKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHF----

Query:  ----KTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGR
            K+KD + I  I ++      W PP S   K+N DA+W       G+GWI RD  G  +  GC  ++ +    YLE +AI EGL+A     ++    
Subjt:  ----KTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGR

Query:  IIVESDSLEVVKALNQDVE
        I +ESDSLE +  L++ V+
Subjt:  IIVESDSLEVVKALNQDVE

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 1.4e-18, % identity: 32.37)
Query:  SKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSL
        S E+ +   I  W IWN RN +     H +F ++ +Q+ + + +     + +L   + ++  N   W PPP     +N DASW++S  RGG+GWI R   
Subjt:  SKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSL

Query:  GSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLA
        G  ++AG   ++     K LEA AI EGL+    ++  +   + +E+DS EV   LN+  E+ ++T +++E+I  L     + AF K  R +N  AHSLA
Subjt:  GSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLA

Query:  RVAAGFR
        + A+  R
Subjt:  RVAAGFR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 1.9e-18, % identity: 32.72)
Query:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG
        +EE+ R+ II W IW  RNK      H   + +   I R I          +     KD +LI  I ++      W PP S   K+N DA+W      GG
Subjt:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG

Query:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLAS-----SRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFV
        +GWI RD  G  + A C  ++ +    YLE +AI EGL+A          ++    I +ESDSLE +  L++  ++++E  +LLE+I Q+     + +  
Subjt:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLAS-----SRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFV

Query:  KCPRASNTVAHSLARVA
           R +N VAH LAR A
Subjt:  KCPRASNTVAHSLARVA

TrEMBL top hits (e value, % identity, alignment)
A0A0A9TPB5 RNase H domain-containing protein (e value: 2.9e-17, % identity: 26.29)
Query:  HMLWRCKSIRDIW-ISYFPNLKDSLLSCRENEEAV-LVWDRLSEGMSKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLI
        H  ++CK++R+ W +      +  L  CR   E +  +W      ++   + +  +++W  W+ RNK +  +       +   +   + +  K + PN  
Subjt:  HMLWRCKSIRDIW-ISYFPNLKDSLLSCRENEEAV-LVWDRLSEGMSKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLI

Query:  EAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVK
              Q+    W PPP +  K+N DAS+N ++  GG G++AR+S G+ L  GC   QR   +   EALA    L+           RII+E+D+  +  
Subjt:  EAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVK

Query:  ALNQDVEEESETNFLLEDIDQLAVGAGVSAFVK-CPRASNTVAHSLARVAA
        A+     +  +   + + I ++ + + VS  V+ CPR  N VA SLA+  A
Subjt:  ALNQDVEEESETNFLLEDIDQLAVGAGVSAFVK-CPRASNTVAHSLARVAA

A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 6.2e-20, % identity: 33.49)
Query:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG
        +EE+ R+ II W IW  RNK      HP  + +   I R I          +     KD +LI  I ++      W PP S   K+N +A+W      GG
Subjt:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG

Query:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRA
        +GWI RD  G  + A C  ++ +    YLE +AI EGL+A     ++    I +ESDSLE +  L++  ++++E  +LLE+I Q+     + +     R 
Subjt:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRA

Query:  SNTVAHSLARVA
        +N VAH LAR A
Subjt:  SNTVAHSLARVA

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 2.8e-20, % identity: 32.88)
Query:  KLENPVHMLWRCKSIRDIWISYFPNLKDSLLSCRENEEAVLVWDRLSEGMSKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHF----
        K E   H+LW CK I+DIWI+  P   +     R N      W+ L +   +EE+ R+ II   IW  RNK      H   + +   I R I +      
Subjt:  KLENPVHMLWRCKSIRDIWISYFPNLKDSLLSCRENEEAVLVWDRLSEGMSKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHF----

Query:  ----KTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGR
            K+KD + I  I ++      W PP S   K+N DA+W       G+GWI RD  G  +  GC  ++ +    YLE +AI EGL+A     ++    
Subjt:  ----KTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGR

Query:  IIVESDSLEVVKALNQDVE
        I +ESDSLE +  L++ V+
Subjt:  IIVESDSLEVVKALNQDVE

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 6.9e-19, % identity: 32.37)
Query:  SKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSL
        S E+ +   I  W IWN RN +     H +F ++ +Q+ + + +     + +L   + ++  N   W PPP     +N DASW++S  RGG+GWI R   
Subjt:  SKEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSL

Query:  GSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLA
        G  ++AG   ++     K LEA AI EGL+    ++  +   + +E+DS EV   LN+  E+ ++T +++E+I  L     + AF K  R +N  AHSLA
Subjt:  GSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLA

Query:  RVAAGFR
        + A+  R
Subjt:  RVAAGFR

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 9.0e-19, % identity: 32.72)
Query:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG
        +EE+ R+ II W IW  RNK      H   + +   I R I          +     KD +LI  I ++      W PP S   K+N DA+W      GG
Subjt:  KEEKERAAIIIWAIWNFRNKISVGKNHPNFQSLSRQIRRQI----------EDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGG

Query:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLAS-----SRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFV
        +GWI RD  G  + A C  ++ +    YLE +AI EGL+A          ++    I +ESDSLE +  L++  ++++E  +LLE+I Q+     + +  
Subjt:  VGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLAS-----SRKISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFV

Query:  KCPRASNTVAHSLARVA
           R +N VAH LAR A
Subjt:  KCPRASNTVAHSLARVA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G33160.1 glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein (e value: 1.5e-05, % identity: 29)
Query:  IIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANH-NAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCA
        I+W +WN RN +   + H  +++  +  +  +++  +   P+ I +   +  +H   W  P + +VK NVD S+    I G  GWI RD  G    AG A
Subjt:  IIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANH-NAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCA

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.5e-13, % identity: 30.5)
Query:  HMLWRCKSIRDIW-ISYFPNLKDSLLSCRENEEAVLVWD-RLSEGMSKEEKERAAI--IIWAIWNFRNKISV-GKNHPNFQSLSRQIRRQIEDHFKTKDP
        H+L++C   R +W IS  P   +      ++  A L W   L   + K  K    +  ++W +W  RN++   GK +         +RR +ED  +    
Subjt:  HMLWRCKSIRDIW-ISYFPNLKDSLLSCRENEEAVLVWD-RLSEGMSKEEKERAAI--IIWAIWNFRNKISV-GKNHPNFQSLSRQIRRQIEDHFKTKDP

Query:  NLIEA-ITESQANHN---AWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVES
          +E   +  Q   N    W  PP ++VK N DA+W     R G+GWI R+  G  L  G   L R       E  A+R    A L  SR    RII ES
Subjt:  NLIEA-ITESQANHN---AWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVES

Query:  DSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLARVAAGF
        D+  +V  LN D +        LEDI QL        F   PR  N VA  +AR +  F
Subjt:  DSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLARVAAGF

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1) (e value: 1.3e-06, % identity: 24.8)
Query:  IWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQAN------HNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLV
        +W +W  RN+    +       ++++  ++  +  +T   +   + +  Q N         WSPPP  ++K N D+ + +        WI RDS G  + 
Subjt:  IWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQAN------HNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLV

Query:  AGCAKLQRKWPTKYLEALAIREGLK
        +GCAKLQ+ +     EAL     L+
Subjt:  AGCAKLQRKWPTKYLEALAIREGLK

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.7e-15, % identity: 27.69)
Query:  HMLWRCKSIRDIW-ISYFP-----NLKDSLLSCRENEEAVLVW-DRLSEGMSKEEKERAAI--IIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIED-HF
        H+L++C   R  W IS  P        DS+          L W   L  G  + EK    +  ++W +W  RN++       N Q + R+    +E+   
Subjt:  HMLWRCKSIRDIW-ISYFP-----NLKDSLLSCRENEEAVLVW-DRLSEGMSKEEKERAAI--IIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIED-HF

Query:  KTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVE
        +T+  +       ++++   W PPP ++VK N DA+WN    R G+GW+ R+  G     G   L +       E  A+R    A L+ SR     +I E
Subjt:  KTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVE

Query:  SDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLARVAAGF
        SDS  +++ LN D E        ++D+ +L        FV  PR  NT+A  +AR +  F
Subjt:  SDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLARVAAGF

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 7.0e-08, % identity: 24.7)
Query:  IIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANHNA-------WSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSP
        ++W IW   N +        FQ+    +   + D  +  D  +     +   N NA       WSPP  + +K N DAS +E     G+GWI R+S G+ 
Subjt:  IIWAIWNFRNKISVGKNHPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANHNA-------WSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSP

Query:  LVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFL
        +  G  K Q +  T+  E   +   ++A      K   ++I E D+  + + +N         +FL
Subjt:  LVAGCAKLQRKWPTKYLEALAIREGLKAYLASSRKISGRIIVESDSLEVVKALNQDVEEESETNFL
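
In the alignment blocks above, the unlabeled middle row carries a letter wherever query and subject residues agree, a `+` for a conservative substitution, and a space otherwise. The sketch below counts identities from that row alone. It assumes percent identity is identities divided by alignment length with gap columns included, which is the usual convention for BLAST-style reports but is not stated on this page; the function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row, midline_row):
    """Estimate percent identity from one aligned query row and its midline.

    Assumptions: both strings have already had the 'Query:  ' prefix or the
    equivalent leading padding removed, and the alignment length is the
    length of the query row including '-' gap characters.
    """
    length = len(query_row)
    if length == 0:
        raise ValueError("empty alignment row")
    # Trailing spaces are often stripped from the midline, so pad it back.
    midline = midline_row.ljust(length)[:length]
    # Letters mark identical residues; '+' and ' ' do not count.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / length


if __name__ == "__main__":
    # Toy rows, not taken from a complete alignment on this page.
    print(percent_identity("ENPVH", "E   H"))  # → 40.0
```

For an alignment split across several blocks, the rows of each block would need to be concatenated before calling the function.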


Sequences
CDS sequence
ATGGATATTCTCGAAAAGGAGCTCGAAAAGGAACTTGGGAAAGAGCATTTAAGGAGTTGTGTACTTATTTTGTTTAGTAATTTTACTTTTTCTTTGGCAAAATTGGAGAA
TCCGGTGCACATGCTATGGCGATGCAAATCCATAAGGGATATTTGGATTTCTTACTTCCCTAATCTTAAAGATTCCCTGTTATCTTGCAGGGAGAATGAGGAGGCGGTGC
TGGTTTGGGACAGGCTTTCGGAAGGAATGTCTAAAGAGGAAAAAGAGAGAGCAGCGATCATCATTTGGGCTATATGGAATTTTAGGAATAAAATCTCAGTGGGCAAAAAT
CATCCAAATTTCCAGAGTTTATCAAGGCAGATAAGGAGACAGATCGAAGATCATTTCAAGACTAAAGACCCGAACCTGATTGAGGCTATCACAGAGAGCCAGGCGAATCA
TAATGCGTGGTCCCCTCCCCCAAGCGAGTTCGTGAAGATTAATGTTGATGCTTCATGGAACGAGAGTCTGATCCGGGGAGGCGTTGGGTGGATAGCTCGTGATTCGTTAG
GATCTCCGCTGGTAGCTGGCTGCGCAAAATTACAGAGGAAATGGCCAACTAAATACTTAGAAGCGCTTGCAATTCGGGAGGGTTTGAAGGCGTACCTGGCGTCGTCTCGA
AAAATTTCTGGTCGGATTATCGTTGAGTCTGACTCGCTTGAAGTGGTGAAGGCTCTAAATCAGGATGTTGAGGAGGAGTCGGAGACCAATTTTCTGCTCGAAGACATCGA
TCAGTTGGCTGTTGGGGCAGGGGTTTCTGCTTTCGTTAAATGCCCACGGGCATCAAACACAGTCGCGCATTCCCTCGCGCGGGTGGCGGCCGGATTTCGGCCGGAGTTCT
CTCCGTCTACCGTTGTCGTCGACGAGTGTAATTTTTTTGGGCATTCTTCATCTTCCACGCTGGAAGTTTGTTCTTTTTTTGAGGAAGGGACTATCCCTTTTTGGCTATCC
CTTTTAATTATGGATGAATCGGCCTGTTGGCTTGATGATCTCTTGGATCGGCTTGTTGGCTTGTTGATCTCTTGGATCGGCCTCTTGTTGATCCTCTTGGATGATCGGCC
TTCTGACTTGTTGATCCTCTTGATCGGTCGGTTGGCTTACTCTTTGGCTTCTGACGACTCTGGACTCTGGCAGAATGCTCTGGACTCTGAGTTCTCTGACTCTGTCTCTG
AATCTGAATGCCCAAAATGA
mRNA sequence
ATGGATATTCTCGAAAAGGAGCTCGAAAAGGAACTTGGGAAAGAGCATTTAAGGAGTTGTGTACTTATTTTGTTTAGTAATTTTACTTTTTCTTTGGCAAAATTGGAGAA
TCCGGTGCACATGCTATGGCGATGCAAATCCATAAGGGATATTTGGATTTCTTACTTCCCTAATCTTAAAGATTCCCTGTTATCTTGCAGGGAGAATGAGGAGGCGGTGC
TGGTTTGGGACAGGCTTTCGGAAGGAATGTCTAAAGAGGAAAAAGAGAGAGCAGCGATCATCATTTGGGCTATATGGAATTTTAGGAATAAAATCTCAGTGGGCAAAAAT
CATCCAAATTTCCAGAGTTTATCAAGGCAGATAAGGAGACAGATCGAAGATCATTTCAAGACTAAAGACCCGAACCTGATTGAGGCTATCACAGAGAGCCAGGCGAATCA
TAATGCGTGGTCCCCTCCCCCAAGCGAGTTCGTGAAGATTAATGTTGATGCTTCATGGAACGAGAGTCTGATCCGGGGAGGCGTTGGGTGGATAGCTCGTGATTCGTTAG
GATCTCCGCTGGTAGCTGGCTGCGCAAAATTACAGAGGAAATGGCCAACTAAATACTTAGAAGCGCTTGCAATTCGGGAGGGTTTGAAGGCGTACCTGGCGTCGTCTCGA
AAAATTTCTGGTCGGATTATCGTTGAGTCTGACTCGCTTGAAGTGGTGAAGGCTCTAAATCAGGATGTTGAGGAGGAGTCGGAGACCAATTTTCTGCTCGAAGACATCGA
TCAGTTGGCTGTTGGGGCAGGGGTTTCTGCTTTCGTTAAATGCCCACGGGCATCAAACACAGTCGCGCATTCCCTCGCGCGGGTGGCGGCCGGATTTCGGCCGGAGTTCT
CTCCGTCTACCGTTGTCGTCGACGAGTGTAATTTTTTTGGGCATTCTTCATCTTCCACGCTGGAAGTTTGTTCTTTTTTTGAGGAAGGGACTATCCCTTTTTGGCTATCC
CTTTTAATTATGGATGAATCGGCCTGTTGGCTTGATGATCTCTTGGATCGGCTTGTTGGCTTGTTGATCTCTTGGATCGGCCTCTTGTTGATCCTCTTGGATGATCGGCC
TTCTGACTTGTTGATCCTCTTGATCGGTCGGTTGGCTTACTCTTTGGCTTCTGACGACTCTGGACTCTGGCAGAATGCTCTGGACTCTGAGTTCTCTGACTCTGTCTCTG
AATCTGAATGCCCAAAATGA
Protein sequence
MDILEKELEKELGKEHLRSCVLILFSNFTFSLAKLENPVHMLWRCKSIRDIWISYFPNLKDSLLSCRENEEAVLVWDRLSEGMSKEEKERAAIIIWAIWNFRNKISVGKN
HPNFQSLSRQIRRQIEDHFKTKDPNLIEAITESQANHNAWSPPPSEFVKINVDASWNESLIRGGVGWIARDSLGSPLVAGCAKLQRKWPTKYLEALAIREGLKAYLASSR
KISGRIIVESDSLEVVKALNQDVEEESETNFLLEDIDQLAVGAGVSAFVKCPRASNTVAHSLARVAAGFRPEFSPSTVVVDECNFFGHSSSSTLEVCSFFEEGTIPFWLS
LLIMDESACWLDDLLDRLVGLLISWIGLLLILLDDRPSDLLILLIGRLAYSLASDDSGLWQNALDSEFSDSVSESECPK
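
The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; the page states neither. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS string, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted in directly. Codons containing characters other than A, C, G, T
    become 'X'. A trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the CDS listed above.
    print(translate("ATGGATATTCTCGAAAAGGAGCTCGAAAAG"))  # → MDILEKELEK
    # Last 15 bases of the CDS listed above, ending in the TGA stop codon.
    print(translate("GAATGCCCAAAATGA"))  # → ECPK
```

Both fragments agree with the start (`MDILEKELEK`) and end (`ECPK`) of the protein sequence listed above.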