; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012626 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012626
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr1:42774838..42775755
RNA-Seq ExpressionLag0012626
SyntenyLag0012626
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RYR44318.1 hypothetical protein Ahy_A08g040672 [Arachis hypogaea]1.9e-2427.21Show/hide
Query:  SASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEG--------ILPLMNKGFWKPM
        S S+S+     W  +WKLKVP K++ F+W++ H  +P   NL++  +  +  CP+C ++ E+T+HAL LC   R VW G           +++ G W   
Subjt:  SASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEG--------ILPLMNKGFWKPM

Query:  DIKDRWLSLGDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPN----VQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQDAEEIIMRC--DAAYDE
          K   +  G         +    W IW  +N   H R  P+    ++   +  +++ D  +   I   +++    D R   +   +  ++C  DAA+ E
Subjt:  DIKDRWLSLGDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPN----VQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQDAEEIIMRC--DAAYDE

Query:  INCTVGIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRGE
        +        VFR+  G+L A     S+I+++SPL AE  AV + L  A++ ++  + V SDSL  I+ ++ +
Subjt:  INCTVGIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRGE

XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]2.7e-2332.85Show/hide
Query:  FVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLGDCQS-KQLDLISIGAWAIWNDKNN
        F+W+ F++ +P++ NL +  +  +  C VC ++ ETTDHALF C RA+EVW  +LP           ++D  L L +  S    DL+ +G WAIWND+N 
Subjt:  FVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLGDCQS-KQLDLISIGAWAIWNDKNN

Query:  VFHQRPVPNVQTRSEWILEYLDEFQIANIVG---GAVNQTVDDVRRILQD------AEEIIMRCDAAYDEINCTVGIGLVFRERQGNLKAVIKVMSKISS
        +  QR +P+ + RS+WIL Y+ +FQ+ ++       + +      R +++      A  I +  DAA  +     GIG+V R  +G + A        +S
Subjt:  VFHQRPVPNVQTRSEWILEYLDEFQIANIVG---GAVNQTVDDVRRILQD------AEEIIMRCDAAYDEINCTVGIGLVFRERQGNLKAVIKVMSKISS

Query:  TSPLGAE
          PL AE
Subjt:  TSPLGAE

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.6e-2930.34Show/hide
Query:  ASASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMN-KGFWKPMDIKDRW
        A+++S+      W  +WKL VP K+K F+W+S HE IPT  NL    +     C +C ++ E+  HA F C RAR++W  + P +        +   + W
Subjt:  ASASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMN-KGFWKPMDIKDRW

Query:  LSLGD-CQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQ-----DAEEIIMRCDAAYDEINCTVG
         SL +  + K L+L +I  W IWND+N++ H + V  V+ + EW+  +LD    A  +     +T  + R ++Q      +  + +  DAA      +  
Subjt:  LSLGD-CQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQ-----DAEEIIMRCDAAYDEINCTVG

Query:  IGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRGEV
         G + R+   +L A   +       SPL AE+  +L+ L+FA +     L V SDSL  I+ +R E+
Subjt:  IGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRGEV

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]2.2e-2528.31Show/hide
Query:  SSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSL--
        SS     WW+K WKL++P+K++ FVWK FH  +P  + L R H+  S  CP+C+ + ET  HALFLCSRA+EVW+  +  +N  F            L  
Subjt:  SSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSL--

Query:  -GDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANI----------------VGGAVNQTVDDVRRILQDAEEIIMRCDAAY
         G   + + +      W+IW ++N  FH +P        ++ L+Y+ ++Q  +                    +   +VD  +   Q   +  +  DAA+
Subjt:  -GDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANI----------------VGGAVNQTVDDVRRILQDAEEIIMRCDAAY

Query:  DEINCTVGIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR
        D+ N  +GIG V R+  G +KA     S   S      E  A++  L++ +S+ +    + +DSL  ++ ++
Subjt:  DEINCTVGIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR

XP_030505432.1 uncharacterized protein LOC115720422 [Cannabis sativa]9.4e-2430.08Show/hide
Query:  SASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFW---KPMDIKDR
        S+S++   ++WW   W  K+P K+K+F W++FH  +PT  NL+R  V  S  C  C    ET  HAL  CSR R+VW+ + PL +  F+   +  DIKD 
Subjt:  SASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFW---KPMDIKDR

Query:  WLS-LGDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDD-----VRRILQDAEEIIMRCDAAYDEINCTV
         LS   D  +    L+    W+IW  +N    +   P+  +  +WI  YL +++ A +    V  T  +     V R+ + + ++    DAA ++ N  V
Subjt:  WLS-LGDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDD-----VRRILQDAEEIIMRCDAAYDEINCTV

Query:  GIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSD
        G+G V ++  G + A +  M   +   PL AE  A+   L +  ++++   S+ +D
Subjt:  GIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSD

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248741.2e-2930.34Show/hide
Query:  ASASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMN-KGFWKPMDIKDRW
        A+++S+      W  +WKL VP K+K F+W+S HE IPT  NL    +     C +C ++ E+  HA F C RAR++W  + P +        +   + W
Subjt:  ASASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMN-KGFWKPMDIKDRW

Query:  LSLGD-CQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQ-----DAEEIIMRCDAAYDEINCTVG
         SL +  + K L+L +I  W IWND+N++ H + V  V+ + EW+  +LD    A  +     +T  + R ++Q      +  + +  DAA      +  
Subjt:  LSLGD-CQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQ-----DAEEIIMRCDAAYDEINCTVG

Query:  IGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRGEV
         G + R+   +L A   +       SPL AE+  +L+ L+FA +     L V SDSL  I+ +R E+
Subjt:  IGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRGEV

A0A803NM27 Uncharacterized protein2.4e-2529.23Show/hide
Query:  SSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLG
        S S     WW   W+LK+P KVK F WK+ H  +P  + L++     S  C +C    E+  HA+F C  AR VW+      N      M I+D    + 
Subjt:  SSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLG

Query:  DCQSK-QLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIV----GGAVNQTVDDVRRILQDAEEII-MRCDAAYDEINCTVGIGLV
        +C +K +L++I    W+IW+D+NNV H +        S     +L  FQ A  +    G A   T    R        ++ +  DAA+D+    +G G +
Subjt:  DCQSK-QLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIV----GGAVNQTVDDVRRILQDAEEII-MRCDAAYDEINCTVGIGLV

Query:  FRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR
         R+  GN+KA +          P   E   +   L++AR L  +   V +DSL  +  +R
Subjt:  FRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR

A0A803PCG2 Uncharacterized protein4.1e-2529.12Show/hide
Query:  SASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRW-L
        S S+      WW  +W LK+P K+KHF WK+F+  +P   NL+  H   S  C  C  + E+  HAL  C++ +++W   +          +DIKD + L
Subjt:  SASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRW-L

Query:  SLGDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQDA---EEIIMRCDAAYDEINCTVGIGLV
        SL      Q+ L     W IWN +N    Q+  P    + E ++ +L E+Q A  +    + +V     +L          +  DAA +  N   G GLV
Subjt:  SLGDCQSKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQDA---EEIIMRCDAAYDEINCTVGIGLV

Query:  FRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRG
        F +    +    K+  +  ++SPL AE  A+ + L++  +  +Q   V SD L+ I  V G
Subjt:  FRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRG

A0A803PH33 Uncharacterized protein2.7e-2428.41Show/hide
Query:  SSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLGD
        SS     WW+K WKL++P+K++ FVWK FH  +P  + L R H+  S  CP+C+ + ET  HALFLCSRA+EVW      +N         ++  L + +
Subjt:  SSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLGD

Query:  CQSK-QLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQDAEEII-----------------MRCDAAYD
          S  + +      W+IW ++N  FH +P        ++ L+Y+ ++Q    V    N    DV                            +  DAA+D
Subjt:  CQSK-QLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQDAEEII-----------------MRCDAAYD

Query:  EINCTVGIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR
        + N  +GIG V R+  G ++A     S   S      E  A++  L++ RS+ +    + +DSL  ++ ++
Subjt:  EINCTVGIGLVFRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR

A0A803QAH8 Uncharacterized protein4.1e-2527.69Show/hide
Query:  SSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLG
        S S     WW   W+LK+P KVK F WK+ H  +P  + L++     S  C +C    E+  HA+F C  AR VW+ +    N      M I+D    + 
Subjt:  SSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLG

Query:  DCQSK-QLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIA---NIVGGAVNQTVDDVRRILQDAEEIIMR--CDAAYDEINCTVGIGLV
        +C +K +L++I    W+IW+D+NNV H +        S     +L  FQ A   ++  G          +         ++   DAA+D+    +G G +
Subjt:  DCQSK-QLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIA---NIVGGAVNQTVDDVRRILQDAEEIIMR--CDAAYDEINCTVGIGLV

Query:  FRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR
         R+  G +KA +          P   E   +   L++AR L  +   V +DSL  +  +R
Subjt:  FRERQGNLKAVIKVMSKISSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVR

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657508.3e-0726.09Show/hide
Query:  VWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILP-LMNKGFWKPMDIKDRWLSLGD---CQSKQLD
        +WK++VP +VK F+W   ++ + T     R H+  S  C VC+   E+  H L  C     +W  ++P    +GF+     +  + +LGD   C+     
Subjt:  VWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILP-LMNKGFWKPMDIKDRWLSLGD---CQSKQLD

Query:  LI-SIGAWAIWNDK-NNVFHQRPV--PNVQTRSEWILE
         I ++  W  W  +  N+F +       V+   EW +E
Subjt:  LI-SIGAWAIWNDK-NNVFHQRPV--PNVQTRSEWILE

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein1.8e-0922.87Show/hide
Query:  VWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKD---RWLSLGDCQ-SKQLD
        +WKL V  K+KHF+W+     + T T L   ++     C  C  + ET  H +F C   + VW     ++   +  P   +D   R + L   Q +  LD
Subjt:  VWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKD---RWLSLGDCQ-SKQLD

Query:  --LISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTV--DDVRRILQDAEE--------IIMRCDAAYDEINCTVGIGLVFR
          L     W +W  +N    Q+   +    +   ++   E+  AN      N  V  + ++   +D+ +        +    D+ Y + +     G   R
Subjt:  --LISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTV--DDVRRILQDAEE--------IIMRCDAAYDEINCTVGIGLVFR

Query:  ERQGNLKAVIKVMSKI-SSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTV
        E  G++  V+   +K+ SST  L AE    L  L+   +  ++ +   SDS S +  +
Subjt:  ERQGNLKAVIKVMSKI-SSTSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTV

AT3G25270.1 Ribonuclease H-like superfamily protein1.6e-1328.12Show/hide
Query:  AKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVW--EGILPLMNKGFWKPMDIKDRWLSLGDCQSKQLD
        AK+WKLK   K+KHF+WK     + T  NL R H+     C  C ++ ET+ H  F C  A++VW   GI     +     M+ K   L      ++Q  
Subjt:  AKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVW--EGILPLMNKGFWKPMDIKDRWLSLGDCQSKQLD

Query:  LISIG---AWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVR
        L ++     W +W  +N +  Q+   + Q   +     + E++  N    ++NQ V   R
Subjt:  LISIG---AWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVR

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.8e-0533.33Show/hide
Query:  SKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRA-REV
        + W   +W LK+  K+K  +WK+ +  +P    L   ++ +   C  CR+  ET  H LF C  A REV
Subjt:  SKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRA-REV

AT4G29090.1 Ribonuclease H-like superfamily protein4.0e-1225.96Show/hide
Query:  WAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWE-GILPLMNKGFW-KPMDIKDRW---LSLGDCQ-
        + K+WK +   K++HF+WK    ++P    L   H+     C  C    ET +H LF C+ AR  W    +P+   G W   + +   W   L  G+ Q 
Subjt:  WAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWE-GILPLMNKGFW-KPMDIKDRW---LSLGDCQ-

Query:  SKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQI---ANIVGGAVNQTVDDVRRILQDAEE-IIMRCDAAYDEINCTVGIGLVFRERQ
         K   L+    W +W ++N +  +    N Q       + L+E++I   A   G           R      + +    DA ++  N   GIG V R  +
Subjt:  SKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQI---ANIVGGAVNQTVDDVRRILQDAEE-IIMRCDAAYDEINCTVGIGLVFRERQ

Query:  GNLKAV-IKVMSKISST--SPLGAEVGAVLQRLRF
        G +K +  + + K+ S   + L A   AVL   RF
Subjt:  GNLKAV-IKVMSKISST--SPLGAEVGAVLQRLRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCATCTTGCCTCTGCCTCTTCCTCAGAGGTAGAATCGAAATGGTGGGCGAAGGTTTGGAAATTAAAAGTTCCAAATAAGGTAAAGCACTTTGTTTGGAAATCTTT
TCATGAAACTATTCCCACTATAACTAACCTCTGGAGGCACCATGTTCCTGTTAGTGGATGCTGCCCAGTTTGTCGGGAGAAGACGGAGACCACTGATCATGCTCTGTTTT
TGTGTTCTCGAGCTCGAGAGGTGTGGGAGGGTATTCTCCCATTGATGAACAAAGGGTTTTGGAAACCAATGGATATCAAAGACAGATGGTTGAGTCTTGGAGACTGTCAA
AGCAAGCAGTTAGATCTTATTAGTATTGGGGCTTGGGCGATTTGGAATGATAAGAACAATGTTTTTCACCAAAGGCCAGTTCCTAATGTCCAGACTCGAAGTGAGTGGAT
TCTAGAGTATTTGGACGAATTCCAAATAGCTAATATAGTGGGTGGAGCTGTTAATCAAACAGTGGATGATGTTCGAAGAATTCTACAAGACGCTGAAGAGATAATTATGC
GTTGTGATGCGGCTTACGATGAGATTAATTGTACGGTGGGTATTGGACTGGTGTTCCGGGAGAGACAAGGGAATCTTAAGGCAGTGATCAAGGTTATGTCTAAAATTAGC
AGTACATCACCGCTGGGAGCGGAAGTGGGAGCAGTCCTCCAAAGACTTCGATTTGCTCGATCTCTAAAGATGCAATGCTTATCGGTGGTGTCCGACTCCTTATCGTTTAT
AAGGACAGTCAGGGGAGAAGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCATCTTGCCTCTGCCTCTTCCTCAGAGGTAGAATCGAAATGGTGGGCGAAGGTTTGGAAATTAAAAGTTCCAAATAAGGTAAAGCACTTTGTTTGGAAATCTTT
TCATGAAACTATTCCCACTATAACTAACCTCTGGAGGCACCATGTTCCTGTTAGTGGATGCTGCCCAGTTTGTCGGGAGAAGACGGAGACCACTGATCATGCTCTGTTTT
TGTGTTCTCGAGCTCGAGAGGTGTGGGAGGGTATTCTCCCATTGATGAACAAAGGGTTTTGGAAACCAATGGATATCAAAGACAGATGGTTGAGTCTTGGAGACTGTCAA
AGCAAGCAGTTAGATCTTATTAGTATTGGGGCTTGGGCGATTTGGAATGATAAGAACAATGTTTTTCACCAAAGGCCAGTTCCTAATGTCCAGACTCGAAGTGAGTGGAT
TCTAGAGTATTTGGACGAATTCCAAATAGCTAATATAGTGGGTGGAGCTGTTAATCAAACAGTGGATGATGTTCGAAGAATTCTACAAGACGCTGAAGAGATAATTATGC
GTTGTGATGCGGCTTACGATGAGATTAATTGTACGGTGGGTATTGGACTGGTGTTCCGGGAGAGACAAGGGAATCTTAAGGCAGTGATCAAGGTTATGTCTAAAATTAGC
AGTACATCACCGCTGGGAGCGGAAGTGGGAGCAGTCCTCCAAAGACTTCGATTTGCTCGATCTCTAAAGATGCAATGCTTATCGGTGGTGTCCGACTCCTTATCGTTTAT
AAGGACAGTCAGGGGAGAAGTGTAG
Protein sequenceShow/hide protein sequence
MPHLASASSSEVESKWWAKVWKLKVPNKVKHFVWKSFHETIPTITNLWRHHVPVSGCCPVCREKTETTDHALFLCSRAREVWEGILPLMNKGFWKPMDIKDRWLSLGDCQ
SKQLDLISIGAWAIWNDKNNVFHQRPVPNVQTRSEWILEYLDEFQIANIVGGAVNQTVDDVRRILQDAEEIIMRCDAAYDEINCTVGIGLVFRERQGNLKAVIKVMSKIS
STSPLGAEVGAVLQRLRFARSLKMQCLSVVSDSLSFIRTVRGEV