; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030114 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030114
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNase H domain-containing protein
Genome locationscaffold6:13195783..13196842
RNA-Seq ExpressionSpg030114
SyntenySpg030114
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4262994.1 unnamed protein product [Prunus armeniaca]2.8e-1624.84Show/hide
Query:  MPKRNLAKEDEIIWRKDPKGKFSVKSAYNLAIQIEAQGIKKKDVEHVFWNYKVVRSLWKILFP-KLN-ILFHNCKSWW------------KFKDFWDGAS
        +P  +LA  D +IW  +  G +SVKS Y L  ++E   +  +    V  + +  + +W +  P K+   L+  C +               F++ W    
Subjt:  MPKRNLAKEDEIIWRKDPKGKFSVKSAYNLAIQIEAQGIKKKDVEHVFWNYKVVRSLWKILFP-KLN-ILFHNCKSWW------------KFKDFWDGAS

Query:  RILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWI
             ++    ++L W +W  R++L     ++   Q+L  + + A  F   N    T+HG++S    S   W PP    +K+NVD A     +  GVG +
Subjt:  RILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWI

Query:  VRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEI-SLLINEIRETANQMISVSFVHCPR
        VR+  G  + A  ++       + +E M  +EG        R   D    + V+E DA + +  I  +S +C+ +  LLI E++   N   +V     PR
Subjt:  VRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEI-SLLINEIRETANQMISVSFVHCPR

Query:  ASNFLA
         SN +A
Subjt:  ASNFLA

EEC72753.1 hypothetical protein OsI_06384 [Oryza sativa Indica Group]4.7e-1627.06Show/hide
Query:  VEHVFWNYKVVRSLWKILFPKLNIL-----FHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIEN
        +EHVF       S+WK +    NI        N + W    DF    S I +    +  +   W IW+ RNN K +      ++++Q I+  A+  MI  
Subjt:  VEHVFWNYKVVRSLWKILFPKLNIL-----FHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIEN

Query:  EWEKTNHGKRSE--TLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLH
         +   N  + +E  +LPS   W+PP +  + +N D A F+A    GVG ++RD  G+ + A + +  G    +  EA  +     +  E          H
Subjt:  EWEKTNHGKRSE--TLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLH

Query:  EVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA
         V++  D   +++ +N   +D S+I  L+ +I++   + ISVSF+H    SN  A
Subjt:  EVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]5.0e-1825.79Show/hide
Query:  KKKDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENE
        K++  +H+ W  KV++ +W    P     F+  ++ W  K++W+        ++   +  +  +IW+ RN       + +T+    DI  A  R++I + 
Subjt:  KKKDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENE

Query:  WEKTNHGKRSETL--------PSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYP
         + TN  ++S+           +  RW PP S+ WKLN D AW   +N+ G+GWI+RD +G +I  G +       I  LE M + EG   + +      
Subjt:  WEKTNHGKRSETL--------PSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYP

Query:  DCSLHEVVIESDAAEIVRLIN
              + +ESD+ E + L++
Subjt:  DCSLHEVVIESDAAEIVRLIN

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]1.6e-1631.03Show/hide
Query:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSH----------GRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRD
        + W+IW+ RN       + +T+    DI     R++I +    TN   +S     H           RW PP S+ WKLN D AW   +N+ G+GWI+RD
Subjt:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSH----------GRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRD

Query:  SEGSLIGAGSKKTSGRLKIKMLEAMTVVEGF-SIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASN
         +G +I A  +       I  LE M + EG  +I  E  R         + +ESD+ E + L++   +D +EI  L+ EI +    M  VS  H  R +N
Subjt:  SEGSLIGAGSKKTSGRLKIKMLEAMTVVEGF-SIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASN

Query:  FLA
         +A
Subjt:  FLA

XP_024046732.1 uncharacterized protein LOC112101057 [Citrus clementina]1.6e-1625Show/hide
Query:  MPKRNLAKEDEIIWRKDPKGKFSVKSAYNLAIQIEAQGIKKKDVEHVFWNYKVVRSLWKILFPKLNILFH---NCK---SWWKFKDFWDGASRILDAKDA
        +P     K DE +W  D KG ++VKS Y +A++++         E +F    +   + ++    +   FH    CK     WK+         ++     
Subjt:  MPKRNLAKEDEIIWRKDPKGKFSVKSAYNLAIQIEAQGIKKKDVEHVFWNYKVVRSLWKILFPKLNILFH---NCK---SWWKFKDFWDGASRILDAKDA

Query:  SAASHLI---------------WEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASN
        S    LI               WEIW  RN L       +   ++    +A   +   +  E+T+  KRSE +    +W PP    +K+NVD A      
Subjt:  SAASHLI---------------WEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASN

Query:  SSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVS
         +G+G ++R+ +G +I    K T  +  +   EA  V  G  I  E         L  V+IE+D  E+  L N  +    EI   I++I     +  S+S
Subjt:  SSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVS

Query:  FVHCPRASNFLA
          H PR  N  A
Subjt:  FVHCPRASNFLA

TrEMBL top hitse value%identityAlignment
A0A2N9GAE6 Uncharacterized protein1.7e-1628.35Show/hide
Query:  EAQGIKKKDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILD-AKDASAASHLI--WEIWQKRNNLKQSKGNKDTKQILQDILSAA
        E  G K +D  H  W+ K ++S+W       N ++          DF D  S++L   +D+     +I  W +WQ+RN ++  +      Q+     S  
Subjt:  EAQGIKKKDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILD-AKDASAASHLI--WEIWQKRNNLKQSKGNKDTKQILQDILSAA

Query:  HRFMIENEWEKTNHGKRSETLPSHG-RWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNY
          +M ENE  K      S+  P+   RW+PP   ++K+N D A F+ +N +G+G IVRDS G ++ + ++K      +  +EA  V      V E     
Subjt:  HRFMIENEWEKTNHGKRSETLPSHG-RWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNY

Query:  PDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA
            L E   E D+  IV  +N      +   LLI + +  A+++ S SF H  R  N LA
Subjt:  PDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X12.4e-1825.79Show/hide
Query:  KKKDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENE
        K++  +H+ W  KV++ +W    P     F+  ++ W  K++W+        ++   +  +  +IW+ RN       + +T+    DI  A  R++I + 
Subjt:  KKKDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENE

Query:  WEKTNHGKRSETL--------PSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYP
         + TN  ++S+           +  RW PP S+ WKLN D AW   +N+ G+GWI+RD +G +I  G +       I  LE M + EG   + +      
Subjt:  WEKTNHGKRSETL--------PSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYP

Query:  DCSLHEVVIESDAAEIVRLIN
              + +ESD+ E + L++
Subjt:  DCSLHEVVIESDAAEIVRLIN

A0A6J1DSV1 uncharacterized protein LOC1110236087.8e-1731.03Show/hide
Query:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSH----------GRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRD
        + W+IW+ RN       + +T+    DI     R++I +    TN   +S     H           RW PP S+ WKLN D AW   +N+ G+GWI+RD
Subjt:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSH----------GRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRD

Query:  SEGSLIGAGSKKTSGRLKIKMLEAMTVVEGF-SIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASN
         +G +I A  +       I  LE M + EG  +I  E  R         + +ESD+ E + L++   +D +EI  L+ EI +    M  VS  H  R +N
Subjt:  SEGSLIGAGSKKTSGRLKIKMLEAMTVVEGF-SIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASN

Query:  FLA
         +A
Subjt:  FLA

A0A7N2N157 RNase H domain-containing protein5.1e-1626.8Show/hide
Query:  KDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWE
        +D  H  W  +V++ +W       N L      +  F+D W G       K A   +++ W IW KRN  +    +    +I QD+      F    E  
Subjt:  KDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWE

Query:  KTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIE
             ++    P+H  W PP  +Q K+N D A F  +  +G+G +VR+S G++ GA S +    + +  +EA+        V +         L EVV E
Subjt:  KTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIE

Query:  SDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA
         D+  I + +N  S   S    +I++++  A   +SVSF+H  R  N +A
Subjt:  SDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA

B8AE80 Uncharacterized protein2.3e-1627.06Show/hide
Query:  VEHVFWNYKVVRSLWKILFPKLNIL-----FHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIEN
        +EHVF       S+WK +    NI        N + W    DF    S I +    +  +   W IW+ RNN K +      ++++Q I+  A+  MI  
Subjt:  VEHVFWNYKVVRSLWKILFPKLNIL-----FHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIEN

Query:  EWEKTNHGKRSE--TLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLH
         +   N  + +E  +LPS   W+PP +  + +N D A F+A    GVG ++RD  G+ + A + +  G    +  EA  +     +  E          H
Subjt:  EWEKTNHGKRSE--TLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLH

Query:  EVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA
         V++  D   +++ +N   +D S+I  L+ +I++   + ISVSF+H    SN  A
Subjt:  EVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein5.2e-0524.6Show/hide
Query:  SSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEIS
        S + K N D +  E    SG+GW++R+S+G+++  G  K  GR+  +  E   ++      S            +V+ E D + + RLIN    D   + 
Subjt:  SSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEIS

Query:  LLINEIRETANQMISVSFVHCPRASN
          ++ I+       S  F+   R  N
Subjt:  LLINEIRETANQMISVSFVHCPRASN

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.2e-0525Show/hide
Query:  MIENEWEKTNHG-----KRSETLPSH-GRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFR
        M   EW    H      K  +T  S   +W  PG+   K N DV+       SG+ WI+R+S+G+ +  G  K  GR  IK  E   ++           
Subjt:  MIENEWEKTNHG-----KRSETLPSH-GRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGFSIVSERFR

Query:  NYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASN
           D     V  E D   + RLI     +   +   +  I++ +    +V F    R  N
Subjt:  NYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASN

AT3G25270.1 Ribonuclease H-like superfamily protein8.9e-0522.17Show/hide
Query:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTN----------HGKR-SETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVR
        ++W +W+ RN L   + +   +  LQ   +         EWE TN          H  R  +   +  +W  P S+  K N D A+   + ++  GW++R
Subjt:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTN----------HGKR-SETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVR

Query:  DSEGSLIGAG---SKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPR
        D  G  +G+G      TS  L+ +    +  ++     S+ +R        +V+ E D+ ++  L+N    +    +  I E R    +     F   PR
Subjt:  DSEGSLIGAG---SKKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPR

Query:  ASN
         +N
Subjt:  ASN

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.8e-1226.58Show/hide
Query:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGS
        L+W IW+  N+L  +      +  ++  L+    ++      +  +G R+     + +W PPG  + K N D +  E +  SG+GWI+R+S+G++I  G 
Subjt:  LIWEIWQKRNNLKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGS

Query:  KKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKD
         K  GR+  +  E  T++      S  F +       +V+ E D   I R+IN  S +
Subjt:  KKTSGRLKIKMLEAMTVVEGFSIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGAAGAGAAATCTAGCGAAAGAGGACGAGATCATTTGGAGGAAGGATCCAAAGGGGAAGTTCTCTGTTAAGAGCGCCTATAACTTAGCTATCCAAATTGAGGCTCA
AGGAATAAAAAAAAAAGATGTGGAACATGTGTTCTGGAACTACAAAGTGGTAAGATCTTTGTGGAAGATTTTATTCCCTAAACTTAATATATTATTCCACAATTGCAAGA
GTTGGTGGAAGTTCAAAGACTTCTGGGATGGAGCCTCGCGAATTCTGGATGCTAAAGACGCAAGTGCGGCTAGCCACTTAATTTGGGAAATTTGGCAGAAGAGGAACAAT
TTGAAGCAGTCAAAAGGCAACAAAGACACAAAGCAAATTTTGCAAGACATTCTCAGTGCAGCACACAGATTCATGATCGAAAACGAGTGGGAGAAGACTAACCATGGCAA
AAGGTCAGAGACCCTCCCGAGTCACGGGCGTTGGATCCCGCCGGGCAGTTCGCAATGGAAGCTGAATGTGGACGTAGCCTGGTTTGAAGCCTCAAACTCAAGCGGAGTGG
GGTGGATAGTCCGGGACTCCGAAGGTTCTCTGATAGGAGCGGGCTCCAAGAAGACAAGTGGAAGATTAAAGATAAAGATGCTTGAAGCCATGACCGTGGTCGAAGGTTTC
TCTATTGTATCGGAAAGATTCAGAAATTACCCGGATTGCTCGCTGCATGAGGTCGTCATTGAATCAGATGCCGCTGAAATTGTGAGGTTGATTAACGGAGTTTCCAAAGA
TTGCTCAGAGATCTCCCTCCTGATCAACGAGATTCGCGAGACAGCGAATCAAATGATCTCGGTTTCGTTCGTGCATTGCCCACGGGCTTCGAACTTTTTGGCCATTCTCT
AG
mRNA sequenceShow/hide mRNA sequence
ATGCCGAAGAGAAATCTAGCGAAAGAGGACGAGATCATTTGGAGGAAGGATCCAAAGGGGAAGTTCTCTGTTAAGAGCGCCTATAACTTAGCTATCCAAATTGAGGCTCA
AGGAATAAAAAAAAAAGATGTGGAACATGTGTTCTGGAACTACAAAGTGGTAAGATCTTTGTGGAAGATTTTATTCCCTAAACTTAATATATTATTCCACAATTGCAAGA
GTTGGTGGAAGTTCAAAGACTTCTGGGATGGAGCCTCGCGAATTCTGGATGCTAAAGACGCAAGTGCGGCTAGCCACTTAATTTGGGAAATTTGGCAGAAGAGGAACAAT
TTGAAGCAGTCAAAAGGCAACAAAGACACAAAGCAAATTTTGCAAGACATTCTCAGTGCAGCACACAGATTCATGATCGAAAACGAGTGGGAGAAGACTAACCATGGCAA
AAGGTCAGAGACCCTCCCGAGTCACGGGCGTTGGATCCCGCCGGGCAGTTCGCAATGGAAGCTGAATGTGGACGTAGCCTGGTTTGAAGCCTCAAACTCAAGCGGAGTGG
GGTGGATAGTCCGGGACTCCGAAGGTTCTCTGATAGGAGCGGGCTCCAAGAAGACAAGTGGAAGATTAAAGATAAAGATGCTTGAAGCCATGACCGTGGTCGAAGGTTTC
TCTATTGTATCGGAAAGATTCAGAAATTACCCGGATTGCTCGCTGCATGAGGTCGTCATTGAATCAGATGCCGCTGAAATTGTGAGGTTGATTAACGGAGTTTCCAAAGA
TTGCTCAGAGATCTCCCTCCTGATCAACGAGATTCGCGAGACAGCGAATCAAATGATCTCGGTTTCGTTCGTGCATTGCCCACGGGCTTCGAACTTTTTGGCCATTCTCT
AG
Protein sequenceShow/hide protein sequence
MPKRNLAKEDEIIWRKDPKGKFSVKSAYNLAIQIEAQGIKKKDVEHVFWNYKVVRSLWKILFPKLNILFHNCKSWWKFKDFWDGASRILDAKDASAASHLIWEIWQKRNN
LKQSKGNKDTKQILQDILSAAHRFMIENEWEKTNHGKRSETLPSHGRWIPPGSSQWKLNVDVAWFEASNSSGVGWIVRDSEGSLIGAGSKKTSGRLKIKMLEAMTVVEGF
SIVSERFRNYPDCSLHEVVIESDAAEIVRLINGVSKDCSEISLLINEIRETANQMISVSFVHCPRASNFLAIL