; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg015237 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg015237
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold3:40941650..40945966
RNA-Seq ExpressionSpg015237
SyntenySpg015237
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015383853.1 uncharacterized protein LOC107176237 [Citrus sinensis]2.2e-1227Show/hide
Query:  KESTVHTFWDCKIARNLWLLFIPASLGLFSLC---RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVE--RNL
        KE+T H    CK A+ +W  + P     F  C     N   +E    ++  L+  ++E +V + W +W  +NKF+      +     +++ + VE  R +
Subjt:  KESTVHTFWDCKIARNLWLLFIPASLGLFSLC---RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVE--RNL

Query:  IELKSKSKANLASIAAPRSENLRSHDGWYPPP-NSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLP
         +LK + KA  +S         ++   W PPP N++K+N DA+ N E    GL  + RDS   + +  +KT      +K  EA+A++  +     AA   
Subjt:  IELKSKSKANLASIAAPRSENLRSHDGWYPPP-NSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLP

Query:  SSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREA
          I++ESDC +++K + + E   +E+  V   I   +K          PRS N  AH + + A
Subjt:  SSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREA

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]2.6e-1327.6Show/hide
Query:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNL-----IELKSKSKANLASIAAPRSE-NLRSHDGWYPP
        R +W+  + W+WL + LS  E+   +++ W +W SRN+ I    +     +   I   +  N+     I    +S+ N   +   R   N+       PP
Subjt:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNL-----IELKSKSKANLASIAAPRSE-NLRSHDGWYPP

Query:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDIS
         N WK+N+DASW+ E  +GG+ WI  D RG +++ G   ++EK  I  LE   +   +  I + +  P  I +ESD  ++I+ ++  + D++
Subjt:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDIS

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]2.3e-2231.53Show/hide
Query:  LDYMKKKESTVHTFWDCKIARNLWLLFIPASLGLFSLCRNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRS
        L++ KK+E+T H  W+CK+ +++W+   P     F + R NW+  EYW WL D     E    +I+   +W  RNK I   + S++  +        I S
Subjt:  LDYMKKKESTVHTFWDCKIARNLWLLFIPASLGLFSLCRNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRS

Query:  QVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIP
          +   ++ KSK    +  I     +N R+   W PP  NSWK+N+DA+W A+ +  G+ WI RD +G +I  G + ++ + +I  LE  A+ E + +I 
Subjt:  QVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIP

Query:  VAAGLPSSIIMESDCADLIKCL
             P  I +ESD  + I  L
Subjt:  VAAGLPSSIIMESDCADLIKCL

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]2.2e-1231.52Show/hide
Query:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-
        R NW+  EYW WL D     E    +I+   +W  RNK I   + S++  +        I S  +   ++ KSK    +  I     +N R+   W PP 
Subjt:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-

Query:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCL
         NSWK+N+DA+W A+ +  G+ WI RD +G +I  G + ++ + +I  LE  A+ E + +I      P  I +ESD  + I  L
Subjt:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCL

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.3e-1227.78Show/hide
Query:  DNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPP-NSWKINSDASWNAEDSIG
        D  S  +L+ ++I  W +W+ RN  I      + +S S  I+ Q+ + + E   +S+ +L+ +    +  L+    W PPP + W +N+DASW+     G
Subjt:  DNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPP-NSWKINSDASWNAEDSIG

Query:  GLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRS
        G+ WI R   G +++ G + V+  +++K+LEA A+ E + ++    G+   + +E+D A++   L    ED+++   V + IL+   S ++L F +  R 
Subjt:  GLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRS

Query:  QNVAAHLILREAAGFR
         N  AH + + A+  R
Subjt:  QNVAAHLILREAAGFR

TrEMBL top hitse value%identityAlignment
A0A6J1CQG0 uncharacterized protein LOC1110132161.3e-1327.6Show/hide
Query:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNL-----IELKSKSKANLASIAAPRSE-NLRSHDGWYPP
        R +W+  + W+WL + LS  E+   +++ W +W SRN+ I    +     +   I   +  N+     I    +S+ N   +   R   N+       PP
Subjt:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNL-----IELKSKSKANLASIAAPRSE-NLRSHDGWYPP

Query:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDIS
         N WK+N+DASW+ E  +GG+ WI  D RG +++ G   ++EK  I  LE   +   +  I + +  P  I +ESD  ++I+ ++  + D++
Subjt:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDIS

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X11.1e-2231.53Show/hide
Query:  LDYMKKKESTVHTFWDCKIARNLWLLFIPASLGLFSLCRNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRS
        L++ KK+E+T H  W+CK+ +++W+   P     F + R NW+  EYW WL D     E    +I+   +W  RNK I   + S++  +        I S
Subjt:  LDYMKKKESTVHTFWDCKIARNLWLLFIPASLGLFSLCRNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRS

Query:  QVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIP
          +   ++ KSK    +  I     +N R+   W PP  NSWK+N+DA+W A+ +  G+ WI RD +G +I  G + ++ + +I  LE  A+ E + +I 
Subjt:  QVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIP

Query:  VAAGLPSSIIMESDCADLIKCL
             P  I +ESD  + I  L
Subjt:  VAAGLPSSIIMESDCADLIKCL

A0A6J1DNV9 uncharacterized protein LOC1110224036.2e-1327.78Show/hide
Query:  DNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPP-NSWKINSDASWNAEDSIG
        D  S  +L+ ++I  W +W+ RN  I      + +S S  I+ Q+ + + E   +S+ +L+ +    +  L+    W PPP + W +N+DASW+     G
Subjt:  DNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPP-NSWKINSDASWNAEDSIG

Query:  GLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRS
        G+ WI R   G +++ G + V+  +++K+LEA A+ E + ++    G+   + +E+D A++   L    ED+++   V + IL+   S ++L F +  R 
Subjt:  GLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRS

Query:  QNVAAHLILREAAGFR
         N  AH + + A+  R
Subjt:  QNVAAHLILREAAGFR

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X21.1e-1231.52Show/hide
Query:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-
        R NW+  EYW WL D     E    +I+   +W  RNK I   + S++  +        I S  +   ++ KSK    +  I     +N R+   W PP 
Subjt:  RNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQ-----IRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-

Query:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCL
         NSWK+N+DA+W A+ +  G+ WI RD +G +I  G + ++ + +I  LE  A+ E + +I      P  I +ESD  + I  L
Subjt:  PNSWKINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCL

Q9SZD8 Putative reverse transcriptase/RNA-dependent DNA polymerase1.7e-1026.67Show/hide
Query:  KESTVHTFWDCKIARNLWLL-FIPASLGLFSLCRNNWSGMEYWS--WLSDNLSSGE------LEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQV
        KE+  H  + C  AR  W +  IP  LG        W+   Y +  W+  NL +G        + V  L+W LW +RN+ +       +  V  +    +
Subjt:  KESTVHTFWDCKIARNLWLL-FIPASLGLFSLCRNNWSGMEYWS--WLSDNLSSGE------LEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQV

Query:  ERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPPNSW-KINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVA
        E   I  +++S      +      N  S   W PPP+ W K N+DA+WN ++   G+ W+ R+ +G +  +G + + +  S+   E +AM+ A+ S+  +
Subjt:  ERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPPNSW-KINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVA

Query:  AGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREAAGF
            + +I ESD   LI+ L + E   S    + DL    ++  +V +F   PR  N  A  + RE+  F
Subjt:  AGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREAAGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-0824.54Show/hide
Query:  KESTVHTFWDCKIARNLWLLF-IPASLGLFSLCRNNWSGMEYWS--WLSDNL-----SSGELEDVV-ILMWNLWSSRNKFIRNPISSKSASVSNQIRSQV
        +E+  H  + C  AR +W +  IPA           W+   Y +  W+  NL       G++ ++V  L+W LW SRN+ +       +  V  +     
Subjt:  KESTVHTFWDCKIARNLWLLF-IPASLGLFSLCRNNWSGMEYWS--WLSDNL-----SSGELEDVV-ILMWNLWSSRNKFIRNPISSKSASVSNQIRSQV

Query:  ERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPPNSW-KINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVA
        E      + + KA     + P+ E   S   W  PP  W K N+DA+W  E+   G+ WI R+  G ++ +G + +    ++   E +A++ A+  + ++
Subjt:  ERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPPNSW-KINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVA

Query:  AGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREAAGFRRH
              II ESD   L+  L + ++    L+   + I       + ++F+  PR  N  A  I RE+  F  +
Subjt:  AGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREAAGFRRH

AT4G29090.1 Ribonuclease H-like superfamily protein3.3e-1426.67Show/hide
Query:  KESTVHTFWDCKIARNLWLL-FIPASLGLFSLCRNNWSGMEYWS--WLSDNLSSGE------LEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQV
        KE+  H  + C  AR  W +  IP  LG        W+   Y +  W+  NL +G        + V  L+W LW +RN+ +       +  V  +    +
Subjt:  KESTVHTFWDCKIARNLWLL-FIPASLGLFSLCRNNWSGMEYWS--WLSDNLSSGE------LEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQV

Query:  ERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPPNSW-KINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVA
        E   I  +++S      +      N  S   W PPP+ W K N+DA+WN ++   G+ W+ R+ +G +  +G + + +  S+   E +AM+ A+ S+  +
Subjt:  ERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPPNSW-KINSDASWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVA

Query:  AGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREAAGF
            + +I ESD   LI+ L + E   S    + DL    ++  +V +F   PR  N  A  + RE+  F
Subjt:  AGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREAAGF

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.1e-0524Show/hide
Query:  LMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-PNSWKINSDASWNAEDSIGGLSWIARDSRGSL
        LMW +W S N  + N   +K  +      +  +  L    +  + N    A P S N +    W PP  +  K N DAS +  +++ GL WI R+S+G++
Subjt:  LMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPP-PNSWKINSDASWNAEDSIGGLSWIARDSRGSL

Query:  IIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGL-PSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREA
        I  G+   + + + +  E   +   I +I  + G     +I E D   + + +     +   L+   D I     S + ++F    R QN  A  + ++A
Subjt:  IIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGL-PSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQNVAAHLILREA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGTAGTTGGAAATTCCGATGAGTTGGGTGAAGGAAATGTGAACTTTGGTGTTGTGAAATGGCAGGCAAATCCATTTGCAAACCAATGGGGCATATGGAAACAAAA
AACAGATAAAAATAAGAAGAGAAGTTTGGAAGTTGGAATGAAAATGGTAACAAAGCAACGTGAAAGGCAAGGTGGTAATGATATAACAAGCTTAACTGAGCATTACGTTG
AAGTTAGTGTTGTACTCGACTATATGAAGAAAAAGGAATCTACTGTCCATACTTTCTGGGATTGCAAGATCGCTAGAAATCTATGGTTATTATTTATCCCTGCCTCATTG
GGTCTATTTTCTCTCTGTAGGAACAATTGGTCGGGGATGGAGTATTGGAGTTGGCTTTCAGATAATCTTAGCAGTGGGGAGCTTGAAGATGTTGTCATCCTTATGTGGAA
TCTATGGAGTAGTAGAAACAAATTCATTCGCAACCCTATCAGCTCTAAATCAGCATCAGTTTCAAACCAGATTCGATCCCAAGTTGAAAGAAATTTGATCGAATTAAAGA
GCAAATCGAAAGCAAACCTGGCATCGATTGCGGCCCCGAGAAGCGAGAACCTCAGGAGTCATGATGGATGGTATCCCCCTCCAAACAGCTGGAAAATCAACTCTGATGCC
TCGTGGAATGCAGAGGATTCGATAGGAGGCCTGAGTTGGATCGCTCGTGACTCCAGGGGATCTCTCATTATCGTGGGTCTGAAAACAGTTAAGGAGAAAAGCTCAATCAA
AGTTTTAGAGGCAAAAGCGATGAAAGAAGCCATAGCCTCCATACCTGTTGCGGCTGGTCTCCCATCCTCGATAATTATGGAGTCGGATTGTGCCGATCTCATCAAATGTC
TACAAGATCCCGAAGAAGACATTTCGGAGCTGAAAGTTGTTGCCGATCTCATCCTTGATTCGGCAAAGTCTCTCGACGTGCTTCAATTCAAGCAATGTCCTAGGTCCCAA
AATGTAGCAGCCCATCTCATCTTGCGGGAAGCTGCTGGTTTTAGGCGCCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTAGTAGTTGGAAATTCCGATGAGTTGGGTGAAGGAAATGTGAACTTTGGTGTTGTGAAATGGCAGGCAAATCCATTTGCAAACCAATGGGGCATATGGAAACAAAA
AACAGATAAAAATAAGAAGAGAAGTTTGGAAGTTGGAATGAAAATGGTAACAAAGCAACGTGAAAGGCAAGGTGGTAATGATATAACAAGCTTAACTGAGCATTACGTTG
AAGTTAGTGTTGTACTCGACTATATGAAGAAAAAGGAATCTACTGTCCATACTTTCTGGGATTGCAAGATCGCTAGAAATCTATGGTTATTATTTATCCCTGCCTCATTG
GGTCTATTTTCTCTCTGTAGGAACAATTGGTCGGGGATGGAGTATTGGAGTTGGCTTTCAGATAATCTTAGCAGTGGGGAGCTTGAAGATGTTGTCATCCTTATGTGGAA
TCTATGGAGTAGTAGAAACAAATTCATTCGCAACCCTATCAGCTCTAAATCAGCATCAGTTTCAAACCAGATTCGATCCCAAGTTGAAAGAAATTTGATCGAATTAAAGA
GCAAATCGAAAGCAAACCTGGCATCGATTGCGGCCCCGAGAAGCGAGAACCTCAGGAGTCATGATGGATGGTATCCCCCTCCAAACAGCTGGAAAATCAACTCTGATGCC
TCGTGGAATGCAGAGGATTCGATAGGAGGCCTGAGTTGGATCGCTCGTGACTCCAGGGGATCTCTCATTATCGTGGGTCTGAAAACAGTTAAGGAGAAAAGCTCAATCAA
AGTTTTAGAGGCAAAAGCGATGAAAGAAGCCATAGCCTCCATACCTGTTGCGGCTGGTCTCCCATCCTCGATAATTATGGAGTCGGATTGTGCCGATCTCATCAAATGTC
TACAAGATCCCGAAGAAGACATTTCGGAGCTGAAAGTTGTTGCCGATCTCATCCTTGATTCGGCAAAGTCTCTCGACGTGCTTCAATTCAAGCAATGTCCTAGGTCCCAA
AATGTAGCAGCCCATCTCATCTTGCGGGAAGCTGCTGGTTTTAGGCGCCATTAG
Protein sequenceShow/hide protein sequence
MLVVGNSDELGEGNVNFGVVKWQANPFANQWGIWKQKTDKNKKRSLEVGMKMVTKQRERQGGNDITSLTEHYVEVSVVLDYMKKKESTVHTFWDCKIARNLWLLFIPASL
GLFSLCRNNWSGMEYWSWLSDNLSSGELEDVVILMWNLWSSRNKFIRNPISSKSASVSNQIRSQVERNLIELKSKSKANLASIAAPRSENLRSHDGWYPPPNSWKINSDA
SWNAEDSIGGLSWIARDSRGSLIIVGLKTVKEKSSIKVLEAKAMKEAIASIPVAAGLPSSIIMESDCADLIKCLQDPEEDISELKVVADLILDSAKSLDVLQFKQCPRSQ
NVAAHLILREAAGFRRH