; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016012 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016012
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold6:43593048..43600550
RNA-Seq ExpressionSpg016012
SyntenySpg016012
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4381998.1 hypothetical protein G4B88_006630 [Cannabis sativa]7.1e-2229.23Show/hide
Query:  KSENLNLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRR-GTLIWERLDWFLCNSSFDSLFSSVD
        +SE+  L    K  F++PW+  GDFNEIL   EK GG  R    M  F+  L  C L DL   G  FTW   R+ G  + ERLD + CN  +  LF  V 
Subjt:  KSENLNLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRR-GTLIWERLDWFLCNSSFDSLFSSVD

Query:  TQNLDWIYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLI-------------------------AQNGSWNDSKVAEFITPSGVWDLDKI
          N D++ SD++PI   +++   +++  ++  F+FE  W    EC ++I                          Q G WN SK      P  V +  K 
Subjt:  TQNLDWIYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLI-------------------------AQNGSWNDSKVAEFITPSGVWDLDKI

Query:  KRVVVYFDIDSVR----TSQEDLGLVVVTCWALWIDRNKIVHNEDIPLINQRSQWIRKYL
           ++      VR     S+++  LV +  W +W DRN ++  +    +N   ++ R+ L
Subjt:  KRVVVYFDIDSVR----TSQEDLGLVVVTCWALWIDRNKIVHNEDIPLINQRSQWIRKYL

MBA0707733.1 hypothetical protein [Gossypium laxum]1.9e-2228.38Show/hide
Query:  FLALRYLIDKSENLNLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTW-HGNRRGTLIWERLDWFLCNSS
        F    Y  +KS   NL        N PWL +GDFNEI+Y+ EK GG  R+   M+ FR+ L +C L DL   G  FTW  GN   T I ERLD  + N  
Subjt:  FLALRYLIDKSENLNLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTW-HGNRRGTLIWERLDWFLCNSS

Query:  FDSLFSSVDTQNLDWIYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNG-SWN--------DSKVAEFI-------TPSGVWDLDKI
        +  LF     + L W       I +  DS  P  ++ +           N  + ADLI QN  +WN           VAE I        P   + ++  
Subjt:  FDSLFSSVDTQNLDWIYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNG-SWN--------DSKVAEFI-------TPSGVWDLDKI

Query:  KRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFACWSPPPASFWKVNTD
        + +   F     + S           WA+W DRN  VH +    +++  + + ++++SY+    +  +  +  + + R        W   P  F K+N D
Subjt:  KRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFACWSPPPASFWKVNTD

Query:  AACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNPLLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKKS
        A  D        G++ RD +G  L S S       +  +AE  +  +  K   +M   K+IIE D  L+I   SK S
Subjt:  AACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNPLLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKKS

RYQ96673.1 hypothetical protein Ahy_B08g092519 [Arachis hypogaea]1.4e-2230.03Show/hide
Query:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTL-IWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK
        N+PWL+ GDFN+IL  +EK+GG       M  F++ L   +L DL   G  FTW  N+ G + I ERLD  +    +   F     ++L    SD+ P+ 
Subjt:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTL-IWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK

Query:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGV-WDLDKIKRVVVYFDIDSV------RTSQEDLGLVVVTCWALWIDR
        I V  Q  KR+R + N F+FEE W    EC ++I +  SW    V + I    + W  D I++   +F+ + +      +  QED       C+     +
Subjt:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGV-WDLDKIKRVVVYFDIDSV------RTSQEDLGLVVVTCWALWIDR

Query:  NKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPP--KNRFACWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDIL
        N      DI  +  R+          L+  E     +   N    PP       CW PP  + +K+N DAA  SN  + G G + R+ +G+I+
Subjt:  NKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPP--KNRFACWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDIL

TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense]3.3e-2727.08Show/hide
Query:  IPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGT--LIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK
        +PW + GDFNEI+   EK GG TR    M NF++ L DC LRDL   G +FTW  NRR +   I ERLD  + N+ +  LFS    ++LD+  SD++PI 
Subjt:  IPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGT--LIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK

Query:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGVWDLDKIKRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNE
        + +  +  K +   R++F ++  W   ++   L      ++     +F+         K K  ++YF+             + V  W +W  RN++V+ +
Subjt:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGVWDLDKIKRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNE

Query:  DIPLINQRS--QWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFA-CWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNP
            ++      W   ++  +  A    +  +          K R A  W P P+  +K+NTDA  D  +   G G++ RD  G ++ S        + P
Subjt:  DIPLINQRS--QWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFA-CWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNP

Query:  LLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKK-------SPVWSDLEALGKSIWSLASSFRGRLAH
           E  ++ +G + A E G     IESD    +N ++           V  D+ A+G + +  + SF  RL +
Subjt:  LLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKK-------SPVWSDLEALGKSIWSLASSFRGRLAH

XP_024195790.1 uncharacterized protein LOC112198938 [Rosa chinensis]2.6e-2444.76Show/hide
Query:  NLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDW
        NL  N     N PWLL GDFNEIL + EK GGP R  R MD FR+ L    L+DL   G QFTW GNR G  I  RLD F+ N S+  +F +    +L+ 
Subjt:  NLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDW

Query:  IYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLI
          SD+ PI ++V  +R +R +K++ +F+FE+FW   N+C D++
Subjt:  IYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLI

TrEMBL top hitse value%identityAlignment
A0A444Y447 Uncharacterized protein7.0e-2330.03Show/hide
Query:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTL-IWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK
        N+PWL+ GDFN+IL  +EK+GG       M  F++ L   +L DL   G  FTW  N+ G + I ERLD  +    +   F     ++L    SD+ P+ 
Subjt:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTL-IWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK

Query:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGV-WDLDKIKRVVVYFDIDSV------RTSQEDLGLVVVTCWALWIDR
        I V  Q  KR+R + N F+FEE W    EC ++I +  SW    V + I    + W  D I++   +F+ + +      +  QED       C+     +
Subjt:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGV-WDLDKIKRVVVYFDIDSV------RTSQEDLGLVVVTCWALWIDR

Query:  NKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPP--KNRFACWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDIL
        N      DI  +  R+          L+  E     +   N    PP       CW PP  + +K+N DAA  SN  + G G + R+ +G+I+
Subjt:  NKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPP--KNRFACWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDIL

A0A5C7IIT4 Uncharacterized protein1.6e-2727.08Show/hide
Query:  IPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGT--LIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK
        +PW + GDFNEI+   EK GG TR    M NF++ L DC LRDL   G +FTW  NRR +   I ERLD  + N+ +  LFS    ++LD+  SD++PI 
Subjt:  IPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGT--LIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK

Query:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGVWDLDKIKRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNE
        + +  +  K +   R++F ++  W   ++   L      ++     +F+         K K  ++YF+             + V  W +W  RN++V+ +
Subjt:  IRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGVWDLDKIKRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNE

Query:  DIPLINQRS--QWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFA-CWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNP
            ++      W   ++  +  A    +  +          K R A  W P P+  +K+NTDA  D  +   G G++ RD  G ++ S        + P
Subjt:  DIPLINQRS--QWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFA-CWSPPPASFWKVNTDAACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNP

Query:  LLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKK-------SPVWSDLEALGKSIWSLASSFRGRLAH
           E  ++ +G + A E G     IESD    +N ++           V  D+ A+G + +  + SF  RL +
Subjt:  LLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKK-------SPVWSDLEALGKSIWSLASSFRGRLAH

A0A7J8Z7F1 RNase H domain-containing protein9.1e-2328.38Show/hide
Query:  FLALRYLIDKSENLNLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTW-HGNRRGTLIWERLDWFLCNSS
        F    Y  +KS   NL        N PWL +GDFNEI+Y+ EK GG  R+   M+ FR+ L +C L DL   G  FTW  GN   T I ERLD  + N  
Subjt:  FLALRYLIDKSENLNLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTW-HGNRRGTLIWERLDWFLCNSS

Query:  FDSLFSSVDTQNLDWIYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNG-SWN--------DSKVAEFI-------TPSGVWDLDKI
        +  LF     + L W       I +  DS  P  ++ +           N  + ADLI QN  +WN           VAE I        P   + ++  
Subjt:  FDSLFSSVDTQNLDWIYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNG-SWN--------DSKVAEFI-------TPSGVWDLDKI

Query:  KRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFACWSPPPASFWKVNTD
        + +   F     + S           WA+W DRN  VH +    +++  + + ++++SY+    +  +  +  + + R        W   P  F K+N D
Subjt:  KRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFACWSPPPASFWKVNTD

Query:  AACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNPLLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKKS
        A  D        G++ RD +G  L S S       +  +AE  +  +  K   +M   K+IIE D  L+I   SK S
Subjt:  AACDSNSPSMGRGMIGRDDKGDILFSSSLWVDFQMNPLLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKKS

A0A803NQ77 Uncharacterized protein2.2e-2441.67Show/hide
Query:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTW-HGNRRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK
        N+PWL+ GDFNE+L N++K GGP R+  LM+NFR  +  C+LR +P  GD FTW + N  G +I ERLD    NS +  +F+  + Q+LD+ +SD++ I 
Subjt:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTW-HGNRRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK

Query:  IRVD-SQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDS
          V    +P   +K+ ++F+FE+FW    +C+ +IA +  WN S
Subjt:  IRVD-SQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDS

A0A803Q5N3 Uncharacterized protein1.4e-2342.65Show/hide
Query:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGN-RRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK
        N PWL+ GDFNEIL N +K GGP ++ RLM++FR CL  C L D P  GD FTW  N    T + ERLDW   N+ +D++F      +LD+ +SD++ I 
Subjt:  NIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGN-RRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIK

Query:  IRVDSQR-PKRQRKQRNQFKFEEFWTNYNECADLIA
          + +Q  P +Q K+R++F FE+ W   +E  ++I+
Subjt:  IRVDSQR-PKRQRKQRNQFKFEEFWTNYNECADLIA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein4.5e-0626.45Show/hide
Query:  NIPWLLAGDFNEIL-YNEEKSGGPTR-DRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPI
        N PWL+ GDFN+I    E  S  P+    + +++ + C+ D +L DLP  G  +TW  +++   I  +LD  + N  + + F +          SD+   
Subjt:  NIPWLLAGDFNEIL-YNEEKSGGPTR-DRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPI

Query:  KIRVDSQRPKRQRKQRNQFKF
         + +++  P  ++K    F F
Subjt:  KIRVDSQRPKRQRKQRNQFKF

AT1G43760.1 DNAse I-like superfamily protein7.6e-0629.75Show/hide
Query:  LLAGDFNEIL-----YNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTLIWERLDWFLCNSS-FDSLFSSVDTQNLDWIYSDYKPI
        +L GDF++I      Y+  ++  P R    ++ F+ CL D +L D+P  G  +TW  ++    I  +LD  + N   F S  S++    L  + SD+ P 
Subjt:  LLAGDFNEIL-----YNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPPPGDQFTWHGNRRGTLIWERLDWFLCNSS-FDSLFSSVDTQNLDWIYSDYKPI

Query:  KIRVDSQRPKRQRKQRNQFKF
         I +++  PKR +K    F F
Subjt:  KIRVDSQRPKRQRKQRNQFKF

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.3e-0926.92Show/hide
Query:  LVVVTCWALWIDRNKIV---HNEDIPLINQRSQWIRKYLDSYLKAHEKCSSR--LTGHNGVNRPPKNRFACWSPPPASFWKVNTDAACDSNSPSMGRGMI
        LV    W LW  RN+++      D P + +R+          ++  E+ S+R  L G     +  +N    W  PP  + K NTDA     +P  G G I
Subjt:  LVVVTCWALWIDRNKIV---HNEDIPLINQRSQWIRKYLDSYLKAHEKCSSR--LTGHNGVNRPPKNRFACWSPPPASFWKVNTDAACDSNSPSMGRGMI

Query:  GRDDKGDILFSSSLWVDFQMNPLLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKKSPVWSDLEALGKSIWSLASSF
         R++ G +L+  +  +    N L AEL ++   V         ++I ESD Q  +N L+     W  L+   + I  L   F
Subjt:  GRDDKGDILFSSSLWVDFQMNPLLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKKSPVWSDLEALGKSIWSLASSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAGGCCGACCAAGGGCATGGGCCCAAACCCGTCATGGATCGACCCATCTTTGTCTCCCAGAGCTCCAAAGAAAACCCTAGCTGAAGAGAAGATAAATAGAGACAA
AGGCCACCTGAGAAAGGGATCGAAAAAGAGAATACTAAATACAGCAGATTTAACCGAGGAAGACTCAAAATGGGAAGATATACAAGCAGTAATCCACGGTGAAGAGTCAG
CCACCAGGCGAGAAGGTTCAGCTGAGGAGGATGAGGATCATATAGAATTGACATTACCCAAAAGGAAGAAAATGAGCAAGAAAAGGAAGGCTAGAATTGGTCTGGTTGAA
TGCTCTTTGGAAGATAACATGAGTCAACAGAAAAGGAAAAAGGGAGTTGCATGCTTAGAATCATTCAAGCTTGACTTGATCAATCATGCTTGTGGAGTGATTCGAATCTC
AAGCAAGTTTCTTGCCTTGAGATATTTGATCGACAAGTCAGAAAACCTTAACCTGGGAACTAATTCGAAAGCTCCATTCAATATTCCATGGCTACTGGCAGGAGACTTCA
ATGAAATACTCTATAATGAAGAGAAATCAGGTGGCCCAACTCGAGACAGGAGACTCATGGATAACTTCAGACAGTGCTTGGTTGACTGTGAGTTGAGAGATTTGCCCCCT
CCTGGAGATCAATTCACTTGGCATGGGAACAGGCGAGGAACCCTCATTTGGGAAAGACTTGATTGGTTCTTATGCAATTCCAGTTTCGATTCCTTGTTCAGTAGTGTGGA
TACACAAAACCTTGACTGGATATATTCGGATTACAAACCTATCAAAATCAGGGTGGATTCACAACGTCCCAAGCGACAGAGGAAACAGAGGAACCAATTTAAATTTGAGG
AATTTTGGACCAACTACAATGAATGTGCGGATCTAATAGCTCAAAATGGGAGTTGGAATGATAGTAAGGTGGCTGAATTCATAACTCCTTCAGGGGTCTGGGATTTAGAT
AAAATTAAGAGGGTTGTTGTCTATTTTGATATTGATTCTGTCAGGACATCTCAAGAGGATCTCGGGCTTGTCGTTGTCACTTGTTGGGCGCTTTGGATCGACAGGAACAA
AATTGTCCACAACGAGGATATTCCCTTGATTAATCAAAGAAGCCAGTGGATTAGGAAGTATCTAGATTCCTATTTGAAAGCACATGAGAAATGTTCTTCAAGACTTACCG
GACACAATGGAGTGAATCGTCCTCCGAAAAATCGTTTTGCATGTTGGTCTCCTCCCCCAGCAAGCTTCTGGAAAGTAAACACCGATGCAGCTTGTGATTCTAATTCTCCT
TCAATGGGACGGGGCATGATCGGTAGAGATGATAAGGGTGATATTTTGTTCTCTTCTTCTCTTTGGGTCGACTTCCAAATGAACCCCCTTTTGGCTGAACTTTCAAGCAT
TTACCAAGGAGTGAAGAAAGCAAAGGAAATGGGTTGCTCTAAGCTCATTATTGAATCCGACTGCCAGCTCGCTATTAATTTTTTGTCAAAGAAATCTCCTGTCTGGAGTG
ACCTGGAGGCTTTGGGTAAATCCATTTGGAGTCTTGCATCTTCCTTCAGGGGGCGTTTGGCCCATAGGCGGATTCTGGCGGGCATTGGAGTGGGGAGAGGGATTGTGAAG
CATGTAGTGGACGTGATTCCATCTTGCCGCTCCATAAACTCGACCCTCTCTCTTGCTGCCTCCAAATTTCCATCAATTTTCTTGTATCGTGGCTTCCTCCCATTCAAAAA
AAGTGTTCTCTGTTCTGATTGTGAGGTAGCGTCTGAGGCCTGGTATCAGGGAGGTGGTCGGATGAGTGGAGCTTGGTGGGGGGCTTTCAGTGTCCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGAGGCCGACCAAGGGCATGGGCCCAAACCCGTCATGGATCGACCCATCTTTGTCTCCCAGAGCTCCAAAGAAAACCCTAGCTGAAGAGAAGATAAATAGAGACAA
AGGCCACCTGAGAAAGGGATCGAAAAAGAGAATACTAAATACAGCAGATTTAACCGAGGAAGACTCAAAATGGGAAGATATACAAGCAGTAATCCACGGTGAAGAGTCAG
CCACCAGGCGAGAAGGTTCAGCTGAGGAGGATGAGGATCATATAGAATTGACATTACCCAAAAGGAAGAAAATGAGCAAGAAAAGGAAGGCTAGAATTGGTCTGGTTGAA
TGCTCTTTGGAAGATAACATGAGTCAACAGAAAAGGAAAAAGGGAGTTGCATGCTTAGAATCATTCAAGCTTGACTTGATCAATCATGCTTGTGGAGTGATTCGAATCTC
AAGCAAGTTTCTTGCCTTGAGATATTTGATCGACAAGTCAGAAAACCTTAACCTGGGAACTAATTCGAAAGCTCCATTCAATATTCCATGGCTACTGGCAGGAGACTTCA
ATGAAATACTCTATAATGAAGAGAAATCAGGTGGCCCAACTCGAGACAGGAGACTCATGGATAACTTCAGACAGTGCTTGGTTGACTGTGAGTTGAGAGATTTGCCCCCT
CCTGGAGATCAATTCACTTGGCATGGGAACAGGCGAGGAACCCTCATTTGGGAAAGACTTGATTGGTTCTTATGCAATTCCAGTTTCGATTCCTTGTTCAGTAGTGTGGA
TACACAAAACCTTGACTGGATATATTCGGATTACAAACCTATCAAAATCAGGGTGGATTCACAACGTCCCAAGCGACAGAGGAAACAGAGGAACCAATTTAAATTTGAGG
AATTTTGGACCAACTACAATGAATGTGCGGATCTAATAGCTCAAAATGGGAGTTGGAATGATAGTAAGGTGGCTGAATTCATAACTCCTTCAGGGGTCTGGGATTTAGAT
AAAATTAAGAGGGTTGTTGTCTATTTTGATATTGATTCTGTCAGGACATCTCAAGAGGATCTCGGGCTTGTCGTTGTCACTTGTTGGGCGCTTTGGATCGACAGGAACAA
AATTGTCCACAACGAGGATATTCCCTTGATTAATCAAAGAAGCCAGTGGATTAGGAAGTATCTAGATTCCTATTTGAAAGCACATGAGAAATGTTCTTCAAGACTTACCG
GACACAATGGAGTGAATCGTCCTCCGAAAAATCGTTTTGCATGTTGGTCTCCTCCCCCAGCAAGCTTCTGGAAAGTAAACACCGATGCAGCTTGTGATTCTAATTCTCCT
TCAATGGGACGGGGCATGATCGGTAGAGATGATAAGGGTGATATTTTGTTCTCTTCTTCTCTTTGGGTCGACTTCCAAATGAACCCCCTTTTGGCTGAACTTTCAAGCAT
TTACCAAGGAGTGAAGAAAGCAAAGGAAATGGGTTGCTCTAAGCTCATTATTGAATCCGACTGCCAGCTCGCTATTAATTTTTTGTCAAAGAAATCTCCTGTCTGGAGTG
ACCTGGAGGCTTTGGGTAAATCCATTTGGAGTCTTGCATCTTCCTTCAGGGGGCGTTTGGCCCATAGGCGGATTCTGGCGGGCATTGGAGTGGGGAGAGGGATTGTGAAG
CATGTAGTGGACGTGATTCCATCTTGCCGCTCCATAAACTCGACCCTCTCTCTTGCTGCCTCCAAATTTCCATCAATTTTCTTGTATCGTGGCTTCCTCCCATTCAAAAA
AAGTGTTCTCTGTTCTGATTGTGAGGTAGCGTCTGAGGCCTGGTATCAGGGAGGTGGTCGGATGAGTGGAGCTTGGTGGGGGGCTTTCAGTGTCCACTGA
Protein sequenceShow/hide protein sequence
MLRPTKGMGPNPSWIDPSLSPRAPKKTLAEEKINRDKGHLRKGSKKRILNTADLTEEDSKWEDIQAVIHGEESATRREGSAEEDEDHIELTLPKRKKMSKKRKARIGLVE
CSLEDNMSQQKRKKGVACLESFKLDLINHACGVIRISSKFLALRYLIDKSENLNLGTNSKAPFNIPWLLAGDFNEILYNEEKSGGPTRDRRLMDNFRQCLVDCELRDLPP
PGDQFTWHGNRRGTLIWERLDWFLCNSSFDSLFSSVDTQNLDWIYSDYKPIKIRVDSQRPKRQRKQRNQFKFEEFWTNYNECADLIAQNGSWNDSKVAEFITPSGVWDLD
KIKRVVVYFDIDSVRTSQEDLGLVVVTCWALWIDRNKIVHNEDIPLINQRSQWIRKYLDSYLKAHEKCSSRLTGHNGVNRPPKNRFACWSPPPASFWKVNTDAACDSNSP
SMGRGMIGRDDKGDILFSSSLWVDFQMNPLLAELSSIYQGVKKAKEMGCSKLIIESDCQLAINFLSKKSPVWSDLEALGKSIWSLASSFRGRLAHRRILAGIGVGRGIVK
HVVDVIPSCRSINSTLSLAASKFPSIFLYRGFLPFKKSVLCSDCEVASEAWYQGGGRMSGAWWGAFSVH