CuGenDBv2

Spg022778 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022778
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold2:10786855..10800234
RNA-Seq Expression: Spg022778
Synteny: Spg022778
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
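
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the following Python parses such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the one shown on this page."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalise so that start <= end even if the record is written in reverse.
    if start > end:
        start, end = end, start
    return Location(seqid, start, end)


loc = parse_location("scaffold2:10786855..10800234")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span includes both endpoints; with a half-open convention the length would be one base shorter.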


Homology
GenBank top hits (columns: e value, % identity, alignment)
TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense] | e value: 5.7e-44 | % identity: 27
Query:  FEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQP
        F+   VV  +G  GGLC+LWK E  +++ SYS  HID  +  +N K WR +G YG P   Q+   W L+R+L      PW +GGD NEI+   EK+GG  
Subjt:  FEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQP

Query:  RDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG--HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSINRKRNQFRFEEL
        R    M +F++ ++DC L+DL   G  FTW  NRR   H I+ERLDR + N  + +LFS     +LD++ SDH+PI   +E+  +K     R++F ++  
Subjt:  RDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG--HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSINRKRNQFRFEEL

Query:  WTRYEECSDLIKENGDWEGIDPIYSDSKVADFITPSGGWDIDLIHKAVINFDCDSIKAVPINNNLEDKLIWHYDRTAKSIWNLMHNCVFLEENFAGSFID
        W           E  D+  +D  +S     DF+    G  +D+++     F C                                               
Subjt:  WTRYEECSDLIKENGDWEGIDPIYSDSKVADFITPSGGWDIDLIHKAVINFDCDSIKAVPINNNLEDKLIWHYDRTAKSIWNLMHNCVFLEENFAGSFID

Query:  RWIKIDATSSSENLGKAAVICWALWSDRNKISHGENISPIS--IRRKWIEDYLDSFRHANVNRHTIRVPTNASSSSTRIAAKRLHPHDRNGDACWTPPSD
                          V+ W +W  RN++ + ++   +       W   ++  F+ A       +     S    R+A K            W P   
Subjt:  RWIKIDATSSSENLGKAAVICWALWSDRNKISHGENISPIS--IRRKWIEDYLDSFRHANVNRHTIRVPTNASSSSTRIAAKRLHPHDRNGDACWTPPSD

Query:  GFWKLNTDAACSDSPPYTGLGMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWE
        G +K+NTDA        TG+G++ RD  G V  +        L     E   +L G R+AL  G     +ESDS   +N I    I+ +++  ++  I  
Subjt:  GFWKLNTDAACSDSPPYTGLGMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWE

Query:  LAQLFSEISFKFIPRMQNGEADSLAKFAKFTKCSETW
        +       S  F+PR+ N  A SLAK +   +    W
Subjt:  LAQLFSEISFKFIPRMQNGEADSLAKFAKFTKCSETW

XP_004310201.2 PREDICTED: uncharacterized protein LOC101298860 [Fragaria vesca subsp. vesca] | e value: 2.8e-43 | % identity: 47.47
Query:  GGLCILWKEEEMLSIKSYSCNHIDCDIQWKN-FKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQPRDRKLMEDFRKCI
        GGLC+LW+E   LS+ SYS NHI   I+      WRF+G+YGF +  ++  TW+LIR L   G   WLLGGD NEILR  EK GG PRD + ME F+KC+
Subjt:  GGLCILWKEEEMLSIKSYSCNHIDCDIQWKN-FKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQPRDRKLMEDFRKCI

Query:  DDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSINR-KRNQFRFEELWTRYEECSDLIK
        DDC L DL  SG +FTW G R G  +K RLDR++ N  ++ +FSA   T++    SDH P+   +E+   + I R KR +F  EE W R  EC D+++
Subjt:  DDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSINR-KRNQFRFEELWTRYEECSDLIK

XP_015382608.1 uncharacterized protein LOC107175577 [Citrus sinensis] | e value: 3.7e-43 | % identity: 39.83
Query:  KFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQ
        KF+ CF V   G  GGL +LW EE  + IKS+S +HID ++Q  N +  R + +YG PE  QK  TW L+R+L     SPWL  GD NEIL   EK GG 
Subjt:  KFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQ

Query:  PRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG-HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRK--SINRKRNQFRFE
         R+   +  FR+ + DCEL D+R  G  FTW   R G  FI+ERLDR++CN  + + F      NL+ + SDH P+   V+ +G +  +  R+ ++  +E
Subjt:  PRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG-HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRK--SINRKRNQFRFE

Query:  ELWTRYEECSDLIKE----NGDWEGIDPIYSDSKVA
        ++W+ YE C +++ E    N +W+  +P+Y   K+A
Subjt:  ELWTRYEECSDLIKE----NGDWEGIDPIYSDSKVA

XP_024036939.1 uncharacterized protein LOC112096938 [Citrus clementina] | e value: 1.8e-42 | % identity: 40.87
Query:  FEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKN-FKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQP
        FE C  V   G  GG+ +LWK +  + I SYS +HID + Q  N  + R +G+YG P+T QK  TW L+R+L +  +SPWL  GD NEIL  +EK GG  
Subjt:  FEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKN-FKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQP

Query:  RDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGH-FIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELK--GRKSINRKRNQFRFEE
        R+  L+ DFR+ + DC LKD+   G  FTW   R G  F++ERLDR++CN  + + F   E +NLD + SDH P+   V+ +  G     R  +Q  +E+
Subjt:  RDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGH-FIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELK--GRKSINRKRNQFRFEE

Query:  LWTRYEECSDLIKEN----GDWEGIDPIYS
         W+ Y+ C ++IKE      +W   DP  S
Subjt:  LWTRYEECSDLIKEN----GDWEGIDPIYS

XP_024039545.1 uncharacterized protein LOC112098147 [Citrus clementina] | e value: 1.1e-42 | % identity: 38.17
Query:  LKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFKW-RFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEE
        + R   +E CF V S G  GGL +LWK E  + IKS++ +HID ++  +N K  R +G+YG P+ RQ+  TW L+R+L    ++PW   GD NEIL   E
Subjt:  LKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFKW-RFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEE

Query:  KLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG-HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKG-RKSINRKR-N
        K GG  R   L+ DFR+ + DC+L D+   G  FTW   R G  F++ERLDR++CN  + ++F  +  TN+D + SDH P+   V+++G   + N++R  
Subjt:  KLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG-HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKG-RKSINRKR-N

Query:  QFRFEELWTRYEECSDLIKE----NGDWEGIDPIYSDSKVA
        +  +E++W+ Y+ C ++++E     G W   +P+    KVA
Subjt:  QFRFEELWTRYEECSDLIKE----NGDWEGIDPIYSDSKVA

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2P6SDG4 Putative RNA-directed DNA polymerase | e value: 1.2e-42 | % identity: 37.11
Query:  KVRCSQKDKLPMDEKGIRKRKGERLKRVCKFEGCFVVRSQG----ARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK--WRFSGLYGFPETRQKAQT
        K RC+  +      KG+R+R G   K        FV + +G      GGLC+LW ++  +S++SYS NHID  I  +     WRF+G+YGFP+  ++++T
Subjt:  KVRCSQKDKLPMDEKGIRKRKGERLKRVCKFEGCFVVRSQG----ARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK--WRFSGLYGFPETRQKAQT

Query:  WELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLD
        W L+++L + GN PW++GGD NEI    +K+GG  R  +LM D ++ +  CEL D++  G  FTW G R G  ++ RLDR+ C+  + +LF A    +LD
Subjt:  WELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLD

Query:  WFFSDHKPIEARVELKGRKSINRKRNQFRFEELWTRYEECSDLIKENGDWEGIDPI
           SDH PI   V ++ ++   RK+ +F+FEE W   E C +++K +  WE +  +
Subjt:  WFFSDHKPIEARVELKGRKSINRKRNQFRFEELWTRYEECSDLIKENGDWEGIDPI

A0A5C7IIT4 Uncharacterized protein | e value: 2.7e-44 | % identity: 27
Query:  FEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQP
        F+   VV  +G  GGLC+LWK E  +++ SYS  HID  +  +N K WR +G YG P   Q+   W L+R+L      PW +GGD NEI+   EK+GG  
Subjt:  FEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQP

Query:  RDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG--HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSINRKRNQFRFEEL
        R    M +F++ ++DC L+DL   G  FTW  NRR   H I+ERLDR + N  + +LFS     +LD++ SDH+PI   +E+  +K     R++F ++  
Subjt:  RDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRG--HFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSINRKRNQFRFEEL

Query:  WTRYEECSDLIKENGDWEGIDPIYSDSKVADFITPSGGWDIDLIHKAVINFDCDSIKAVPINNNLEDKLIWHYDRTAKSIWNLMHNCVFLEENFAGSFID
        W           E  D+  +D  +S     DF+    G  +D+++     F C                                               
Subjt:  WTRYEECSDLIKENGDWEGIDPIYSDSKVADFITPSGGWDIDLIHKAVINFDCDSIKAVPINNNLEDKLIWHYDRTAKSIWNLMHNCVFLEENFAGSFID

Query:  RWIKIDATSSSENLGKAAVICWALWSDRNKISHGENISPIS--IRRKWIEDYLDSFRHANVNRHTIRVPTNASSSSTRIAAKRLHPHDRNGDACWTPPSD
                          V+ W +W  RN++ + ++   +       W   ++  F+ A       +     S    R+A K            W P   
Subjt:  RWIKIDATSSSENLGKAAVICWALWSDRNKISHGENISPIS--IRRKWIEDYLDSFRHANVNRHTIRVPTNASSSSTRIAAKRLHPHDRNGDACWTPPSD

Query:  GFWKLNTDAACSDSPPYTGLGMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWE
        G +K+NTDA        TG+G++ RD  G V  +        L     E   +L G R+AL  G     +ESDS   +N I    I+ +++  ++  I  
Subjt:  GFWKLNTDAACSDSPPYTGLGMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWE

Query:  LAQLFSEISFKFIPRMQNGEADSLAKFAKFTKCSETW
        +       S  F+PR+ N  A SLAK +   +    W
Subjt:  LAQLFSEISFKFIPRMQNGEADSLAKFAKFTKCSETW

A0A803NSJ4 Uncharacterized protein | e value: 1.5e-42 | % identity: 35.56
Query:  KKEGDGREIDKEDLPIEEASEVSFPILRRKSSW-----KRKVRCSQKDKLPMDEKGIRKRKGERLKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKS
        KK+G  R+I  E +PI  + E    I    + W     K  V+    D + + E  ++K K E L+    FEGCFVV + G  GGL +LW      +I S
Subjt:  KKEGDGREIDKEDLPIEEASEVSFPILRRKSSW-----KRKVRCSQKDKLPMDEKGIRKRKGERLKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKS

Query:  YSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTW
        +S  HID  I+ +  + WRF+G YG P+  Q+ ++W+L+ ++    + PW++GGD NEILR +EK+GGQP+   L+ +FRK +D   LK++   G  +TW
Subjt:  YSCNHIDCDIQWKNFK-WRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTW

Query:  YGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSI--NRKRNQFRFEELWTRYEECSDLIKEN
           R+ +FI ERLDR   N  + ++F A +  +L+   SDH P+  +   +  +++  +R  ++F FE  W   E+CS ++ EN
Subjt:  YGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSI--NRKRNQFRFEELWTRYEECSDLIKEN

A0A803PF28 Uncharacterized protein | e value: 3.1e-40 | % identity: 35.63
Query:  EKGIRKRKGERLKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWK-NFKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLG
        E  +   + E ++    FEGCF+V S+G  GGL +LW +E  + IKSY+ +HID  ++    F WRF+G YG P+   + ++W+L+ +LK   +  W+ G
Subjt:  EKGIRKRKGERLKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWK-NFKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLG

Query:  GDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKP--IEARVELK
        GD NEI+  +EK GG P+   LM++FR  I  C L +L + G  +TW+  R  + I E+LDR LCN  ++  F   + + L+W+ SDH+P  ++ ++  K
Subjt:  GDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKP--IEARVELK

Query:  GRKSINRKRNQFRFEELWTRYEECSDLIKEN-GDWEGIDPIYSDSKV
        G     R  ++F FE+ W   EEC  +I++  G  + + P+ + S++
Subjt:  GRKSINRKRNQFRFEELWTRYEECSDLIKEN-GDWEGIDPIYSDSKV

A0A803PF28 Uncharacterized protein | e value: 5.8e-10 | % identity: 30.87
Query:  WTPPSDGFWKLNTDAACSDSPPYTGLGMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEAL
        W PP   F  +NTDA+         L  + RD  G + VA + F       +  E   IL G+++A+     K ++ SDS   I  I   S  +SD   L
Subjt:  WTPPSDGFWKLNTDAACSDSPPYTGLGMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEAL

Query:  VTSIWELAQLFSEISFKFIPRMQNGEADSLAKFAKFTKCSETWDSVLPS
        +  +  +   F  + F F+PR  N  A+SLAK+++  + S  W   LPS
Subjt:  VTSIWELAQLFSEISFKFIPRMQNGEADSLAKFAKFTKCSETWDSVLPS

A0A803PF28 Uncharacterized protein | e value: 5.4e-40 | % identity: 37.5
Query:  MDEKGIRKRKGERLKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDI-QWKNFKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWL
        + E  + K + E L+    F GCFVV ++G  GGL +LW E+   SI S+S  HID  I + +N  WRF+G YG P+  ++  +W L+ ++      PW+
Subjt:  MDEKGIRKRKGERLKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDI-QWKNFKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWL

Query:  LGGDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELK
        +GGD NEILR +EK GG P+ R LM +FRK +DDC L+++   G +FTW   R+ + I ERLDR   N  + +LF A +  +L+   SDH P+    + +
Subjt:  LGGDMNEILRDEEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELK

Query:  GRKSIN--RKRNQFRFEELWTRYEECSDLIKE
         + +I   +  ++F FE  W   + C +L+ +
Subjt:  GRKSIN--RKRNQFRFEELWTRYEECSDLIKE

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein | e value: 3.1e-08 | % identity: 34.04
Query:  QKAQTWELIRKLKLSG---NSPWLLGGDMNEILRDEEKLGGQPRDRKL--MEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCN
        ++   W+ I +L  S    NSPWL+ GD N+I    E     P +  L  +ED + C+ D +L DL   G L+TW  +++ + I  +LDR + N
Subjt:  QKAQTWELIRKLKLSG---NSPWLLGGDMNEILRDEEKLGGQPRDRKL--MEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCN

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 9.4e-05 | % identity: 34.57
Query:  IMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWELAQLFSEISFKFIPRMQNGEADSLAK
        +M E   +   ++ A S G  K+ + SDSQQ I  I  +S   ++   ++  I  L+  F+++SF F+PR +N  AD LAK
Subjt:  IMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWELAQLFSEISFKFIPRMQNGEADSLAK

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 7.2e-13 | % identity: 27.95
Query:  ICWALWSDRNK-ISHGENISPISIRRKWIEDYLDSFRHANVNRHTIRVPTNASSSSTRIAAKRLHPHDRNGDACWTPPSDGFWKLNTDAACSDSPPYTGL
        + W LW +RN+ +  G   +   + R+  ED L+ +          R+ T A S  T+         +R+    W PP   + K NTDA  +      G+
Subjt:  ICWALWSDRNK-ISHGENISPISIRRKWIEDYLDSFRHANVNRHTIRVPTNASSSSTRIAAKRLHPHDRNGDACWTPPSDGFWKLNTDAACSDSPPYTGL

Query:  GMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMR---MALSR-GCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWELAQLFSEISFKFIPRM
        G + R++ GEVK   +      L  +   L   LE MR   ++LSR     +  ESDSQ  I  +     IW  ++  +  +  L   F+E+ F FIPR 
Subjt:  GMICRDKIGEVKVAASIFLDYRLDSIMGELRGILEGMR---MALSR-GCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWELAQLFSEISFKFIPRM

Query:  QNGEADSLAKFA-KFTKCSETWDSVLPSW
         N  A+ +A+ +  F        S++PSW
Subjt:  QNGEADSLAKFA-KFTKCSETWDSVLPSW
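
The pairwise alignments above follow the usual BLAST text layout, in which the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap. The sketch below counts those marks for one alignment row. `midline_stats` is a hypothetical helper, and the percentage it returns covers only the row passed in, so it should not be expected to reproduce the database's % identity column, which is computed over the full alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment row.

    `query` is the aligned query row (gaps included) and `midline` is the
    row printed beneath it. Text exports often drop trailing blanks from
    the midline, so it is padded back to the width of the query row.
    """
    columns = len(query)
    midline = midline.ljust(columns)[:columns]
    identities = sum(ch.isalpha() for ch in midline)   # letters = exact matches
    positives = identities + midline.count("+")        # plus conservative changes
    pct_identity = round(100.0 * identities / columns, 2) if columns else 0.0
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "pct_identity": pct_identity,
    }


# First nine columns of the Fragaria vesca hit shown above.
print(midline_stats("GGLCILWKE", "GGLC+LW+E"))
```

Summing the per-row counts over every row of a hit and dividing by the total number of columns gives the whole-alignment figure.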


Sequences
CDS sequence
ATGATATTGCTGATATCGTTGAAGGTCCTGTTGACAAAGTTGATCTGGAATTTCAAAGTTCCTCGTAGGGTGAAAATCTTTCTCTGGCCGTTGGTTCACAGGAGTTTAAA
CATGCAGGAGAGATTGCAAAGGAAGCGGTGTCCACAGCTCTTTGCCCCTCAATTTGCTCGCTTTTGCTGGGAGAGGATCTTTAGCCTTTTTGACCTGGAAGTCTGTCTTC
CTAGACAAGTGGATGGGTGGTTCTCTGAAGCTCAGTGGGTGGAAGTTGAAAAGAAAAGCCTAGATTTTATGGCGGTTTGTGTCGAGAGCCCTTTTATGGGGTTTATGGCT
GGAATGGAACAAGAGGGAGTTGAAGAAGGCTCTGCAGTCTATTTGCTAATAAGAGAAAAAGGGTGCTTAAGCAAATATGGTTGTGGCACTTGTTGGGAGTACATCAGTTA
TGGCAGATGCAATGAAGGTGGCTGCAGAGGAGACCTCCCACCTTCTCGAAGTGGTGGAGGAATGTACATTCCTGATAACAAAGACAAGGAGATGTGTCATTTGGTTCTGT
TAAAGAACTGGAACATTTTCCGAGGTTTCTCTTCACCATCAAAACCTCCAGAAGGATCAAGAAAACAAGAAGATAAAGCTGAACATGAAGACTTAGTAAAGAACATAAAT
AAAGAAACAGGGGATAGAAACAAAAAAAGAATTGATGTTGATCTAAACCTGGAAAGCCCAATAGAAGAGGATGCAGAGCTCGGTTTGAATGGAAGAAAAGAAATTGATAT
TTTAGAATTAATGGAAGAAAGTGATGAAACTATCAGAGAAAGGTGCACATGGAATGTTAACCAAAATGAAAATTCTGAATCGAAAAAAGAAGGTGATGGAAGAGAAATAG
ACAAGGAAGATTTACCAATCGAGGAAGCAAGTGAGGTTTCATTTCCTATCCTAAGAAGGAAGTCAAGTTGGAAGAGGAAGGTAAGGTGTTCTCAGAAAGATAAGCTTCCA
ATGGATGAGAAAGGAATCAGGAAAAGGAAGGGAGAAAGATTGAAGCGGGTATGCAAATTTGAAGGCTGTTTTGTGGTTCGAAGTCAAGGGGCTAGAGGTGGCTTATGTAT
TCTTTGGAAAGAAGAGGAAATGCTCTCGATTAAATCTTACTCATGTAATCATATAGATTGCGACATTCAGTGGAAGAATTTCAAATGGAGGTTTTCTGGTCTATATGGGT
TTCCAGAAACAAGACAGAAAGCTCAGACCTGGGAGCTTATTCGCAAATTGAAGCTTTCTGGTAACTCACCTTGGTTACTTGGTGGAGATATGAATGAAATACTAAGAGAT
GAAGAGAAACTGGGTGGCCAACCAAGAGATAGAAAATTAATGGAAGACTTTCGAAAATGTATTGATGACTGTGAGTTAAAAGACTTGAGATCTAGTGGAGAATTGTTCAC
TTGGTATGGTAATAGGAGAGGTCATTTTATAAAGGAAAGACTGGATCGTTATCTCTGCAATCACTTGTTTGAAAATTTATTTTCTGCCATAGAAACAACAAACCTCGATT
GGTTTTTCTCTGACCATAAACCAATTGAGGCTCGGGTAGAACTAAAAGGAAGAAAATCTATAAACAGGAAGAGAAATCAATTCAGGTTTGAGGAGCTATGGACTAGATAT
GAGGAATGTTCTGATCTAATCAAAGAAAATGGCGACTGGGAAGGTATTGATCCAATTTATTCAGACAGCAAAGTCGCTGACTTTATAACACCTTCAGGAGGGTGGGATAT
CGATTTAATTCATAAGGCAGTAATAAATTTTGATTGTGATTCCATCAAGGCGGTTCCTATTAATAATAATCTAGAAGACAAACTTATATGGCACTATGATAGGACAGCTA
AAAGTATCTGGAATCTAATGCATAACTGTGTTTTTTTGGAGGAAAATTTCGCTGGGAGCTTTATTGATAGGTGGATTAAGATCGATGCTACAAGCTCATCAGAAAATTTG
GGGAAAGCAGCCGTTATCTGTTGGGCGCTATGGTCAGACAGAAACAAAATATCGCACGGGGAGAATATTTCACCTATCTCTATTCGAAGAAAGTGGATCGAGGATTACCT
GGATTCGTTTCGGCATGCAAATGTTAATCGTCACACGATCAGAGTTCCCACTAATGCTTCTTCTTCCTCGACTAGAATTGCAGCCAAGCGCCTCCATCCTCATGACCGAA
ATGGTGATGCATGCTGGACTCCCCCTTCCGATGGATTCTGGAAGCTAAATACAGATGCAGCGTGTTCCGATTCTCCTCCTTACACGGGGTTAGGCATGATCTGTAGAGAC
AAAATCGGTGAAGTGAAGGTGGCAGCATCTATTTTTCTGGATTACCGGTTGGATTCGATAATGGGAGAGTTGAGAGGAATTTTGGAAGGGATGAGGATGGCTCTGAGTAG
GGGCTGTGAGAAAATCGAAGTGGAGTCTGATAGTCAACAAGCCATTAATTTCATCCAGCGAAAATCTATCATTTGGAGTGACGTGGAAGCTTTAGTGACATCCATCTGGG
AGCTAGCGCAATTATTTTCTGAGATTTCGTTCAAATTCATTCCAAGGATGCAAAATGGAGAAGCAGATTCGTTAGCTAAATTCGCGAAATTCACAAAATGTAGTGAGACT
TGGGATTCTGTACTCCCAAGTTGGCTGAGTATTGGGCCTGATGGGCCTTTTTCTTTGCCCTTGTGGCGTTAA
mRNA sequence
ATGATATTGCTGATATCGTTGAAGGTCCTGTTGACAAAGTTGATCTGGAATTTCAAAGTTCCTCGTAGGGTGAAAATCTTTCTCTGGCCGTTGGTTCACAGGAGTTTAAA
CATGCAGGAGAGATTGCAAAGGAAGCGGTGTCCACAGCTCTTTGCCCCTCAATTTGCTCGCTTTTGCTGGGAGAGGATCTTTAGCCTTTTTGACCTGGAAGTCTGTCTTC
CTAGACAAGTGGATGGGTGGTTCTCTGAAGCTCAGTGGGTGGAAGTTGAAAAGAAAAGCCTAGATTTTATGGCGGTTTGTGTCGAGAGCCCTTTTATGGGGTTTATGGCT
GGAATGGAACAAGAGGGAGTTGAAGAAGGCTCTGCAGTCTATTTGCTAATAAGAGAAAAAGGGTGCTTAAGCAAATATGGTTGTGGCACTTGTTGGGAGTACATCAGTTA
TGGCAGATGCAATGAAGGTGGCTGCAGAGGAGACCTCCCACCTTCTCGAAGTGGTGGAGGAATGTACATTCCTGATAACAAAGACAAGGAGATGTGTCATTTGGTTCTGT
TAAAGAACTGGAACATTTTCCGAGGTTTCTCTTCACCATCAAAACCTCCAGAAGGATCAAGAAAACAAGAAGATAAAGCTGAACATGAAGACTTAGTAAAGAACATAAAT
AAAGAAACAGGGGATAGAAACAAAAAAAGAATTGATGTTGATCTAAACCTGGAAAGCCCAATAGAAGAGGATGCAGAGCTCGGTTTGAATGGAAGAAAAGAAATTGATAT
TTTAGAATTAATGGAAGAAAGTGATGAAACTATCAGAGAAAGGTGCACATGGAATGTTAACCAAAATGAAAATTCTGAATCGAAAAAAGAAGGTGATGGAAGAGAAATAG
ACAAGGAAGATTTACCAATCGAGGAAGCAAGTGAGGTTTCATTTCCTATCCTAAGAAGGAAGTCAAGTTGGAAGAGGAAGGTAAGGTGTTCTCAGAAAGATAAGCTTCCA
ATGGATGAGAAAGGAATCAGGAAAAGGAAGGGAGAAAGATTGAAGCGGGTATGCAAATTTGAAGGCTGTTTTGTGGTTCGAAGTCAAGGGGCTAGAGGTGGCTTATGTAT
TCTTTGGAAAGAAGAGGAAATGCTCTCGATTAAATCTTACTCATGTAATCATATAGATTGCGACATTCAGTGGAAGAATTTCAAATGGAGGTTTTCTGGTCTATATGGGT
TTCCAGAAACAAGACAGAAAGCTCAGACCTGGGAGCTTATTCGCAAATTGAAGCTTTCTGGTAACTCACCTTGGTTACTTGGTGGAGATATGAATGAAATACTAAGAGAT
GAAGAGAAACTGGGTGGCCAACCAAGAGATAGAAAATTAATGGAAGACTTTCGAAAATGTATTGATGACTGTGAGTTAAAAGACTTGAGATCTAGTGGAGAATTGTTCAC
TTGGTATGGTAATAGGAGAGGTCATTTTATAAAGGAAAGACTGGATCGTTATCTCTGCAATCACTTGTTTGAAAATTTATTTTCTGCCATAGAAACAACAAACCTCGATT
GGTTTTTCTCTGACCATAAACCAATTGAGGCTCGGGTAGAACTAAAAGGAAGAAAATCTATAAACAGGAAGAGAAATCAATTCAGGTTTGAGGAGCTATGGACTAGATAT
GAGGAATGTTCTGATCTAATCAAAGAAAATGGCGACTGGGAAGGTATTGATCCAATTTATTCAGACAGCAAAGTCGCTGACTTTATAACACCTTCAGGAGGGTGGGATAT
CGATTTAATTCATAAGGCAGTAATAAATTTTGATTGTGATTCCATCAAGGCGGTTCCTATTAATAATAATCTAGAAGACAAACTTATATGGCACTATGATAGGACAGCTA
AAAGTATCTGGAATCTAATGCATAACTGTGTTTTTTTGGAGGAAAATTTCGCTGGGAGCTTTATTGATAGGTGGATTAAGATCGATGCTACAAGCTCATCAGAAAATTTG
GGGAAAGCAGCCGTTATCTGTTGGGCGCTATGGTCAGACAGAAACAAAATATCGCACGGGGAGAATATTTCACCTATCTCTATTCGAAGAAAGTGGATCGAGGATTACCT
GGATTCGTTTCGGCATGCAAATGTTAATCGTCACACGATCAGAGTTCCCACTAATGCTTCTTCTTCCTCGACTAGAATTGCAGCCAAGCGCCTCCATCCTCATGACCGAA
ATGGTGATGCATGCTGGACTCCCCCTTCCGATGGATTCTGGAAGCTAAATACAGATGCAGCGTGTTCCGATTCTCCTCCTTACACGGGGTTAGGCATGATCTGTAGAGAC
AAAATCGGTGAAGTGAAGGTGGCAGCATCTATTTTTCTGGATTACCGGTTGGATTCGATAATGGGAGAGTTGAGAGGAATTTTGGAAGGGATGAGGATGGCTCTGAGTAG
GGGCTGTGAGAAAATCGAAGTGGAGTCTGATAGTCAACAAGCCATTAATTTCATCCAGCGAAAATCTATCATTTGGAGTGACGTGGAAGCTTTAGTGACATCCATCTGGG
AGCTAGCGCAATTATTTTCTGAGATTTCGTTCAAATTCATTCCAAGGATGCAAAATGGAGAAGCAGATTCGTTAGCTAAATTCGCGAAATTCACAAAATGTAGTGAGACT
TGGGATTCTGTACTCCCAAGTTGGCTGAGTATTGGGCCTGATGGGCCTTTTTCTTTGCCCTTGTGGCGTTAA
Protein sequence
MILLISLKVLLTKLIWNFKVPRRVKIFLWPLVHRSLNMQERLQRKRCPQLFAPQFARFCWERIFSLFDLEVCLPRQVDGWFSEAQWVEVEKKSLDFMAVCVESPFMGFMA
GMEQEGVEEGSAVYLLIREKGCLSKYGCGTCWEYISYGRCNEGGCRGDLPPSRSGGGMYIPDNKDKEMCHLVLLKNWNIFRGFSSPSKPPEGSRKQEDKAEHEDLVKNIN
KETGDRNKKRIDVDLNLESPIEEDAELGLNGRKEIDILELMEESDETIRERCTWNVNQNENSESKKEGDGREIDKEDLPIEEASEVSFPILRRKSSWKRKVRCSQKDKLP
MDEKGIRKRKGERLKRVCKFEGCFVVRSQGARGGLCILWKEEEMLSIKSYSCNHIDCDIQWKNFKWRFSGLYGFPETRQKAQTWELIRKLKLSGNSPWLLGGDMNEILRD
EEKLGGQPRDRKLMEDFRKCIDDCELKDLRSSGELFTWYGNRRGHFIKERLDRYLCNHLFENLFSAIETTNLDWFFSDHKPIEARVELKGRKSINRKRNQFRFEELWTRY
EECSDLIKENGDWEGIDPIYSDSKVADFITPSGGWDIDLIHKAVINFDCDSIKAVPINNNLEDKLIWHYDRTAKSIWNLMHNCVFLEENFAGSFIDRWIKIDATSSSENL
GKAAVICWALWSDRNKISHGENISPISIRRKWIEDYLDSFRHANVNRHTIRVPTNASSSSTRIAAKRLHPHDRNGDACWTPPSDGFWKLNTDAACSDSPPYTGLGMICRD
KIGEVKVAASIFLDYRLDSIMGELRGILEGMRMALSRGCEKIEVESDSQQAINFIQRKSIIWSDVEALVTSIWELAQLFSEISFKFIPRMQNGEADSLAKFAKFTKCSET
WDSVLPSWLSIGPDGPFSLPLWR
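
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the page does not state which table was used, and `translate` is a hypothetical helper rather than a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequence text can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases and last 12 bases of the CDS shown above.
print(translate("ATGATATTGCTGATATCGTTGAAGGTC"))
print(translate("TTGTGGCGTTAA"))
```

To check the whole record, paste the full CDS text into `translate` and compare the result with the protein sequence; the two excerpts here correspond to the first nine and last three residues.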