; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029357 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029357
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold12:32609572..32617288
RNA-Seq ExpressionSpg029357
SyntenySpg029357
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]1.3e-3939.27Show/hide
Query:  QKLQRWHIDAVVKEKDW-HWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEF
        Q     HIDA++++ D   WRF+G+YGNP     + TW L+ RL  G   PW+VGGDFNE+ + NEK+GG  ++E  M+ FR++I +C+L D G+ GP++
Subjt:  QKLQRWHIDAVVKEKDW-HWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEF

Query:  TWCNNHVNGQMIWERLDRLLLTHDMQEKCSLFKVF---HLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGD--RMP
        TWCN     + I ERLDR L  +     C+LF  F   H     SDH P+   WF+ + ++++ +  +  RFE MWV  E+C  I+ RVW   G+  RM 
Subjt:  TWCNNHVNGQMIWERLDRLLLTHDMQEKCSLFKVF---HLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGD--RMP

Query:  GNIMNKTRDCLERLGNWSR
         +++   + C E+L  W++
Subjt:  GNIMNKTRDCLERLGNWSR

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]4.5e-4545.89Show/hide
Query:  VVKEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNHVNGQM
        +VKE  ++WRF+GIYG+  +D    TW L+ RL     LPWI+GGDFNEI   +EK  G+ + +  MQ F+D +D C L DPG++G  FTWC+ H   Q 
Subjt:  VVKEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNHVNGQM

Query:  IWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWF--EDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMPGNIMNKTRDCLERL
        IWERLDR L+   + +     ++ HL ++ASDHRPILAEW    +  V R+ + +RP RFEE W  ++ECK+IVRRVW   GD        K   CLE L
Subjt:  IWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWF--EDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMPGNIMNKTRDCLERL

Query:  GNWSRDR
          W+  R
Subjt:  GNWSRDR

XP_023871634.1 uncharacterized protein LOC111984238 [Quercus suber]4.4e-4040.98Show/hide
Query:  HIDAVV-KEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH
        HIDA++ K K+  WRF+G YG      HH +W  + RLK   +LPWI  GDFNEI   +EK GG  +    M+EFRD++DEC   D GY G +FTWCN H
Subjt:  HIDAVV-KEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH

Query:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFE-DKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMP-GNIMNKTRD
          G  +WER+DR + T D        KV HL    SDH+PIL       KRV + W      RFE+MW++ E C++++   W  +    P   +  K   
Subjt:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFE-DKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMP-GNIMNKTRD

Query:  CLERL
        C + L
Subjt:  CLERL

XP_030927178.1 uncharacterized protein LOC115953579 [Quercus lobata]7.5e-4042.86Show/hide
Query:  HIDAVVKEK-DWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH
        HID  +K+  +  WRF+G YG P+    H  W  +  LK   S PW+  GDFNEIT Q+EK+GG  +    MQ FRD++DEC   D GYIGP+FTW + H
Subjt:  HIDAVVKEK-DWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH

Query:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMPGNI--MNKTRD
          G  +WERLDR + T+D  EK    KV+HL    SDH+P+   W   + +   ++ +RP RFE+MW+    C D +  VW+ SGD  P +I  +NK   
Subjt:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMPGNI--MNKTRD

Query:  CLERLGNWSR
        C   L  WS+
Subjt:  CLERLGNWSR

XP_030970475.1 uncharacterized protein LOC115990845 [Quercus lobata]9.8e-4039.37Show/hide
Query:  HIDAVV-KEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH
        HID +V K KD  WRF G YG PN    H  W  +  LK   S PWI  GDFNEIT Q+EK+GG  +    MQ FR+++DEC   D G++G EFTW + H
Subjt:  HIDAVV-KEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH

Query:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQ-ESGDRMPGNIMNKTRDC
             +WERLDR + T D        K++HL   ASDHRPIL               +RP RFE+MW+  + C + V+ VWQ ++G+     ++ K  +C
Subjt:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQ-ESGDRMPGNIMNKTRDC

Query:  LERLGNWSRDRRETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLE
         + L  WSR+      ++  E + T+ L  K     +LV +N   N   K YLE
Subjt:  LERLGNWSRDRRETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLE

TrEMBL top hitse value%identityAlignment
A0A2N9IIR5 Uncharacterized protein2.6e-3838.28Show/hide
Query:  HIDAVVKEKDWH-WRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH
        HID+++ E     WRF+G YG P     H +W ++  L    SLPW   GDFNE+ + +EK+GG  + +  MQ FRD++DEC   D G+ GPEFTWCNN 
Subjt:  HIDAVVKEKDWH-WRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH

Query:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGD-RMPGNIMNKTRDC
        +NG  +WERLDR+++  +   +     V+H+    SDH P+   W     V      K+  RFE MW+  E C++ V   W+ + D      + N+   C
Subjt:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGD-RMPGNIMNKTRDC

Query:  LERLGNWSR
          RL  WSR
Subjt:  LERLGNWSR

A0A2N9IXK4 RNase H domain-containing protein2.6e-3840.28Show/hide
Query:  HIDAVVKEKDWH-WRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH
        HIDA++ E + + WRF+G YG P     H +W+L+  L    SLPW   GDFNE+ +  EK+GG  ++   MQ+FRD ID C   D G+ GP FTWCNN 
Subjt:  HIDAVVKEKDWH-WRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH

Query:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVW--QESGDRMPGNIMNKTRD
        +    +WERLDR+L T        L +V HL  ++SDH PI  ++      + +    R  RFEEMW+ +  CK+ +   W  Q+ G  M   + +K R 
Subjt:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVW--QESGDRMPGNIMNKTRD

Query:  CLERLGNWSRD
        C   L  WSRD
Subjt:  CLERLGNWSRD

A0A5C7HUN0 Uncharacterized protein9.9e-3828.69Show/hide
Query:  MRVTDAERACVFHLREGTIDESKQKLENVILCKVFTSKTINPEMFCSKMPKIWSQ-EQTMIACVGFNLFLCKFKNACIKGKIIESGLWFFDKAMLLMEDP
        + + DA+ A +  + E       + L++ ++ KV + K +N E F S + ++WS   Q  I   G N F+  F       ++ + G W+FDK +L +E P
Subjt:  MRVTDAERACVFHLREGTIDESKQKLENVILCKVFTSKTINPEMFCSKMPKIWSQ-EQTMIACVGFNLFLCKFKNACIKGKIIESGLWFFDKAMLLMEDP

Query:  KGDSCGEETEF------------------RNTTMEIGSQLEKVEQIDLEDGTEQHWGISLRIKIQVEVSSPLK-----------------CEVTSMEGEE
        +G     +  F                  R T   +  Q+ KV  I++   +++  G  LR+KIQV++S P K                 C      G  
Subjt:  KGDSCGEETEF------------------RNTTMEIGSQLEKVEQIDLEDGTEQHWGISLRIKIQVEVSSPLK-----------------CEVTSMEGEE

Query:  EVGEMVDEEIREKDIGNSVTKVGGRPVAPEHMLE-----------------SSRKVEKNQTPARKVTDCLSEKEKAKNPKQGNKLDENNGIEISSGSKDH
         + E  DE   ++    + TK G    AP  MLE                 +S+K E +   A  V   +   ++   P + +     + I +   S   
Subjt:  EVGEMVDEEIREKDIGNSVTKVGGRPVAPEHMLE-----------------SSRKVEKNQTPARKVTDCLSEKEKAKNPKQGNKLDENNGIEISSGSKDH

Query:  TEGDKSPYAAEAHRLDLT-PCLKLLYELPVGPN-GPADQK--LQRW--------------HIDAVVKEKD-WHWRFSGIYGNPNRDMHHTTWTLMNRLKE
        +E   S   A+ ++  ++ P  + L E P   N   AD+K  L  W              HIDA ++ +D + WRFSG YG+PN      +WTL+ RL+E
Subjt:  TEGDKSPYAAEAHRLDLT-PCLKLLYELPVGPN-GPADQK--LQRW--------------HIDAVVKEKD-WHWRFSGIYGNPNRDMHHTTWTLMNRLKE

Query:  GCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNHVNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRP
           LPW+  GDFNE+ +QNE  GG  K  L+M  FR  +++C L D GY GP + W N       I ERLDR+L  +  ++     +V HL +I SDHRP
Subjt:  GCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNHVNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRP

Query:  IL
        +L
Subjt:  IL

A0A6J1DRA0 uncharacterized protein LOC1110224232.2e-4545.89Show/hide
Query:  VVKEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNHVNGQM
        +VKE  ++WRF+GIYG+  +D    TW L+ RL     LPWI+GGDFNEI   +EK  G+ + +  MQ F+D +D C L DPG++G  FTWC+ H   Q 
Subjt:  VVKEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNHVNGQM

Query:  IWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWF--EDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMPGNIMNKTRDCLERL
        IWERLDR L+   + +     ++ HL ++ASDHRPILAEW    +  V R+ + +RP RFEE W  ++ECK+IVRRVW   GD        K   CLE L
Subjt:  IWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWF--EDKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMPGNIMNKTRDCLERL

Query:  GNWSRDR
          W+  R
Subjt:  GNWSRDR

A0A7N2LUL7 Uncharacterized protein2.3e-3940Show/hide
Query:  HIDAVV-KEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH
        HIDA++ K K+  WRF+G YG      H+ +W  + RLK   +LPWI  GDFNEI   +EK GG  +    M+EFRD++DEC   D GY G +FTWCN H
Subjt:  HIDAVV-KEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGCSLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNH

Query:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFE-DKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMP-GNIMNKTRD
          G  +WERLDR + T D        KV HL    SDH+PI+       K+V + W      RFE+MW++ E C +++   W       P   +  K   
Subjt:  VNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFE-DKRVKRKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMP-GNIMNKTRD

Query:  CLERLGNWSR
        C + L  WS+
Subjt:  CLERLGNWSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein4.4e-0621.75Show/hide
Query:  ETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLENVWKK--SESEESETNKMSKSL--ILCWQIWDYRNK-LIHNKTRSDPTMLQSHIDKYM
        ET  H+ + C  T+ +W       +++  N    W      E+   +    S+   TN + + L   + W++W  RN  L   K +S     +  I    
Subjt:  ETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLENVWKK--SESEESETNKMSKSL--ILCWQIWDYRNK-LIHNKTRSDPTMLQSHIDKYM

Query:  EELQGRGEIYQDISHEASTAAI------GPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGCRTISRPWKIPWLEAMAVCEGIRLLPTD
        E L    E  ++ +   +T  I         W  PP G  K N D+ +++       GW  R+ NG+++  G   +         EA+     ++++   
Subjt:  EELQGRGEIYQDISHEASTAAI------GPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGCRTISRPWKIPWLEAMAVCEGIRLLPTD

Query:  S-PPVQIESDALKVINLLIGKDEDETELKLSTEEVKSLRSGRNIEGFYHVKKKHNQMAHKLA-HKACMSGSSDSWTHSFPSWLLD
            V  ESD+  ++  LI   ED + L     +++             V ++ N  A  LA H         S+T + PSWL++
Subjt:  S-PPVQIESDALKVINLLIGKDEDETELKLSTEEVKSLRSGRNIEGFYHVKKKHNQMAHKLA-HKACMSGSSDSWTHSFPSWLLD

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.8e-1525.36Show/hide
Query:  DRRETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLENVWKKSESEESETNKMSK--SLI--LCWQIWDYRNKLIHNKTRSD-PTMLQSHID
        D RET +HL ++C   + +W         +       W    Y    W    + E E  K+ K  +L+  L W++W  RN+L+      D P +L+  ++
Subjt:  DRRETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLENVWKKSESEESETNKMSK--SLI--LCWQIWDYRNKLIHNKTRSD-PTMLQSHID

Query:  KYMEELQGRGEIYQDISHEASTAAIGPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGCRTISRPWKIPWLEAMAVCEGIRLLPT-DSP
         + EE   R E+    S       +   W  PP    K N DA W  ++ R GIGWI R  +G +++ G R + R   +   E  A+   +  +   +  
Subjt:  KYMEELQGRGEIYQDISHEASTAAIGPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGCRTISRPWKIPWLEAMAVCEGIRLLPT-DSP

Query:  PVQIESDALKVINLLIGKDEDETELKLSTEEVKSLRSGRNIEGFYHVKKKHNQMAHKLAHKACMSGSSDSWTHSF-PSWL
         +  ESDA  ++NLL   D+    L+ + E+++ L        F    +  N++A ++A ++    + D    S  P WL
Subjt:  PVQIESDALKVINLLIGKDEDETELKLSTEEVKSLRSGRNIEGFYHVKKKHNQMAHKLAHKACMSGSSDSWTHSF-PSWL

AT4G29090.1 Ribonuclease H-like superfamily protein5.8e-1423.72Show/hide
Query:  RETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLENVWKKSESEESETNKMSKSLI--LCWQIWDYRNKLIHNKTRSDPTMLQSHIDKYMEE
        +ET +HL ++C   +  W     P  L        WA   Y+   W  +    +   + +  L+  L W++W  RN+L+      +   +    +  +EE
Subjt:  RETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLENVWKKSESEESETNKMSKSLI--LCWQIWDYRNKLIHNKTRSDPTMLQSHIDKYMEE

Query:  LQGRGEIYQDISHEASTAAIGPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGCRTISRPWKIPWLEAMAVCEGIRLLPT-DSPPVQIE
         + R E     +      +    W  PP+   K N DA W+ D++R GIGW+ R   G + + G R + +   +   E  A+   +  L       V  E
Subjt:  LQGRGEIYQDISHEASTAAIGPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGCRTISRPWKIPWLEAMAVCEGIRLLPT-DSPPVQIE

Query:  SDALKVINLLIGKDEDETELKLSTEEVKSLRSGRNIEGFYHVKKKHNQMAHKLAHKACMSGSSDSWTHSF-PSW
        SD+  +I +L   DE    LK + ++++ L S      F  + ++ N +A ++A ++    + D   +S  PSW
Subjt:  SDALKVINLLIGKDEDETELKLSTEEVKSLRSGRNIEGFYHVKKKHNQMAHKLAHKACMSGSSDSWTHSF-PSW

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0423.44Show/hide
Query:  LCWQIWDYRNKLIHNKTRSD-PTMLQSHIDKYMEELQGRGEIYQDISHEASTAAIGPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGC
        L W+IW   N L+ N TR+   T ++  ++   E L       Q   +  +  +    W  P     K N DA+  E +   G+GWI R   G +I  G 
Subjt:  LCWQIWDYRNKLIHNKTRSD-PTMLQSHIDKYMEELQGRGEIYQDISHEASTAAIGPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAGC

Query:  RTISRPWKIPWLEAMAVCEGIRLLPTDSPPVQIESDALKVINLLIGKDEDETELKLSTEEVKS-LRSGRNIEGFYHVKKKHNQMAHKLAHKACMSGSSDS
                    E   +   I+          I     + I  +I        L+   + ++S + S  +IE F    ++ N  A  LA +A    +  S
Subjt:  RTISRPWKIPWLEAMAVCEGIRLLPTDSPPVQIESDALKVINLLIGKDEDETELKLSTEEVKS-LRSGRNIEGFYHVKKKHNQMAHKLAHKACMSGSSDS

Query:  WTHSFPSWL
          HS P +L
Subjt:  WTHSFPSWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGTCACGGATGCCGAAAGAGCTTGCGTGTTTCATCTACGAGAAGGAACAATTGACGAATCGAAACAAAAACTGGAGAATGTCATCCTGTGCAAAGTCTTTACAAG
CAAAACGATCAACCCTGAGATGTTCTGCTCCAAAATGCCAAAGATTTGGAGTCAGGAGCAAACGATGATAGCTTGCGTTGGATTCAACCTGTTTTTATGCAAATTCAAAA
ACGCGTGTATAAAAGGCAAAATCATCGAATCTGGACTGTGGTTCTTCGACAAAGCTATGCTTCTAATGGAGGACCCAAAAGGAGACAGTTGTGGCGAGGAAACGGAGTTC
AGGAACACAACAATGGAAATTGGAAGCCAATTGGAGAAAGTAGAACAAATTGATCTCGAGGATGGAACTGAGCAACATTGGGGCATATCGTTAAGGATAAAGATTCAAGT
AGAGGTCAGTTCACCATTAAAATGTGAGGTCACATCCATGGAAGGGGAAGAGGAAGTGGGAGAGATGGTGGATGAGGAAATAAGAGAAAAGGACATAGGTAATTCGGTGA
CTAAAGTTGGTGGTCGGCCGGTAGCACCCGAACACATGCTGGAATCTTCACGGAAAGTAGAGAAGAACCAAACGCCGGCAAGGAAGGTAACAGATTGTTTGTCCGAAAAG
GAAAAGGCGAAAAATCCGAAGCAAGGCAATAAATTGGACGAGAATAATGGGATAGAGATTTCGTCGGGTTCAAAAGATCATACTGAAGGGGATAAAAGCCCCTACGCAGC
GGAAGCGCATCGATTGGACCTTACGCCTTGCCTCAAATTACTTTATGAGCTACCAGTGGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGATGGCATATTGACGCGG
TGGTTAAAGAGAAGGATTGGCATTGGAGGTTCTCTGGCATCTATGGCAACCCCAATAGAGATATGCATCATACTACTTGGACTTTGATGAATCGACTAAAAGAGGGATGT
AGTCTCCCTTGGATTGTAGGAGGAGATTTTAACGAGATCACTAATCAGAATGAGAAAAAAGGAGGTATGACGAAAGCAGAGTTGGATATGCAAGAGTTCCGTGATATTAT
AGACGAATGTGCGCTTCACGATCCTGGATATATTGGCCCTGAATTCACATGGTGTAATAACCATGTCAATGGCCAGATGATTTGGGAGCGGTTGGATCGGCTCCTTCTTA
CTCATGATATGCAAGAGAAGTGTAGCTTATTTAAAGTCTTTCACTTATCATGGATTGCGTCGGATCATAGGCCAATTTTGGCTGAATGGTTTGAAGATAAAAGGGTTAAG
AGGAAATGGGAGATAAAAAGGCCTAGACGGTTTGAGGAGATGTGGGTTAAATATGAGGAGTGCAAGGACATAGTTCGTAGGGTTTGGCAGGAGAGTGGAGACAGGATGCC
TGGTAACATTATGAATAAAACTAGGGATTGTCTCGAGAGATTAGGGAACTGGAGCAGGGATAGGAGGGAGACAACTTCTCATCTCTTTTGGGAATGCAAAACAACCAAAG
GTCTTTGGCCGAAATACTTCCATCCTACTGACTTAGTATGTTTGAATGACAGGAAAAACTGGGCGGCTAAGGACTACTTGGAAAACGTTTGGAAGAAATCAGAATCGGAA
GAATCAGAGACTAATAAGATGAGCAAGAGTCTAATTCTCTGTTGGCAGATATGGGATTATAGAAACAAACTAATCCATAACAAGACTCGATCAGATCCGACAATGCTTCA
ATCGCATATTGACAAATACATGGAGGAATTGCAAGGAAGAGGAGAAATATACCAGGACATCTCCCACGAAGCTTCGACTGCTGCGATCGGCCCAACATGGCTCCGGCCTC
CAAACGGAATGTGGAAAATTAACTGTGATGCGGCGTGGTCCGAGGATCATCAAAGAGGAGGGATAGGGTGGATTTTTCGTCAATGGAATGGAAACCTTATTTACGCTGGC
TGCCGAACCATCTCCAGACCTTGGAAAATACCGTGGCTGGAAGCGATGGCGGTGTGTGAAGGCATCCGATTGCTGCCGACAGACTCTCCTCCCGTTCAGATTGAAAGCGA
CGCGCTTAAGGTGATCAATCTTCTAATCGGGAAAGACGAAGATGAGACAGAACTAAAACTGTCCACTGAAGAAGTAAAATCCCTAAGGTCAGGTAGGAATATTGAAGGTT
TCTATCATGTGAAGAAGAAACACAATCAGATGGCTCACAAGTTGGCACACAAGGCTTGTATGAGTGGGAGTTCTGATAGTTGGACCCACTCTTTTCCTAGTTGGTTACTT
GATTTAAATGCTACTGATGTTGGTGGTGTTAATACCATTGGTGGGGGATCCTATCCCACAGGTATTATTCCATTGGTTCCTATTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGTCACGGATGCCGAAAGAGCTTGCGTGTTTCATCTACGAGAAGGAACAATTGACGAATCGAAACAAAAACTGGAGAATGTCATCCTGTGCAAAGTCTTTACAAG
CAAAACGATCAACCCTGAGATGTTCTGCTCCAAAATGCCAAAGATTTGGAGTCAGGAGCAAACGATGATAGCTTGCGTTGGATTCAACCTGTTTTTATGCAAATTCAAAA
ACGCGTGTATAAAAGGCAAAATCATCGAATCTGGACTGTGGTTCTTCGACAAAGCTATGCTTCTAATGGAGGACCCAAAAGGAGACAGTTGTGGCGAGGAAACGGAGTTC
AGGAACACAACAATGGAAATTGGAAGCCAATTGGAGAAAGTAGAACAAATTGATCTCGAGGATGGAACTGAGCAACATTGGGGCATATCGTTAAGGATAAAGATTCAAGT
AGAGGTCAGTTCACCATTAAAATGTGAGGTCACATCCATGGAAGGGGAAGAGGAAGTGGGAGAGATGGTGGATGAGGAAATAAGAGAAAAGGACATAGGTAATTCGGTGA
CTAAAGTTGGTGGTCGGCCGGTAGCACCCGAACACATGCTGGAATCTTCACGGAAAGTAGAGAAGAACCAAACGCCGGCAAGGAAGGTAACAGATTGTTTGTCCGAAAAG
GAAAAGGCGAAAAATCCGAAGCAAGGCAATAAATTGGACGAGAATAATGGGATAGAGATTTCGTCGGGTTCAAAAGATCATACTGAAGGGGATAAAAGCCCCTACGCAGC
GGAAGCGCATCGATTGGACCTTACGCCTTGCCTCAAATTACTTTATGAGCTACCAGTGGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGATGGCATATTGACGCGG
TGGTTAAAGAGAAGGATTGGCATTGGAGGTTCTCTGGCATCTATGGCAACCCCAATAGAGATATGCATCATACTACTTGGACTTTGATGAATCGACTAAAAGAGGGATGT
AGTCTCCCTTGGATTGTAGGAGGAGATTTTAACGAGATCACTAATCAGAATGAGAAAAAAGGAGGTATGACGAAAGCAGAGTTGGATATGCAAGAGTTCCGTGATATTAT
AGACGAATGTGCGCTTCACGATCCTGGATATATTGGCCCTGAATTCACATGGTGTAATAACCATGTCAATGGCCAGATGATTTGGGAGCGGTTGGATCGGCTCCTTCTTA
CTCATGATATGCAAGAGAAGTGTAGCTTATTTAAAGTCTTTCACTTATCATGGATTGCGTCGGATCATAGGCCAATTTTGGCTGAATGGTTTGAAGATAAAAGGGTTAAG
AGGAAATGGGAGATAAAAAGGCCTAGACGGTTTGAGGAGATGTGGGTTAAATATGAGGAGTGCAAGGACATAGTTCGTAGGGTTTGGCAGGAGAGTGGAGACAGGATGCC
TGGTAACATTATGAATAAAACTAGGGATTGTCTCGAGAGATTAGGGAACTGGAGCAGGGATAGGAGGGAGACAACTTCTCATCTCTTTTGGGAATGCAAAACAACCAAAG
GTCTTTGGCCGAAATACTTCCATCCTACTGACTTAGTATGTTTGAATGACAGGAAAAACTGGGCGGCTAAGGACTACTTGGAAAACGTTTGGAAGAAATCAGAATCGGAA
GAATCAGAGACTAATAAGATGAGCAAGAGTCTAATTCTCTGTTGGCAGATATGGGATTATAGAAACAAACTAATCCATAACAAGACTCGATCAGATCCGACAATGCTTCA
ATCGCATATTGACAAATACATGGAGGAATTGCAAGGAAGAGGAGAAATATACCAGGACATCTCCCACGAAGCTTCGACTGCTGCGATCGGCCCAACATGGCTCCGGCCTC
CAAACGGAATGTGGAAAATTAACTGTGATGCGGCGTGGTCCGAGGATCATCAAAGAGGAGGGATAGGGTGGATTTTTCGTCAATGGAATGGAAACCTTATTTACGCTGGC
TGCCGAACCATCTCCAGACCTTGGAAAATACCGTGGCTGGAAGCGATGGCGGTGTGTGAAGGCATCCGATTGCTGCCGACAGACTCTCCTCCCGTTCAGATTGAAAGCGA
CGCGCTTAAGGTGATCAATCTTCTAATCGGGAAAGACGAAGATGAGACAGAACTAAAACTGTCCACTGAAGAAGTAAAATCCCTAAGGTCAGGTAGGAATATTGAAGGTT
TCTATCATGTGAAGAAGAAACACAATCAGATGGCTCACAAGTTGGCACACAAGGCTTGTATGAGTGGGAGTTCTGATAGTTGGACCCACTCTTTTCCTAGTTGGTTACTT
GATTTAAATGCTACTGATGTTGGTGGTGTTAATACCATTGGTGGGGGATCCTATCCCACAGGTATTATTCCATTGGTTCCTATTTTTTAA
Protein sequenceShow/hide protein sequence
MRVTDAERACVFHLREGTIDESKQKLENVILCKVFTSKTINPEMFCSKMPKIWSQEQTMIACVGFNLFLCKFKNACIKGKIIESGLWFFDKAMLLMEDPKGDSCGEETEF
RNTTMEIGSQLEKVEQIDLEDGTEQHWGISLRIKIQVEVSSPLKCEVTSMEGEEEVGEMVDEEIREKDIGNSVTKVGGRPVAPEHMLESSRKVEKNQTPARKVTDCLSEK
EKAKNPKQGNKLDENNGIEISSGSKDHTEGDKSPYAAEAHRLDLTPCLKLLYELPVGPNGPADQKLQRWHIDAVVKEKDWHWRFSGIYGNPNRDMHHTTWTLMNRLKEGC
SLPWIVGGDFNEITNQNEKKGGMTKAELDMQEFRDIIDECALHDPGYIGPEFTWCNNHVNGQMIWERLDRLLLTHDMQEKCSLFKVFHLSWIASDHRPILAEWFEDKRVK
RKWEIKRPRRFEEMWVKYEECKDIVRRVWQESGDRMPGNIMNKTRDCLERLGNWSRDRRETTSHLFWECKTTKGLWPKYFHPTDLVCLNDRKNWAAKDYLENVWKKSESE
ESETNKMSKSLILCWQIWDYRNKLIHNKTRSDPTMLQSHIDKYMEELQGRGEIYQDISHEASTAAIGPTWLRPPNGMWKINCDAAWSEDHQRGGIGWIFRQWNGNLIYAG
CRTISRPWKIPWLEAMAVCEGIRLLPTDSPPVQIESDALKVINLLIGKDEDETELKLSTEEVKSLRSGRNIEGFYHVKKKHNQMAHKLAHKACMSGSSDSWTHSFPSWLL
DLNATDVGGVNTIGGGSYPTGIIPLVPIF