; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025477 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025477
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr10:13594333..13595822
RNA-Seq ExpressionLag0025477
SyntenyLag0025477
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]3.7e-7934.34Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD
        +C SKF GGLGFRDLE FN+ALLAKQ WR++  P SL+ R+ + +Y     FLEA   +N SF+W+SL WG+ELL  G+RWRVGNG SI V  D W+P  
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD

Query:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNS---DQIIQWWR
        +  +++    +  +T V  L    G WNV  + D+  +++  A L IP       D LIWHYE+NGMYSVKSGYRLAC  K+  S   S   D   ++W+
Subjt:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNS---DQIIQWWR

Query:  FLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHK---STSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDF
         +WA +IP+K+K F WR   DFLP    L  R +     CP CH+   S  HA+W C+  +  W+NS + ++       S  ++    + + S +    F
Subjt:  FLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHK---STSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDF

Query:  LGMCWWAWCRRNKEVF--------------------------------GPLCGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFA
          +CW  W RRN  +F                                G     +  +  W PP A  YK+N D A  +   +  +G ++R+  GE M A
Subjt:  LGMCWWAWCRRNKEVF--------------------------------GPLCGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFA

Query:  WMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVV-GLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA
         ++++         E M   + L  AI+ GF    +E D+   +  +L +  C+  + G+L+EE+  +    +        RS N +AH +A  A
Subjt:  WMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVV-GLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]3.3e-8035.83Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD
        M  +K RGGLGFRDL  FN+AL+AKQGWR++  PNSL+ RV+K +Y+K+ +F  A+  SN SF+W+S+LWG ++++ GVRWR+G+G  + V KD WIPR 
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD

Query:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNSDQIIQWWRFLW
         T + I  + +   T V  L++ +  W V+ +     +ED  A+L I  P    +DE++WH++K G YSVKSGY+LA +        +S+   + W+  W
Subjt:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNSDQIIQWWRFLW

Query:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIP---WPQSARSAADVLWWCKSNMSAKVFEDF
           +P KVKIF WR   + LPT  NL KR    +  C RC    ++ SH + ECK  R+ W  +P    P     Q   SA   +W   S  S    E  
Subjt:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIP---WPQSARSAADVLWWCKSNMSAKVFEDF

Query:  LGMCWWAWCRRNKEVF---------------------------GPLCGGEREIV---KWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWM
        +  CW  W  RNK +F                           G + G +   +   KW PP     KLN DAA     +   LGAI+RD  G+++   +
Subjt:  LGMCWWAWCRRNKEVF---------------------------GPLCGGEREIV---KWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWM

Query:  KKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAALR
        K+  F + V + EA  +   L VA +     + VESD   VV LL +    R+E+  ++ ++R+ SK  +   FS   R+ N  AH +A  ALR
Subjt:  KKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAALR

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.0e-8137.53Show/hide
Query:  KFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLR
        K  GG+GFRDLELFNKALLAKQ WR++ +PNS+L RVLKG+YFKDCSF+EA+   N S++W+S+LWGR+LL+ G+RWR+GNG S+ +  DNW+P   TL+
Subjt:  KFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLR

Query:  VIRREGIDPATKVDALL-NPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLAC---SIKENASSSNSDQIIQWWRFLW
        ++    +   ++V +L+ + +G W  + V D    ++A  +L IP  R   +D LIW+YEK G+YSV+SGY++A       +  SSS+S+++  WW   W
Subjt:  VIRREGIDPATKVDALL-NPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLAC---SIKENASSSNSDQIIQWWRFLW

Query:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHKS---TSHAIWECKRVRRQWQN------SPFCSIPWPQSARSAAD------VLWWCKSN
           IP+K+K+F WRL  D LPT  NL KRGV+I   C  C ++   + H  W CK     W N      SPF  +     + S AD      V+W   + 
Subjt:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHKS---TSHAIWECKRVRRQWQN------SPFCSIPWPQSARSAAD------VLWWCKSN

Query:  MSAKVFEDF------LGMCWWAWCRRNKEVF-----GPLCGGEREI--VKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQD
         +A+ F D       +GM    W  +    F      P+ G       + W PP    YK+NTDA+     + + LG II ++ G+VM A  K +  +Q 
Subjt:  MSAKVFEDF------LGMCWWAWCRRNKEVF-----GPLCGGEREI--VKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQD

Query:  VDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAAL
        VD+ EA+   + L +A E G     +E               D SE G +V + +         SF+   R  N  AH +A  AL
Subjt:  VDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAAL

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]2.1e-8236.64Show/hide
Query:  SKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTL
        SK RGG+GFRDL  FN+AL+AKQGWR++  P+SL+ RVLK +YFK   F+ A   S  SFVW+S++WGR++L  G RWR+GNG ++ V  +NWIPR TT 
Subjt:  SKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTL

Query:  RVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIK--ENASSSNSDQIIQWWRFLWA
        + I    +   T V  L++    W  + ++     EDA A++ IP P+R  +D+LIWHY+K G YSVKSGY++A  IK  E+ S SN DQ +  WRF+W 
Subjt:  RVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIK--ENASSSNSDQIIQWWRFLWA

Query:  RQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLW----WCKSNMSAKVFEDF
          IP KVKIF WR  HD LPT  NL K+ V  +  C  CH   ++ SHA+ EC R R+ W+   + ++          D++W    W + +   +  E  
Subjt:  RQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLW----WCKSNMSAKVFEDF

Query:  LGMCWWAWCRRNKEVF-----GPL-------------------------CGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWM
          + W  W  RNK +F      PL                          G      +WSPP     K+N DAA D + +++ LG ++RD  G    A +
Subjt:  LGMCWWAWCRRNKEVF-----GPL-------------------------CGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWM

Query:  KKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAALR
        K +     V + EA  +   L VA +        ESDS  V+ L+       +E+G L+ +I++  +  Q        R  N  AH +A  AL+
Subjt:  KKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAALR

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]9.0e-7834Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD
        M  +K +GG+GFRD   FN+ALLAKQGWR+   P+SL+ RVL+ +YF   +FL A+  SN S++W+S+LWGR+++  G RWR+GNG  +++ K NWIP+ 
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD

Query:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNSDQIIQWWRFLW
         T + + +  +     V  L+N +  W+   +     + DA  +  IP PRR+ +DELIWH+ K+G Y+VKSGY+ A  I+  A  S+S+     W  +W
Subjt:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNSDQIIQWWRFLW

Query:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDFLGM
        +  +P K++IF WR   + LP+  NL KR +  +  C  C    ++  HA+ +CK  ++ W+ S F         +    +L   K   S    + F  M
Subjt:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDFLGM

Query:  CWWAWCRRNKEVF---------------------------GPLCGGEREIVK---WSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKV
         W  W  RN+ +F                             +  G+++ V    W+PP     K+NTDAAT+++  L+ LGA+IRDE G+V    +K  
Subjt:  CWWAWCRRNKEVF---------------------------GPLCGGEREIVK---WSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKV

Query:  PFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAALRTGFERC
         F   V   EA  +   L VA +   + V +ESDS  VV L+ +    RSE+  +V EI+++ +     S     RS N +AH +   AL    E+C
Subjt:  PFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAALRTGFERC

TrEMBL top hitse value%identityAlignment
A0A2N9EYC3 Reverse transcriptase domain-containing protein1.8e-7934.67Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD
        +C SK  GG+G RDL +FN+ALLAKQ WR++ NP+SL  +V K KYF  CS LE +  +  S+ W+S+L  R+L+  G  WRVG G  I +  D W+   
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD

Query:  TTLRVIRREGIDPA-TKVDALLNPD-GAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIK--ENASSSNSDQIIQWW
           R+I    ++ + + V+ L++ D  +W    V +L   ++A  +LGIP   R   D L+W   K G+Y+V+SGY L  + +  +    S++ ++ Q W
Subjt:  TTLRVIRREGIDPA-TKVDALLNPD-GAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIK--ENASSSNSDQIIQWW

Query:  RFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPWPQ-----SARSAADVLWWCKSNMSA
        + +W+ QIPSK + F WR  H  LPT  NL  R +     C  C    +ST HA+W+CK +   W      SIPW Q     S     D+++ C   +S 
Subjt:  RFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPWPQ-----SARSAADVLWWCKSNMSA

Query:  KVFEDFLGMCWWAWCRRNK-------EVFGPLCGGERE-----------------------IVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGE
          F+ F  +CW  W RRN+       +    L    RE                       ++KW PP    YK+N D A  ND   + +G IIR+  GE
Subjt:  KVFEDFLGMCWWAWCRRNK-------EVFGPLCGGERE-----------------------IVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGE

Query:  VMFAWMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVV-GLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA
        VM A  +++P+   V+  EA   R ++  A + GF  +++E DS  +V  +L +  C  +  G ++E+IRQI++GLQ   F    R  NVMAH +A  A
Subjt:  VMFAWMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVV-GLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA

A0A2N9F2A9 RNase H domain-containing protein1.6e-8036.67Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD
        +C  K RGG+G RDL  FN+ALLAKQ WR++ NP+SLL +V K KYF  CS LEA +    S+ W+S+L  R+L+  G  WRVG+G+ + +  D W+P  
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD

Query:  TTLRVIRREGIDPA-TKVDALLNPD-GAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYR--LACSIKENASSSNSDQIIQWW
            +I       + + V  L++PD   W    V +L    +A A+LGIP   R + D L+W   KNG+YSV+SGY+  L  S +E+  SS+  ++ Q W
Subjt:  TTLRVIRREGIDPA-TKVDALLNPD-GAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYR--LACSIKENASSSNSDQIIQWW

Query:  RFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRC---HKSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFED
        + +W+  +P K++ F WR  H+ LPT+ NL  R +     C  C    +ST HA+W+CK V+  WQ+  + S     +     D+L  C S +S    + 
Subjt:  RFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRC---HKSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFED

Query:  FLGMCWWAWCRRNK-EVFGPL-----CGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQDVDVPEAMVVRDSLLV
        F  + W  W RRN+  +  P        G  E+VKW+PP   SYK+N D A  +D   + +G I+R+  GEVM +   ++PF   V+  EA   R ++  
Subjt:  FLGMCWWAWCRRNK-EVFGPL-----CGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQDVDVPEAMVVRDSLLV

Query:  AIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA
        A + GF +  +E DS  VV  L       +  G ++++I+QI++ LQ   F    R  NVMAH +A  A
Subjt:  AIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA

A0A5E4FZN9 PREDICTED: retrotransposon1.8e-7934.34Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD
        +C SKF GGLGFRDLE FN+ALLAKQ WR++  P SL+ R+ + +Y     FLEA   +N SF+W+SL WG+ELL  G+RWRVGNG SI V  D W+P  
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD

Query:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNS---DQIIQWWR
        +  +++    +  +T V  L    G WNV  + D+  +++  A L IP       D LIWHYE+NGMYSVKSGYRLAC  K+  S   S   D   ++W+
Subjt:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNS---DQIIQWWR

Query:  FLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHK---STSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDF
         +WA +IP+K+K F WR   DFLP    L  R +     CP CH+   S  HA+W C+  +  W+NS + ++       S  ++    + + S +    F
Subjt:  FLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHK---STSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDF

Query:  LGMCWWAWCRRNKEVF--------------------------------GPLCGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFA
          +CW  W RRN  +F                                G     +  +  W PP A  YK+N D A  +   +  +G ++R+  GE M A
Subjt:  LGMCWWAWCRRNKEVF--------------------------------GPLCGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFA

Query:  WMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVV-GLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA
         ++++         E M   + L  AI+ GF    +E D+   +  +L +  C+  + G+L+EE+  +    +        RS N +AH +A  A
Subjt:  WMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVV-GLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAA

A0A6J1DAR4 uncharacterized protein LOC1110189545.0e-8237.53Show/hide
Query:  KFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLR
        K  GG+GFRDLELFNKALLAKQ WR++ +PNS+L RVLKG+YFKDCSF+EA+   N S++W+S+LWGR+LL+ G+RWR+GNG S+ +  DNW+P   TL+
Subjt:  KFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLR

Query:  VIRREGIDPATKVDALL-NPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLAC---SIKENASSSNSDQIIQWWRFLW
        ++    +   ++V +L+ + +G W  + V D    ++A  +L IP  R   +D LIW+YEK G+YSV+SGY++A       +  SSS+S+++  WW   W
Subjt:  VIRREGIDPATKVDALL-NPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLAC---SIKENASSSNSDQIIQWWRFLW

Query:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHKS---TSHAIWECKRVRRQWQN------SPFCSIPWPQSARSAAD------VLWWCKSN
           IP+K+K+F WRL  D LPT  NL KRGV+I   C  C ++   + H  W CK     W N      SPF  +     + S AD      V+W   + 
Subjt:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCHKS---TSHAIWECKRVRRQWQN------SPFCSIPWPQSARSAAD------VLWWCKSN

Query:  MSAKVFEDF------LGMCWWAWCRRNKEVF-----GPLCGGEREI--VKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQD
         +A+ F D       +GM    W  +    F      P+ G       + W PP    YK+NTDA+     + + LG II ++ G+VM A  K +  +Q 
Subjt:  MSAKVFEDF------LGMCWWAWCRRNKEVF-----GPLCGGEREI--VKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQD

Query:  VDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAAL
        VD+ EA+   + L +A E G     +E               D SE G +V + +         SF+   R  N  AH +A  AL
Subjt:  VDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVAMAAL

A0A803QPB7 Uncharacterized protein3.3e-7834.58Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD
        +C  K  GG+GFRDL+ FN++LLAKQGW++I NP+ LL +VLK  YF   SF EA+     S VW+ +LWGRELLR G RW VGNGS I + +D W+PR 
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRD

Query:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNSDQIIQWWRFLW
            +  +  I     +++LL PDG W  N V      +D   VLGI +P     D + W    NG+YSV SGY+L       A  SN  QI  WW+F+W
Subjt:  TTLRVIRREGIDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNSDQIIQWWRFLW

Query:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRC---HKSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDFLGM
           +  K+K F WR+++ ++PT++ L KRG+ I   C  C   ++   HA+W C +V+  W+   F  +  P + + AADVLWW   ++  + F  F+G+
Subjt:  ARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRC---HKSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDFLGM

Query:  CWWAWCRRNKEVFGPLCGGEREIVKWS---------------------------PPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFV
         W  W RRN  VF      E+    W+                           PP    + +NTDA+     +   L A+IR   G ++ A    +P  
Subjt:  CWWAWCRRNKEVFGPLCGGEREIVKWS---------------------------PPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFV

Query:  QDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVA
          V + EA  V   + +AI       +V SD+  ++  L S+    ++ G LV++I+ +    Q   F   +RS N +A+ +A
Subjt:  QDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVA

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.7e-2625.1Show/hide
Query:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKY----FKDCSFLEARSRSNSSFVWQSLLWG-RELLRAGVRWRVGNGSSINVLKDN
        +C  K  GGLG R  +  N+AL++K GWR++   NSL   VL+ KY     +D  +L  +   +S+  W+S+  G R+++  GV W  G+G  I    D 
Subjt:  MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKY----FKDCSFLEARSRSNSSFVWQSLLWG-RELLRAGVRWRVGNGSSINVLKDN

Query:  WIPRDTTLRVIRREGIDPATKVDA--LLNPDGAWNVNSVMDLLGE----EDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNS
        W+     L +   E       V A  L  P   W+   +          E    VL +    R   D L W + ++G +SV+S Y +             
Subjt:  WIPRDTTLRVIRREGIDPATKVDA--LLNPDGAWNVNSVMDLLGE----EDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNS

Query:  DQIIQWWRFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPW-PQSARSAADVLWWCKSN
          +  ++  LW  ++P +VK F W + +  + TE    +R +     C  C    +S  H + +C      W       +P   Q    +  +  W   N
Subjt:  DQIIQWWRFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGGCPRCH---KSTSHAIWECKRVRRQWQNSPFCSIPW-PQSARSAADVLWWCKSN

Query:  MSAKV-FED------FLGMCWWAWCRRNKEVFG--PLCGGEREIVK----------------------------WSPPVAPSYKLNTDAATDNDLKLSSL
        +  +   ED      F  + WW W  R   +FG    C    + VK                            W  P     K+NTD A+  +  L+S 
Subjt:  MSAKV-FED------FLGMCWWAWCRRNKEVFG--PLCGGEREIVK----------------------------WSPPVAPSYKLNTDAATDNDLKLSSL

Query:  GAIIRDEGGEVMFAWMKKVPF-VQDVDVPEAMV--VRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLV
        G ++RD  G    AW       +     P+A +  V   L  A E    +VE+E DS  +VG LK+   D   +  LV
Subjt:  GAIIRDEGGEVMFAWMKKVPF-VQDVDVPEAMV--VRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLV

P93295 Uncharacterized mitochondrial protein AtMg003101.4e-1745.19Show/hide
Query:  MCVSK-FRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPR
        +C SK   GGLGFRDL  FN+ALLAKQ +R+I  P++LL R+L+ +YF   S +E    +  S+ W+S++ GRELL  G+   +G+G    V  D WI  
Subjt:  MCVSK-FRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPR

Query:  DTTL
        +T L
Subjt:  DTTL

Arabidopsis top hitse value%identityAlignment
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1)1.4e-0431.52Show/hide
Query:  VLKDNWIPRDTTLRVIRREGI----DPATKVDALLNPD-GAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLA
        V KD WIP   T+     + I    D    V+ L++ +   W ++ +  L+   D   +LGI   R  + D   W + K+G Y+VKSGY +A
Subjt:  VLKDNWIPRDTTLRVIRREGI----DPATKVDALLNPD-GAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLA

AT3G09510.1 Ribonuclease H-like superfamily protein5.3e-2824.67Show/hide
Query:  LKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLRVIRREGIDPATKVDALLNPDGA---WNVNSVMDLLGE
        +K +YFKD S L+A+ R   S+ W SLL G  LL+ G R  +G+G +I +  DN +      R +  E       ++ L    G+   W+ + +   + +
Subjt:  LKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLRVIRREGIDPATKVDALLNPDGA---WNVNSVMDLLGE

Query:  EDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGY-RLACSIKENASSSNSDQ-IIQWWRFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGG
         D G +  I   +    D++IW+Y   G Y+V+SGY  L      N  + N     I     +W   I  K+K F WR     L T   L  RG+ I   
Subjt:  EDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGY-RLACSIKENASSSNSDQ-IIQWWRFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGG

Query:  CPRCHK---STSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDF-----LGMCWWAWCRRNKEVF------------------
        CPRCH+   S +HA++ C      W+ S    I     +    + +    + +      DF     + + W  W  RN  VF                  
Subjt:  CPRCHK---STSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDF-----LGMCWWAWCRRNKEVF------------------

Query:  --------------GPLCGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQV
                       P        ++W  P A   K N DA  D     ++ G IIR+  G  +     K+    +    E   +  +L      G+ QV
Subjt:  --------------GPLCGGEREIVKWSPPVAPSYKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQV

Query:  EVESDSARVVGLLKSNGCD-RSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVA
         +E D   ++ L+  NG    S +   +E+I   +       F    R  N +AH +A
Subjt:  EVESDSARVVGLLKSNGCD-RSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAHKVA

AT4G29090.1 Ribonuclease H-like superfamily protein3.3e-4626.82Show/hide
Query:  KFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLR
        K  GG+GF+D+E FN ALL KQ WRM+  P SL+ +V K +YF     L A   S  SFVW+S+   +E+LR G R  VGNG  I + +  W+       
Subjt:  KFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLR

Query:  VIRREGIDP--------ATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRP-RRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSS---NSDQI
         +R + + P          KV  L++  G      V+++L  E    ++G  RP  R + D   W Y  +G Y+VKSGY +   I    SS    +   +
Subjt:  VIRREGIDP--------ATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRP-RRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSS---NSDQI

Query:  IQWWRFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGG---CPRCHKSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADV---LWWC----
           ++ +W  Q   K++ F W+   + LP    L  R +  +     CP C ++ +H +++C   R  W  S   SIP P     A  +   L+W     
Subjt:  IQWWRFLWARQIPSKVKIFCWRLYHDFLPTEVNLRKRGVDIKGG---CPRCHKSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADV---LWWC----

Query:  -KSNMSAKVFEDFLGMCWWAWCRRNKEVF--------------------------GPLCGGEREI-----VKWSPPVAPSYKLNTDAATDNDLKLSSLGA
          +    K  +    + W  W  RN+ VF                             CG + ++      +W PP     K NTDA  + D +   +G 
Subjt:  -KSNMSAKVFEDFLGMCWWAWCRRNKEVF--------------------------GPLCGGEREI-----VKWSPPVAPSYKLNTDAATDNDLKLSSLGA

Query:  IIRDEGGEVMFAWMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAH
        ++R+E GEV +   + +P ++ V   E   +R ++L      +  V  ESDS  ++ +L ++    S +   +++++++        F    R  N +A 
Subjt:  IIRDEGGEVMFAWMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSRCSRSRNVMAH

Query:  KVAMAAL
        +VA  +L
Subjt:  KVAMAAL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-1845.19Show/hide
Query:  MCVSK-FRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPR
        +C SK   GGLGFRDL  FN+ALLAKQ +R+I  P++LL R+L+ +YF   S +E    +  S+ W+S++ GRELL  G+   +G+G    V  D WI  
Subjt:  MCVSK-FRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPR

Query:  DTTL
        +T L
Subjt:  DTTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGTGTCCAAATTTAGGGGTGGTCTTGGGTTTAGGGACCTTGAGTTGTTTAATAAGGCGTTATTGGCGAAACAAGGGTGGAGGATGATCTGCAATCCTAACTCCTT
ACTATGTAGAGTTTTGAAAGGGAAGTATTTCAAGGACTGCTCGTTCTTGGAAGCTAGAAGCAGGAGTAATAGCTCCTTTGTCTGGCAGAGTTTGTTGTGGGGGCGGGAGT
TGTTGAGGGCGGGTGTGAGGTGGAGGGTGGGGAATGGAAGCTCAATTAATGTCTTAAAGGACAATTGGATCCCACGGGATACCACACTTCGGGTTATTAGGAGGGAAGGA
ATTGACCCAGCAACTAAGGTTGATGCATTGTTGAATCCAGATGGAGCATGGAATGTGAATTCTGTTATGGATTTGTTGGGGGAGGAGGATGCAGGGGCTGTCTTAGGAAT
CCCAAGGCCTAGGAGGATGGTTCAAGATGAACTTATATGGCATTACGAAAAGAATGGTATGTATTCAGTGAAGAGTGGGTACAGGTTGGCTTGTTCAATTAAGGAAAACG
CCAGCAGCTCAAATTCTGATCAAATAATTCAATGGTGGAGGTTTCTATGGGCTAGACAAATCCCAAGTAAGGTGAAGATATTTTGTTGGAGATTGTATCATGATTTCCTG
CCAACAGAGGTAAATTTGCGTAAGAGGGGAGTTGATATCAAAGGAGGGTGTCCTCGGTGCCACAAGTCGACCTCTCATGCTATATGGGAGTGCAAAAGGGTGAGGAGGCA
GTGGCAAAATTCACCTTTTTGCTCGATACCATGGCCTCAGTCTGCTCGTAGTGCTGCCGATGTTCTGTGGTGGTGCAAATCTAACATGTCCGCAAAGGTTTTTGAGGATT
TTTTGGGGATGTGCTGGTGGGCCTGGTGTAGGAGGAATAAGGAGGTGTTTGGTCCTCTGTGCGGTGGAGAGAGGGAAATCGTGAAATGGTCTCCACCAGTTGCCCCAAGT
TACAAACTCAACACAGATGCTGCCACAGATAATGATTTGAAATTAAGTAGCCTAGGAGCTATTATCAGAGATGAAGGTGGGGAGGTGATGTTTGCTTGGATGAAGAAAGT
TCCGTTCGTGCAAGACGTTGATGTCCCAGAGGCAATGGTTGTTCGAGATAGTCTGCTGGTGGCGATTGAGGGTGGTTTCCGGCAGGTGGAGGTGGAATCCGACTCGGCCC
GTGTGGTGGGGTTGCTCAAATCGAATGGTTGCGACAGATCTGAGGTGGGTATGCTTGTGGAGGAAATTCGCCAAATCTCCAAGGGTCTCCAGTTGTGTTCCTTTTCGCGG
TGCTCCAGGTCGAGGAACGTAATGGCTCATAAGGTTGCGATGGCGGCGCTGAGGACTGGGTTTGAACGGTGTGTGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGTGTCCAAATTTAGGGGTGGTCTTGGGTTTAGGGACCTTGAGTTGTTTAATAAGGCGTTATTGGCGAAACAAGGGTGGAGGATGATCTGCAATCCTAACTCCTT
ACTATGTAGAGTTTTGAAAGGGAAGTATTTCAAGGACTGCTCGTTCTTGGAAGCTAGAAGCAGGAGTAATAGCTCCTTTGTCTGGCAGAGTTTGTTGTGGGGGCGGGAGT
TGTTGAGGGCGGGTGTGAGGTGGAGGGTGGGGAATGGAAGCTCAATTAATGTCTTAAAGGACAATTGGATCCCACGGGATACCACACTTCGGGTTATTAGGAGGGAAGGA
ATTGACCCAGCAACTAAGGTTGATGCATTGTTGAATCCAGATGGAGCATGGAATGTGAATTCTGTTATGGATTTGTTGGGGGAGGAGGATGCAGGGGCTGTCTTAGGAAT
CCCAAGGCCTAGGAGGATGGTTCAAGATGAACTTATATGGCATTACGAAAAGAATGGTATGTATTCAGTGAAGAGTGGGTACAGGTTGGCTTGTTCAATTAAGGAAAACG
CCAGCAGCTCAAATTCTGATCAAATAATTCAATGGTGGAGGTTTCTATGGGCTAGACAAATCCCAAGTAAGGTGAAGATATTTTGTTGGAGATTGTATCATGATTTCCTG
CCAACAGAGGTAAATTTGCGTAAGAGGGGAGTTGATATCAAAGGAGGGTGTCCTCGGTGCCACAAGTCGACCTCTCATGCTATATGGGAGTGCAAAAGGGTGAGGAGGCA
GTGGCAAAATTCACCTTTTTGCTCGATACCATGGCCTCAGTCTGCTCGTAGTGCTGCCGATGTTCTGTGGTGGTGCAAATCTAACATGTCCGCAAAGGTTTTTGAGGATT
TTTTGGGGATGTGCTGGTGGGCCTGGTGTAGGAGGAATAAGGAGGTGTTTGGTCCTCTGTGCGGTGGAGAGAGGGAAATCGTGAAATGGTCTCCACCAGTTGCCCCAAGT
TACAAACTCAACACAGATGCTGCCACAGATAATGATTTGAAATTAAGTAGCCTAGGAGCTATTATCAGAGATGAAGGTGGGGAGGTGATGTTTGCTTGGATGAAGAAAGT
TCCGTTCGTGCAAGACGTTGATGTCCCAGAGGCAATGGTTGTTCGAGATAGTCTGCTGGTGGCGATTGAGGGTGGTTTCCGGCAGGTGGAGGTGGAATCCGACTCGGCCC
GTGTGGTGGGGTTGCTCAAATCGAATGGTTGCGACAGATCTGAGGTGGGTATGCTTGTGGAGGAAATTCGCCAAATCTCCAAGGGTCTCCAGTTGTGTTCCTTTTCGCGG
TGCTCCAGGTCGAGGAACGTAATGGCTCATAAGGTTGCGATGGCGGCGCTGAGGACTGGGTTTGAACGGTGTGTGGATTGA
Protein sequenceShow/hide protein sequence
MCVSKFRGGLGFRDLELFNKALLAKQGWRMICNPNSLLCRVLKGKYFKDCSFLEARSRSNSSFVWQSLLWGRELLRAGVRWRVGNGSSINVLKDNWIPRDTTLRVIRREG
IDPATKVDALLNPDGAWNVNSVMDLLGEEDAGAVLGIPRPRRMVQDELIWHYEKNGMYSVKSGYRLACSIKENASSSNSDQIIQWWRFLWARQIPSKVKIFCWRLYHDFL
PTEVNLRKRGVDIKGGCPRCHKSTSHAIWECKRVRRQWQNSPFCSIPWPQSARSAADVLWWCKSNMSAKVFEDFLGMCWWAWCRRNKEVFGPLCGGEREIVKWSPPVAPS
YKLNTDAATDNDLKLSSLGAIIRDEGGEVMFAWMKKVPFVQDVDVPEAMVVRDSLLVAIEGGFRQVEVESDSARVVGLLKSNGCDRSEVGMLVEEIRQISKGLQLCSFSR
CSRSRNVMAHKVAMAALRTGFERCVD