; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000055 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000055
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold6:22995725..23006781
RNA-Seq ExpressionSpg000055
SyntenySpg000055
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4268750.1 unnamed protein product [Prunus armeniaca]9.7e-5539.82Show/hide
Query:  GLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANS
        G YG P +  R +SW+L+ RL  ++   W+  GD NEIL  DEK+GG +R   Q+  FR A+D+C   DLGFSG KFTW   R+   ++ +RLDR +A +
Subjt:  GLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANS

Query:  NFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSWACNSEIPSLPLA----LKNCASALGGWGFCQNRR
        ++C  F   +V +L+  KSDH PI ++++  +      R R  F+FEE W  HE+C   IR    W    E  S P A    LK     L GW       
Subjt:  NFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSWACNSEIPSLPLA----LKNCASALGGWGFCQNRR

Query:  LKTNIREVRDKIKMSYES-ASPINFEVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFV
        L   I+ ++ K+    E+  +P   E    L   LDSL  + EIYW+QRSR  WLK GDRNTK+FH KA+ R+RRN I G+ED  GIWQT +  + +T V
Subjt:  LKTNIREVRDKIKMSYES-ASPINFEVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFV

Query:  NYFTSIFSSG------QCVNVQKGEYTVKSGYKL
        +YF  +FSS       +  +V +G  + +   KL
Subjt:  NYFTSIFSSG------QCVNVQKGEYTVKSGYKL

EPS72636.1 hypothetical protein M569_02121, partial [Genlisea aurea]5.0e-5930.33Show/hide
Query:  MVERIGNVVGVFEDVD-SRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIPQYGD
        + E +GN +G F++ D  RNGF      L++RV I+   PL+R I +  +   S + +P  YERL   C +CG + H+ +DC+ +S    G GS PQ+G 
Subjt:  MVERIGNVVGVFEDVD-SRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIPQYGD

Query:  CLRFSRKGMVLNHFTA-REIGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSLKGKEKV-VEVESKYRGGFFAGLW---MPEKSGRNN--LS
         LR       L  F A R +  E        N+S+     +       ++  FS G     +G  +V  +   K     FAG+      E SG +N  L 
Subjt:  CLRFSRKGMVLNHFTA-REIGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSLKGKEKV-VEVESKYRGGFFAGLW---MPEKSGRNN--LS

Query:  VPEEAGQQD--------PMPVTVTTKEPVKLKQPFGWRIDEGPSNYERE---SDPEMDEEL--GPLGSDGKEWIGTNEELSLKADTHENLKEYSDRDDKS
         P     Q+        PM VT+ ++    L     +     P +  R    S P    E   G  G     W+G     S+  D    +K +S      
Subjt:  VPEEAGQQD--------PMPVTVTTKEPVKLKQPFGWRIDEGPSNYERE---SDPEMDEEL--GPLGSDGKEWIGTNEELSLKADTHENLKEYSDRDDKS

Query:  VDMKEKAWMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDE
                             T   D   +P            P W+          G YG+P    R+ SW+L+ RL+      W++ GD NE+LWQDE
Subjt:  VDMKEKAWMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDE

Query:  KSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRT
              R  + +  FR AL++C+L DLGF G  FTW N R   + V  RLDRFVAN+++ N+  +F V +L +  SDH PI L     V      RR+R 
Subjt:  KSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRT

Query:  FKFEEWWTCHEDCGNIIRRAGSWAC--NSEIPSLPL--ALKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESA-SPINFEVIHGLERHLDSLQLEEE
        FKFE+ W  +E C  II   G WA   +S  P L L   L+NC   L  W       L+  I  ++D++    E   S    + I  L+  L  L   +E
Subjt:  FKFEEWWTCHEDCGNIIRRAGSWAC--NSEIPSLPL--ALKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESA-SPINFEVIHGLERHLDSLQLEEE

Query:  IYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS
        I+WKQRS+ +WL+ GD+N K+FH  A+ R+RRN I  ++    IW    + +H  F++ +  +F S
Subjt:  IYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS

KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis]1.8e-5625.82Show/hide
Query:  SSTLVDEWARLSLTEEEEDISVTVDREAVDRTGQLLE---SCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQ---------------------------
        +  L + +A LSLTE+E + +V+V+   + R G +LE   +CL+ +LL  +    E  ++  R  W+  +G++                           
Subjt:  SSTLVDEWARLSLTEEEEDISVTVDREAVDRTGQLLE---SCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQ---------------------------

Query:  --VDSHLII-------------------------NFPIRLFGSMVER-IGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLW
           D HL++                         + P+      + R +G  +G   +VD  NG + WG  +R+RV I++S+PL R   +  +  +S  W
Subjt:  --VDSHLII-------------------------NFPIRLFGSMVER-IGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLW

Query:  VPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLRFSRKGMVLNHFTAREIGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPN
        V F YERL ++C  CGI+GH  ++C     ++  S  +P YG  LR       LN  + R  GR +         S        +AT  P         N
Subjt:  VPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLRFSRKGMVLNHFTAREIGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPN

Query:  VSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSV-------PEEAGQQDPMPVTVTTKE----------------------PVKLKQPFGWRIDEGP
            G E V E   + +     G+ +P   G +++          E A  +  M V    KE                      PV         +D GP
Subjt:  VSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSV-------PEEAGQQDPMPVTVTTKE----------------------PVKLKQPFGWRIDEGP

Query:  SNYERESDPEMDEELGPLGSDGKEWIGTNEELSLKADTHEN-----------------LKEYSDRDDKSVDMKEK---------------AWMGR-----
        S+               +     + +G+     ++ + +E                  L     R  K + + E                +W+ R     
Subjt:  SNYERESDPEMDEELGPLGSDGKEWIGTNEELSLKADTHEN-----------------LKEYSDRDDKSVDMKEK---------------AWMGR-----

Query:  ----LGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVP--------------------RGLYGHPDSNLRTQSWNLIRRLYDSHEAAWV
             G       E+ W       +   +    G    WK   H  +V                      G+YGHP+S  R++ W L++ L       W+
Subjt:  ----LGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVP--------------------RGLYGHPDSNLRTQSWNLIRRLYDSHEAAWV

Query:  IEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAG
        + GD NEIL   EK GG  R  +Q+  FR  L DC+L DLG+ G  FTW NRR     V  RLDRF+ANS +C ++ N +V +   A SDH P+ L   G
Subjt:  IEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAG

Query:  EVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSWACNS-EIPSLPLALKNCASALGGW-----GFCQ----NRRLKTNIREVRDKIKMSYESASPINF
         + R    R +R F+FE  W   ++C +II RA      S  +  +   +  CA  LG W     G  Q    N + K    E  D   +S E       
Subjt:  EVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSWACNS-EIPSLPLALKNCASALGGW-----GFCQ----NRRLKTNIREVRDKIKMSYESASPINF

Query:  EVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQ---TEKAIVHDTFVNYFTS
        EV   LER        +E+ WKQRSR  WL+ GD N+++FH KA+ RRR+N I+ ++D  G WQ      A++ + F N FT+
Subjt:  EVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQ---TEKAIVHDTFVNYFTS

XP_035545013.1 uncharacterized protein LOC108979776 [Juglans regia]1.3e-6427.42Show/hide
Query:  RLSLTE-EEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLII--
        +LSLTE E E + V VD + +  T    E CL+ +LL  R       ++  R  W+   GL++                             D HL++  
Subjt:  RLSLTE-EEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLII--

Query:  ----------------NFPIRLFG--------SMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEI
                        +F +R+           + + IG  +G  ED+D  +G + WG  +RIRV ID+++ L RG  +   G  +  WV F YE+L + 
Subjt:  ----------------NFPIRLFG--------SMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEI

Query:  CSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLRFSRKGMVLNHFTARE--------IGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSL
        C IC  IGH  RDC Q+   +      P YG  LR   +G+ +     R         +       G+++    ++   +R+  V+P      G      
Subjt:  CSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLRFSRKGMVLNHFTARE--------IGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSL

Query:  KGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSDGKEWIGTNEEL
            + VE   +   GF + +        N L    E G Q P+ +TV   +     QP       GPS+    + P     +G   +  +E +G     
Subjt:  KGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSDGKEWIGTNEEL

Query:  SLKADTHENLKEYSDRDDKSVDMKEKAWMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLY-
          +  T E+      ++ K    K ++  G L +    D +      + N     +     R   W+          G+YG+P +  R  +W+LIR+L+ 
Subjt:  SLKADTHENLKEYSDRDDKSVDMKEKAWMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLY-

Query:  -DSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDH
         D  +  W++ GD NE+L   EK  G +R  NQ+ AFRE L DC+L D+GF G KFTW N R+    +S RLDRF+ N++F  LF    V +   A SDH
Subjt:  -DSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDH

Query:  RPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIR---RAGSWACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIK----MSYES
         PI     G + +G   +R R F+FE  W   + C +II      G+   N+ +  +   +K C   L  W       +K    E R +++     + E 
Subjt:  RPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIR---RAGSWACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIK----MSYES

Query:  ASPINFEVIHGLERHLDSLQL---EEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQ
          P+      GL +  ++LQ+    EE+ W+QRSR  WL  GD+NT++FH +A +RRRRN I G+ + QG W  E  + ++  + +F ++FS+ +    Q
Subjt:  ASPINFEVIHGLERHLDSLQL---EEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQ

Query:  KGEYTV
        +G+  V
Subjt:  KGEYTV

XP_042958006.1 uncharacterized protein LOC122293492 [Carya illinoinensis]1.3e-5428.85Show/hide
Query:  ERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLR
        E+IG+ VG  E VD +     WG  LR+++ IDL++PL RG  +   G    +W+PF YE++  IC +CG I H   +C +     GG     QYG  LR
Subjt:  ERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLR

Query:  FS---RKGMVLNHFTAREIG------RESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRN----
         +   R    +N     E G      +E   +G     S+ E+  R        + +         KG EK  E E     G    +    + G      
Subjt:  FS---RKGMVLNHFTAREIG------RESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRN----

Query:  --NLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSDGKEWIGTNEELSLKADTHENLKEYSDRDDKSVDMKEKAW
          +L+  EE  +     V V  +   K      W  +EG    +R    ++ ++L  +    K  +    E  L+    E++K    R   S        
Subjt:  --NLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSDGKEWIGTNEELSLKADTHENLKEYSDRDDKSVDMKEKAW

Query:  MGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSW-KKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSR
        +G+ G  +       W    +    N+     G   +W  + A +  +    YG P++  R  +W++++ L    +  W+I GD NEIL +DEK GG +R
Subjt:  MGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSW-KKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSR

Query:  DYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRR--RTFKFEE
          +Q+  FRE + + +L DLG+ G KFT  N          RLDR VANS +  ++    V  L    SDH+P+ + L  +  RG    R+  + FK+E 
Subjt:  DYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRR--RTFKFEE

Query:  WWTCHEDCGNIIRRAGSWACNSEIPSLPLA-LKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYE---SASPINFEVIHGLERHLDSLQLEEEIYWKQR
         W   ++C  ++RRA  W  N  +    +  L N   A   W   + +R   N +EV +K K+        S  N E I  L   L +L  +E+++WKQR
Subjt:  WWTCHEDCGNIIRRAGSWACNSEIPSLPLA-LKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYE---SASPINFEVIHGLERHLDSLQLEEEIYWKQR

Query:  SRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQ
        ++ NW K GDRNTK+FH  A  RR+RN I  VED           V + F  YF ++F S Q
Subjt:  SRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQ

TrEMBL top hitse value%identityAlignment
A0A2N9ELB0 Uncharacterized protein2.5e-6427.8Show/hide
Query:  VDEWARLSLTEEEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHL
        ++E  R     + E   + + RE V R+ Q  +  +L +LL  RP   E ++ + RA W    G+ V                             D  L
Subjt:  VDEWARLSLTEEEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHL

Query:  -------------------------IINFPIRLFGSMV-ERIGNVVGVFEDVD--SRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRY
                                 I+N PI+     V E IG  VG   DVD   + G + WG  LRIRV +DL+ PL RG  +  +     +WV FRY
Subjt:  -------------------------IINFPIRLFGSMV-ERIGNVVGVFEDVD--SRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRY

Query:  ERLLEICSICGIIGHVTRDCIQSSRSEGGSGSI----PQYGDCLRF--------SRKGMVLNHFTAREIGRES---PQRGIIINESSDEQIQRRSATVSP
        E L   C  CGIIGH   +CI      GGS +      QYG  LR          ++   +N  ++    ++S   PQ+ +  N++  + +    +  + 
Subjt:  ERLLEICSICGIIGHVTRDCIQSSRSEGGSGSI----PQYGDCLRF--------SRKGMVLNHFTAREIGRES---PQRGIIINESSDEQIQRRSATVSP

Query:  ALYSFSGGPNVSLKGKEKVVEVESKYRGGFFAGLWMP---EKSGRNNLSVPEEAGQQDP---------MPVTVTTKEPVKLKQPFGWRIDEGPSNYERES
        A+ S +  P + +      V V  K        L +P   +   + N+    +  Q  P         MP T     P+++    G  I+   +N  R S
Subjt:  ALYSFSGGPNVSLKGKEKVVEVESKYRGGFFAGLWMP---EKSGRNNLSVPEEAGQQDP---------MPVTVTTKEPVKLKQPFGWRIDEGPSNYERES

Query:  DPEMDEELGPLG----SDGKEWIGTNEE-----LSLKA-----------DTHENLKEYSDRDDKS--------VDMKEKAWMG-RLGMTMSDDKETE---
          E   ELGP      S G + I +N E     + +KA           +T   L     ++D S        ++++   ++  RLGM      E     
Subjt:  DPEMDEELGPLG----SDGKEWIGTNEE-----LSLKA-----------DTHENLKEYSDRDDKS--------VDMKEKAWMG-RLGMTMSDDKETE---

Query:  ------WGDMTDNPTPNFIPN------ILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQ
              W    +    ++  +      I   G  W+          G YGHP+  LR  SW+L+R L+ +    W++ GD NEI   DEK G   R   Q
Subjt:  ------WGDMTDNPTPNFIPN------ILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQ

Query:  ILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHE
        + AFRE+L DC+L DLG+ G  FTW NRR+ G  V +RLDR VAN  + +LF   QV ++  A SDH  + +      N     R+++ F+FE  W    
Subjt:  ILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHE

Query:  DCGNIIRRAGSWACNSEIPSLPL-----ALKNCASALGGWGFCQNRRLKTNIREVRDK-IKMSYESASPINFEVIHGLERHLDSLQLEEEIYWKQRSREN
         C   I+ A  W  NS     P+      +K C   L  W   Q R     I E + + +++        N   ++ L R L+ L  +EE++W+QRSR +
Subjt:  DCGNIIRRAGSWACNSEIPSLPL-----ALKNCASALGGWGFCQNRRLKTNIREVRDK-IKMSYESASPINFEVIHGLERHLDSLQLEEEIYWKQRSREN

Query:  WLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS
        WLK GDRNT++FH+ A+ R++ N ILG+ D  G+W ++  +++   V+YF ++F+S
Subjt:  WLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS

A0A2N9HK89 Uncharacterized protein3.2e-6427.9Show/hide
Query:  WARLSLTEEEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLIIN
        W R SL E E       DR  +  + Q+    L  +    R +  E + R F+  W+ ++G  V                             D +LI N
Subjt:  WARLSLTEEEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLIIN

Query:  FPI-RLFGSMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGS
         PI  L   + E +G+ +G        +  L  G  +RIRV++D+++PL RG  I       G W  F+YERL   C  CG++ H  +DC    R+    
Subjt:  FPI-RLFGSMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGS

Query:  GSIPQ-YGDCLRFSRKGMVLNHFTAREIGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRNN
        G   Q YG  ++                G E P R                                        VE+  + R   F         GR  
Subjt:  GSIPQ-YGDCLRFSRKGMVLNHFTAREIGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRNN

Query:  LSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSD-GKEWIGTNEELSLKADTHENLK---EYSDRDDKSVDMKEKA
         + PEE GQQ   PVT  TK             D G +N E  + P ++ ++    +D   +    +E L L  +  EN       SD  D  VD     
Subjt:  LSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSD-GKEWIGTNEELSLKADTHENLK---EYSDRDDKSVDMKEKA

Query:  WMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRG-PSWKKRAHAGMVPRG-------------------------LYGHPDSNLRTQSWNLIRRLYDS
          G       +DK    G +TD      + NI   G  SWKK+A A  +  G                         +YG P+++LR ++WNLIRRL   
Subjt:  WMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRG-PSWKKRAHAGMVPRG-------------------------LYGHPDSNLRTQSWNLIRRLYDS

Query:  HEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPI
        H   W   GD NEI+   E  G   R   Q+  FRE LD+C + DLG+ G  FTWCN RD      +RLDR VA+ ++ + F+N ++ +L+   SDH+ +
Subjt:  HEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPI

Query:  ELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSW---ACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESA-SPINF
         L L            R+ F+FEE WT  + C + I+    W      +E+  +   LKNC S LG W       +   + E R ++ ++   A    + 
Subjt:  ELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSW---ACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESA-SPINF

Query:  EVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQK
        + +  L+  ++ L  +EE  W+QRSR +WLK GDRNT++FH +A+ RRRRN I+G+ + +G+W  EK  + +  + Y+ +IF++ Q  ++++
Subjt:  EVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQK

A0A6P9EM92 uncharacterized protein LOC1089797766.5e-6527.42Show/hide
Query:  RLSLTE-EEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLII--
        +LSLTE E E + V VD + +  T    E CL+ +LL  R       ++  R  W+   GL++                             D HL++  
Subjt:  RLSLTE-EEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLII--

Query:  ----------------NFPIRLFG--------SMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEI
                        +F +R+           + + IG  +G  ED+D  +G + WG  +RIRV ID+++ L RG  +   G  +  WV F YE+L + 
Subjt:  ----------------NFPIRLFG--------SMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEI

Query:  CSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLRFSRKGMVLNHFTARE--------IGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSL
        C IC  IGH  RDC Q+   +      P YG  LR   +G+ +     R         +       G+++    ++   +R+  V+P      G      
Subjt:  CSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLRFSRKGMVLNHFTARE--------IGRESPQRGIIINESSDEQIQRRSATVSPALYSFSGGPNVSL

Query:  KGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSDGKEWIGTNEEL
            + VE   +   GF + +        N L    E G Q P+ +TV   +     QP       GPS+    + P     +G   +  +E +G     
Subjt:  KGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEELGPLGSDGKEWIGTNEEL

Query:  SLKADTHENLKEYSDRDDKSVDMKEKAWMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLY-
          +  T E+      ++ K    K ++  G L +    D +      + N     +     R   W+          G+YG+P +  R  +W+LIR+L+ 
Subjt:  SLKADTHENLKEYSDRDDKSVDMKEKAWMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLY-

Query:  -DSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDH
         D  +  W++ GD NE+L   EK  G +R  NQ+ AFRE L DC+L D+GF G KFTW N R+    +S RLDRF+ N++F  LF    V +   A SDH
Subjt:  -DSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDH

Query:  RPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIR---RAGSWACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIK----MSYES
         PI     G + +G   +R R F+FE  W   + C +II      G+   N+ +  +   +K C   L  W       +K    E R +++     + E 
Subjt:  RPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIR---RAGSWACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIK----MSYES

Query:  ASPINFEVIHGLERHLDSLQL---EEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQ
          P+      GL +  ++LQ+    EE+ W+QRSR  WL  GD+NT++FH +A +RRRRN I G+ + QG W  E  + ++  + +F ++FS+ +    Q
Subjt:  ASPINFEVIHGLERHLDSLQL---EEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQ

Query:  KGEYTV
        +G+  V
Subjt:  KGEYTV

A0A7N2MCK5 Uncharacterized protein1.0e-6528.01Show/hide
Query:  LVDEWARLSLT-EEEEDISVTVDREAVDRTGQLLESC---LLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV----------------------------
        +++    + LT EEEE+I ++ +   V+     ++SC   L+ + L  +P   +  +   R AW +D+GLQ+                            
Subjt:  LVDEWARLSLT-EEEEDISVTVDREAVDRTGQLLESC---LLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV----------------------------

Query:  -DSHLII------------------NFPIRLFG--------SMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIY-PDGPLSGLWV
         ++ L++                  +  ++++G        ++   IG  +GV EDV+ R         LR++V + +S+P+RRG  +   DG     WV
Subjt:  -DSHLII------------------NFPIRLFG--------SMVERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIY-PDGPLSGLWV

Query:  PFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIP-QYGDCLRF--SRKGMVLNHFT-----AREIGRESPQRGIIINESSDEQIQRRSATVSPALY
         ++YERL   C  CGI+GH  R C     +   + S+  QYGD LR   +R        T        +G+E     ++    +      R+AT    LY
Subjt:  PFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIP-QYGDCLRF--SRKGMVLNHFT-----AREIGRESPQRGIIINESSDEQIQRRSATVSPALY

Query:  SF--SGGPNVSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDP------MPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEE
             GG + S+  K +    E  Y    F    +P   G +++  P E  +  P      +    +  E  K       R++ GPS+     +P +   
Subjt:  SF--SGGPNVSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDP------MPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEE

Query:  L----GPLGSDGKEWIGT------NEELSLKADTHENLKEY---------------SDRDD-----KSVDMKEKAWMGRLG------MTMSDDKETEWGD
        +     PLG   +E   T       +   +K     NL++                 DR+      K +  K K  + +LG      M   DD   +   
Subjt:  L----GPLGSDGKEWIGT------NEELSLKADTHENLKEY---------------SDRDD-----KSVDMKEKAWMGRLG------MTMSDDKETEWGD

Query:  MTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWD
         +DN    ++      G  W        V  G YG P++  R +SW L+  +    + AW+  GD NE+L   EK         Q+ AFREAL+ CNL D
Subjt:  MTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWNLIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWD

Query:  LGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAG--SWA
        LGF G KFTW NRR   A    RLDR VAN  + + F   +V ++    SDH P  L L   V+     + RR+FKFEE W   EDC  +IR A   S+ 
Subjt:  LGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELSLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAG--SWA

Query:  CNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIK-MSYESASPINFEVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATL
        C S + +    +  CAS L  WG  + +     I+ ++ +I+ ++  + +  N E      + LD     +EIYW QRSR NW+K GD+NTK+FH KA+ 
Subjt:  CNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIK-MSYESASPINFEVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATL

Query:  RRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQK
        RR+RN I G+ D+QG W  E   V +  V YFT +FS+G C  + +
Subjt:  RRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQK

A0A7N2R0C3 Reverse transcriptase domain-containing protein3.1e-6727.41Show/hide
Query:  WARLSLTEEEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLII-
        W RL +TEEEE+ S+ +  E +    +  + C+  +++  + L  E +R+N R  WK ++ +Q+                             +  L++ 
Subjt:  WARLSLTEEEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQV-----------------------------DSHLII-

Query:  ------------------------NFPIRLFGSMV-ERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLE
                                N P++       + IG  +G F +VD     + WG  LR+RV ID++R L RG  I  +      WV F+YERL  
Subjt:  ------------------------NFPIRLFGSMV-ERIGNVVGVFEDVDSRNGFLFWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLE

Query:  ICSICGIIGHVTRDCIQS-SRSEGGSGSIPQYGDCLR------------FSRKGMV------LNHFTAREIGRESPQRGIIINESSDEQIQRRSATVSPA
         C  CG++ H  +DC++   + + G  S  QYG  LR            F++K ++       N   A   GR+  Q G+       E I    +     
Subjt:  ICSICGIIGHVTRDCIQS-SRSEGGSGSIPQYGDCLR------------FSRKGMV------LNHFTAREIGRESPQRGIIINESSDEQIQRRSATVSPA

Query:  LYSFSGGPNVSLKGKE-------------KVVEV--ESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSN-YE
             GG   ++K  E             ++VEV  E++  G    G    +K    NL+ P    +      TV T   V L    G   D+       
Subjt:  LYSFSGGPNVSLKGKE-------------KVVEV--ESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSN-YE

Query:  RESDPE---MDEELGPLGSDGKEWIGTNEELSLK----------------ADTHENLKEYSDRDDKSVDMKEKAW----MGRLGMTMSDDKETEWGD---
         + DPE   +  +LGP     K  I    +  +K                 +  +N+K    R      M   AW    MG      +   E + GD   
Subjt:  RESDPE---MDEELGPLGSDGKEWIGTNEELSLK----------------ADTHENLKEYSDRDDKSVDMKEKAW----MGRLGMTMSDDKETEWGD---

Query:  ---------------------MTDNPTPNFIPNILGRGPSWKK----------RAHA--------GMVP---RGLYGHPDSNLRTQSWNLIRRLYDSHEA
                             +T            G    W++           +H         G VP    G YGHPD+ +R  SW L+  L      
Subjt:  ---------------------MTDNPTPNFIPNILGRGPSWKK----------RAHA--------GMVP---RGLYGHPDSNLRTQSWNLIRRLYDSHEA

Query:  AWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELS
         WV+ GD NEIL  DEK G   RD  Q+  FRE L +C L DLGF G +FTWCN R    +  +RLDR VAN  + NLF   +V +   A SDH  + LS
Subjt:  AWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIELS

Query:  LAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRA-GSWACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESASPINF-----E
        +     R  +   RR F FEE WT  E C  +I RA     CN E+ ++   LK C   L  W    NRR+  N+ ++  + +   +    +N      E
Subjt:  LAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRA-GSWACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESASPINF-----E

Query:  VIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS
         +  L++ ++ + L EEI W QRSR  W+K+GDRNT++FH  A  RRR+N I G+ D++G W+     V +  + YF  I+SS
Subjt:  VIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.5e-1022.76Show/hide
Query:  VIEGDLNEILWQDEKSG--GTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFEN-FQVYNLDWAKSDHRPIEL
        ++ GD ++I    +      TS     +  F+  L D +L D+   G  +TW N +D    +  +LDR +AN ++ + F +   V+ L    SDH P  +
Subjt:  VIEGDLNEILWQDEKSG--GTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFEN-FQVYNLDWAKSDHRPIEL

Query:  SLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSWACNSEIPSLPLAL-------KNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESASPIN
         L          R ++ F++  + + H     ++    +W     + S   +L       K C   L   GF     ++   +E  D ++ S +S    N
Subjt:  SLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSWACNSEIPSLPLAL-------KNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESASPIN

Query:  -----FEVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS
             F V H   +  +      E +++Q+SR  WL+ GD NT++FH+     + +N I  +     +       V +  V Y+T +  S
Subjt:  -----FEVIHGLERHLDSLQLEEEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTTCTACTTTGGTTGATGAATGGGCTCGTCTGAGCTTAACGGAGGAAGAAGAAGATATCTCGGTGACTGTTGATCGTGAGGCGGTCGATCGAACAGGTCAACT
GTTGGAATCTTGTCTTCTGGACAGGCTGTTGTGTCATCGGCCGTTGGGAGCAGAGGTTATGCGGCGCAATTTTAGGGCTGCATGGAAAATCGACCAAGGACTGCAAGTGG
ACAGCCATCTGATCATCAATTTTCCTATTCGGCTTTTTGGATCCATGGTTGAGAGAATTGGCAATGTCGTTGGGGTCTTCGAAGATGTTGACAGCAGAAACGGTTTTCTA
TTCTGGGGAGCCAATTTAAGAATCAGAGTTCGTATTGATCTGTCTCGTCCTCTCCGACGTGGGATCCACATTTATCCGGATGGCCCCCTAAGTGGGTTGTGGGTTCCCTT
CAGGTATGAACGTTTACTGGAGATTTGTTCTATCTGTGGTATTATAGGTCATGTAACGCGTGATTGTATTCAGTCTTCCCGATCGGAAGGAGGTTCTGGGTCAATCCCGC
AATATGGGGATTGTTTACGGTTTTCAAGGAAGGGTATGGTTTTGAATCACTTCACTGCAAGGGAGATAGGCCGTGAATCTCCCCAGCGAGGAATTATAATTAATGAGTCG
TCTGATGAGCAGATCCAGAGGAGGTCAGCTACGGTTTCGCCGGCGTTATACTCATTTTCCGGTGGGCCTAACGTGTCTCTCAAAGGAAAAGAAAAAGTCGTCGAGGTCGA
GAGCAAGTACAGAGGCGGCTTTTTTGCAGGTCTGTGGATGCCGGAGAAATCTGGTCGGAACAATCTGTCGGTTCCTGAGGAGGCGGGGCAACAGGATCCAATGCCGGTGA
CTGTTACAACCAAGGAACCAGTGAAGTTAAAGCAGCCGTTTGGATGGAGAATTGATGAAGGTCCAAGTAATTACGAGAGGGAGAGTGATCCTGAGATGGATGAGGAACTT
GGGCCTTTGGGCTCAGATGGAAAGGAATGGATTGGAACCAATGAAGAGTTGTCACTGAAGGCTGACACTCATGAAAATTTGAAAGAATATTCTGATCGGGATGACAAGTC
AGTTGACATGAAGGAGAAGGCTTGGATGGGAAGGCTTGGGATGACAATGTCTGATGACAAGGAGACTGAGTGGGGTGACATGACTGACAATCCTACACCAAATTTCATTC
CAAACATATTGGGTCGTGGGCCTTCTTGGAAAAAGAGAGCGCATGCTGGGATGGTGCCAAGAGGCTTATATGGTCATCCGGATTCTAATTTAAGGACCCAATCTTGGAAT
CTTATTCGGCGTTTATATGACTCTCACGAGGCTGCTTGGGTTATAGAGGGTGATCTGAATGAAATTCTGTGGCAAGATGAAAAATCAGGTGGTACGTCTAGAGATTACAA
TCAAATTTTGGCTTTTCGTGAAGCTTTGGATGATTGCAATCTCTGGGACCTTGGTTTCTCTGGGGGCAAATTCACTTGGTGTAACAGAAGGGACATGGGGGCTCAAGTGA
GTTTGCGATTGGATCGTTTTGTTGCAAACTCTAACTTTTGTAACCTTTTTGAGAACTTTCAAGTGTATAATCTAGATTGGGCAAAATCTGACCATCGCCCAATTGAACTC
AGTCTGGCGGGAGAAGTTAATCGGGGTCACAAAAATCGGAGGCGTAGGACCTTCAAATTTGAGGAATGGTGGACCTGCCATGAGGATTGTGGTAACATTATTCGCAGAGC
AGGAAGTTGGGCGTGTAACTCGGAAATCCCGTCTTTGCCCCTTGCTTTGAAAAATTGTGCTTCGGCCTTGGGTGGGTGGGGCTTCTGTCAGAATAGAAGACTGAAGACTA
ATATCAGAGAGGTGCGTGATAAAATTAAGATGTCATACGAGAGCGCTTCTCCTATTAATTTTGAGGTCATCCACGGTTTGGAGCGTCATCTGGATTCTCTTCAGCTTGAA
GAAGAGATTTATTGGAAACAGCGTTCGAGGGAAAATTGGCTTAAGTGGGGTGATCGAAACACGAAGTGGTTCCACCAAAAAGCAACTTTACGACGTCGGAGGAATTGTAT
TCTTGGAGTTGAAGACGCTCAAGGAATTTGGCAAACGGAGAAGGCGATTGTACACGATACATTCGTGAACTACTTCACTTCCATATTCTCTTCAGGTCAGTGTGTTAATG
TTCAGAAGGGTGAATATACTGTAAAGAGTGGGTACAAGCTAAGTATGATGCTCGATCAACAGGCTACCTTGTCAGGCGCAAGGAGAGAGACGAGGTGGTGGAAAAAAGTT
TGGAAGATGAGAGTGCCTAGCAAGGGGGCATGGGCTATTTGGAATGATAGAAATTGTCGGGTTCATAACCGTCCAATTCCGAATGTGGAGATGCGTAGTGATTGGATTAT
CGAATATTTGATCGATTATTGGCATGCTAATCCGAAAGGTTCTGTAAATGTTCAGAATGAGGACGATGTGTACAAGATTATCTCAAATGGTGAAGATTATATCTTGCATA
TTGATGCAGCTTTTATGGGTGCTATGCGAGCAAGTGGGGTTGGGTTAGTTCTACGAGATAAATTTGGTCGTCTGAAGGCTGCACAATCTTTACGATCACAGGTTTACACT
TCTCCGTTAGGGGTGAAAGCTGTCGCAGTTCTCCATGGACTTTATATGGCTAAGACGTTGGGTGTGAATCCTCGCTTTGGCACATTGGGTGTGAATCGTGTAACGATATT
GTCGGATTCTCTAACACTGATCAAATCCATCAATGTGGAGATGCAATATGACTCTTGCATGGCGAATACTATCTGGGACATCAAGAATATTCAAAATTCATTCGATAAGA
AGCGCAGCAGCATCTTGGCGGCTTCAGGCTGTGGCGGCGACGACAAGTGGATGAGGGAGGCAAACGGTGAGAGGAGCGGCACGAGCGGAATTCAGGCTGGTCATCCTCTC
ACTTCTGTTTCTTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAGGATGATCTCTCGCCCTTTTCATCTTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGA
ATTCAGGATGATCTCTCGTCCTTCACATCTTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAGGATGATCTCTCGCCCTTCATATCTTGCCTCAGGCAGTG
ATCAACCCCTGACTCAGGCGGAATTCAGGATTATCTCTCGCCCTTCACATTTTGCCTCAGATAACGGGAAAAAACGTCTGGTTGTTCTGCTGGACAAAGATCAACCTATC
AAAAGTGAGACCATTTTTCCAATAGAGACTTCCATTATAGAGTCAAACCTTGAATTGGACAAATCTCATTTTCTTTCGAAATGGTCTGTAGAAAAGACTCAAGGCCAAAA
CTCTTCCACTAAAGTTTGGATTCTACGATCTTCTTTACACGGAAAGAAGTCAAATGTCAATCCAGAACCGACGTTGGGGCGTCGAATCATTGAAAATCAGCATTCTTGGG
TGTACAAGTTGAATCAGAAGCTCGATTTGGTATGGTTGAGGGTAGCAGAGAAGCACGAATCTGGATTTGCTGAGTTCGGGGGATTTCTGGGAAAATCCGTTGGGAATCCC
GTTTTTCTTACTTCAGGCCAAGAGGAAATAGGCCGAGTCTACAGGCCAGGGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTTCTACTTTGGTTGATGAATGGGCTCGTCTGAGCTTAACGGAGGAAGAAGAAGATATCTCGGTGACTGTTGATCGTGAGGCGGTCGATCGAACAGGTCAACT
GTTGGAATCTTGTCTTCTGGACAGGCTGTTGTGTCATCGGCCGTTGGGAGCAGAGGTTATGCGGCGCAATTTTAGGGCTGCATGGAAAATCGACCAAGGACTGCAAGTGG
ACAGCCATCTGATCATCAATTTTCCTATTCGGCTTTTTGGATCCATGGTTGAGAGAATTGGCAATGTCGTTGGGGTCTTCGAAGATGTTGACAGCAGAAACGGTTTTCTA
TTCTGGGGAGCCAATTTAAGAATCAGAGTTCGTATTGATCTGTCTCGTCCTCTCCGACGTGGGATCCACATTTATCCGGATGGCCCCCTAAGTGGGTTGTGGGTTCCCTT
CAGGTATGAACGTTTACTGGAGATTTGTTCTATCTGTGGTATTATAGGTCATGTAACGCGTGATTGTATTCAGTCTTCCCGATCGGAAGGAGGTTCTGGGTCAATCCCGC
AATATGGGGATTGTTTACGGTTTTCAAGGAAGGGTATGGTTTTGAATCACTTCACTGCAAGGGAGATAGGCCGTGAATCTCCCCAGCGAGGAATTATAATTAATGAGTCG
TCTGATGAGCAGATCCAGAGGAGGTCAGCTACGGTTTCGCCGGCGTTATACTCATTTTCCGGTGGGCCTAACGTGTCTCTCAAAGGAAAAGAAAAAGTCGTCGAGGTCGA
GAGCAAGTACAGAGGCGGCTTTTTTGCAGGTCTGTGGATGCCGGAGAAATCTGGTCGGAACAATCTGTCGGTTCCTGAGGAGGCGGGGCAACAGGATCCAATGCCGGTGA
CTGTTACAACCAAGGAACCAGTGAAGTTAAAGCAGCCGTTTGGATGGAGAATTGATGAAGGTCCAAGTAATTACGAGAGGGAGAGTGATCCTGAGATGGATGAGGAACTT
GGGCCTTTGGGCTCAGATGGAAAGGAATGGATTGGAACCAATGAAGAGTTGTCACTGAAGGCTGACACTCATGAAAATTTGAAAGAATATTCTGATCGGGATGACAAGTC
AGTTGACATGAAGGAGAAGGCTTGGATGGGAAGGCTTGGGATGACAATGTCTGATGACAAGGAGACTGAGTGGGGTGACATGACTGACAATCCTACACCAAATTTCATTC
CAAACATATTGGGTCGTGGGCCTTCTTGGAAAAAGAGAGCGCATGCTGGGATGGTGCCAAGAGGCTTATATGGTCATCCGGATTCTAATTTAAGGACCCAATCTTGGAAT
CTTATTCGGCGTTTATATGACTCTCACGAGGCTGCTTGGGTTATAGAGGGTGATCTGAATGAAATTCTGTGGCAAGATGAAAAATCAGGTGGTACGTCTAGAGATTACAA
TCAAATTTTGGCTTTTCGTGAAGCTTTGGATGATTGCAATCTCTGGGACCTTGGTTTCTCTGGGGGCAAATTCACTTGGTGTAACAGAAGGGACATGGGGGCTCAAGTGA
GTTTGCGATTGGATCGTTTTGTTGCAAACTCTAACTTTTGTAACCTTTTTGAGAACTTTCAAGTGTATAATCTAGATTGGGCAAAATCTGACCATCGCCCAATTGAACTC
AGTCTGGCGGGAGAAGTTAATCGGGGTCACAAAAATCGGAGGCGTAGGACCTTCAAATTTGAGGAATGGTGGACCTGCCATGAGGATTGTGGTAACATTATTCGCAGAGC
AGGAAGTTGGGCGTGTAACTCGGAAATCCCGTCTTTGCCCCTTGCTTTGAAAAATTGTGCTTCGGCCTTGGGTGGGTGGGGCTTCTGTCAGAATAGAAGACTGAAGACTA
ATATCAGAGAGGTGCGTGATAAAATTAAGATGTCATACGAGAGCGCTTCTCCTATTAATTTTGAGGTCATCCACGGTTTGGAGCGTCATCTGGATTCTCTTCAGCTTGAA
GAAGAGATTTATTGGAAACAGCGTTCGAGGGAAAATTGGCTTAAGTGGGGTGATCGAAACACGAAGTGGTTCCACCAAAAAGCAACTTTACGACGTCGGAGGAATTGTAT
TCTTGGAGTTGAAGACGCTCAAGGAATTTGGCAAACGGAGAAGGCGATTGTACACGATACATTCGTGAACTACTTCACTTCCATATTCTCTTCAGGTCAGTGTGTTAATG
TTCAGAAGGGTGAATATACTGTAAAGAGTGGGTACAAGCTAAGTATGATGCTCGATCAACAGGCTACCTTGTCAGGCGCAAGGAGAGAGACGAGGTGGTGGAAAAAAGTT
TGGAAGATGAGAGTGCCTAGCAAGGGGGCATGGGCTATTTGGAATGATAGAAATTGTCGGGTTCATAACCGTCCAATTCCGAATGTGGAGATGCGTAGTGATTGGATTAT
CGAATATTTGATCGATTATTGGCATGCTAATCCGAAAGGTTCTGTAAATGTTCAGAATGAGGACGATGTGTACAAGATTATCTCAAATGGTGAAGATTATATCTTGCATA
TTGATGCAGCTTTTATGGGTGCTATGCGAGCAAGTGGGGTTGGGTTAGTTCTACGAGATAAATTTGGTCGTCTGAAGGCTGCACAATCTTTACGATCACAGGTTTACACT
TCTCCGTTAGGGGTGAAAGCTGTCGCAGTTCTCCATGGACTTTATATGGCTAAGACGTTGGGTGTGAATCCTCGCTTTGGCACATTGGGTGTGAATCGTGTAACGATATT
GTCGGATTCTCTAACACTGATCAAATCCATCAATGTGGAGATGCAATATGACTCTTGCATGGCGAATACTATCTGGGACATCAAGAATATTCAAAATTCATTCGATAAGA
AGCGCAGCAGCATCTTGGCGGCTTCAGGCTGTGGCGGCGACGACAAGTGGATGAGGGAGGCAAACGGTGAGAGGAGCGGCACGAGCGGAATTCAGGCTGGTCATCCTCTC
ACTTCTGTTTCTTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAGGATGATCTCTCGCCCTTTTCATCTTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGA
ATTCAGGATGATCTCTCGTCCTTCACATCTTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAGGATGATCTCTCGCCCTTCATATCTTGCCTCAGGCAGTG
ATCAACCCCTGACTCAGGCGGAATTCAGGATTATCTCTCGCCCTTCACATTTTGCCTCAGATAACGGGAAAAAACGTCTGGTTGTTCTGCTGGACAAAGATCAACCTATC
AAAAGTGAGACCATTTTTCCAATAGAGACTTCCATTATAGAGTCAAACCTTGAATTGGACAAATCTCATTTTCTTTCGAAATGGTCTGTAGAAAAGACTCAAGGCCAAAA
CTCTTCCACTAAAGTTTGGATTCTACGATCTTCTTTACACGGAAAGAAGTCAAATGTCAATCCAGAACCGACGTTGGGGCGTCGAATCATTGAAAATCAGCATTCTTGGG
TGTACAAGTTGAATCAGAAGCTCGATTTGGTATGGTTGAGGGTAGCAGAGAAGCACGAATCTGGATTTGCTGAGTTCGGGGGATTTCTGGGAAAATCCGTTGGGAATCCC
GTTTTTCTTACTTCAGGCCAAGAGGAAATAGGCCGAGTCTACAGGCCAGGGAACTAG
Protein sequenceShow/hide protein sequence
MDSSTLVDEWARLSLTEEEEDISVTVDREAVDRTGQLLESCLLDRLLCHRPLGAEVMRRNFRAAWKIDQGLQVDSHLIINFPIRLFGSMVERIGNVVGVFEDVDSRNGFL
FWGANLRIRVRIDLSRPLRRGIHIYPDGPLSGLWVPFRYERLLEICSICGIIGHVTRDCIQSSRSEGGSGSIPQYGDCLRFSRKGMVLNHFTAREIGRESPQRGIIINES
SDEQIQRRSATVSPALYSFSGGPNVSLKGKEKVVEVESKYRGGFFAGLWMPEKSGRNNLSVPEEAGQQDPMPVTVTTKEPVKLKQPFGWRIDEGPSNYERESDPEMDEEL
GPLGSDGKEWIGTNEELSLKADTHENLKEYSDRDDKSVDMKEKAWMGRLGMTMSDDKETEWGDMTDNPTPNFIPNILGRGPSWKKRAHAGMVPRGLYGHPDSNLRTQSWN
LIRRLYDSHEAAWVIEGDLNEILWQDEKSGGTSRDYNQILAFREALDDCNLWDLGFSGGKFTWCNRRDMGAQVSLRLDRFVANSNFCNLFENFQVYNLDWAKSDHRPIEL
SLAGEVNRGHKNRRRRTFKFEEWWTCHEDCGNIIRRAGSWACNSEIPSLPLALKNCASALGGWGFCQNRRLKTNIREVRDKIKMSYESASPINFEVIHGLERHLDSLQLE
EEIYWKQRSRENWLKWGDRNTKWFHQKATLRRRRNCILGVEDAQGIWQTEKAIVHDTFVNYFTSIFSSGQCVNVQKGEYTVKSGYKLSMMLDQQATLSGARRETRWWKKV
WKMRVPSKGAWAIWNDRNCRVHNRPIPNVEMRSDWIIEYLIDYWHANPKGSVNVQNEDDVYKIISNGEDYILHIDAAFMGAMRASGVGLVLRDKFGRLKAAQSLRSQVYT
SPLGVKAVAVLHGLYMAKTLGVNPRFGTLGVNRVTILSDSLTLIKSINVEMQYDSCMANTIWDIKNIQNSFDKKRSSILAASGCGGDDKWMREANGERSGTSGIQAGHPL
TSVSSGSDQPLIQAEFRMISRPFHLASGSDQPLIQAEFRMISRPSHLASGSDQPLIQAEFRMISRPSYLASGSDQPLTQAEFRIISRPSHFASDNGKKRLVVLLDKDQPI
KSETIFPIETSIIESNLELDKSHFLSKWSVEKTQGQNSSTKVWILRSSLHGKKSNVNPEPTLGRRIIENQHSWVYKLNQKLDLVWLRVAEKHESGFAEFGGFLGKSVGNP
VFLTSGQEEIGRVYRPGN