CuGenDBv2

Spg035553 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg035553
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold3:2486865..2495386
RNA-Seq Expression: Spg035553
Synteny: Spg035553
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis] (e value: 6.0e-69; identity: 25.55%)
Query:  IDNLITEWRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLF
        +D+L + W   +L ++E+   I  + +    + ++ E  L  K+ ++R + K  +++     W+      +  V  NT++  F    +KD +++  PW F
Subjt:  IDNLITEWRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLF

Query:  DRNLLILENP---------------------------SARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVS
        D +L ++ NP                              +    K+G+ + E  + + +++   WG S+R+++ LD+ +PL RG  +   GV    W  
Subjt:  DRNLLILENP---------------------------SARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVS

Query:  IRYERLPDFCFGCGRIGHVAKGCLEMNDSEEARKNSLEYGAWLKFQGFARIMSKTDNPINKERETNEQ---------DLSTKDDIGRNKKQGERLNVDLN
        ++YE++P FCF CGRI H + GC    D       S+++G+WL+ +   +   +    ++ ++ ++ +         D    D  G  K  G  L+  + 
Subjt:  IRYERLPDFCFGCGRIGHVAKGCLEMNDSEEARKNSLEYGAWLKFQGFARIMSKTDNPINKERETNEQ---------DLSTKDDIGRNKKQGERLNVDLN

Query:  KDSPIIEDLINLESRS---KEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVNQK---------------------EGNHIEVDGSNKVSSIMEISQS
         D  I  D + ++  S   K    C  E  + ++   + D  +  N+ G E   ++                          +  +G     ++ ++  +
Subjt:  KDSPIIEDLINLESRS---KEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVNQK---------------------EGNHIEVDGSNKVSSIMEISQS

Query:  ATGNNSKKTSWKRKMR-----TDHTETPKMKLGCG--------------TALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTN
          G ++ ++SWK+K R      +  E   ++   G               ALP  M+LI WN+RGLGNP   ++L  LV ++ P +LFL ETK + +V  
Subjt:  ATGNNSKKTSWKRKMR-----TDHTETPKMKLGCG--------------TALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTN

Query:  RIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEITWQE---KVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILC
        R+K    F +C  V   G  GG+ + W ++  + I++FS  HI  ++T +E   + W  TG YG  ++SK+ ++W+L+  L    +  WL+ GD NEIL 
Subjt:  RIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEITWQE---KVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILC

Query:  HEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNR-RGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRT
        + EK GG  +    M+ F+  +D C L DL   G  FTW   R R   I ER+DRFL N  + + +  +S  +    +SDH P  I ++  + +  G R 
Subjt:  HEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNR-RGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRT

Query:  NQFKFEEVWTKYEECSELITKNGDWKGV
          F+FE +W    +C ++I     W+GV
Subjt:  NQFKFEEVWTKYEECSELITKNGDWKGV

MCH80348.1 hypothetical protein [Trifolium medium] (e value: 1.2e-69; identity: 29.81%)
Query:  DEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQR
        D++   IT   +E    D+     L  K+ T         K  +T AW++R   +I+ + KN YLFKF  K E D +  NGPW FDRNLLIL   S  ++
Subjt:  DEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQR

Query:  TS--------------------------RKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRI
         S                          +K+GN +  + + D  KE    G  +R+RV +D+ +PL+RG  L   G  ++ WV  +YERLP+FCF CGRI
Subjt:  TS--------------------------RKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRI

Query:  GHVAKGCLEMNDSEEARKNSLE-----YGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRN---------KKQGERLNVDLNKDSPIIEDLIN
        GH  + C ++ D +E + + LE     +G WL+     ++  +     +    +     ST +  G+N         + + +R++ D +    I +  ++
Subjt:  GHVAKGCLEMNDSEEARKNSLE-----YGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRN---------KKQGERLNVDLNKDSPIIEDLIN

Query:  LESRSKEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMKLGCGTALPS
         ++    +K  + + ++  + E+    T S      E T   KEG         KV+       + T     K S    M TD T    MK G       
Subjt:  LESRSKEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMKLGCGTALPS

Query:  AMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLG----AKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ
         +   C      G+PRA R+L  L     PQ++FL ET+        I+    F +C  VD  G      GGL ++W +  S+ I SFS NHI      +
Subjt:  AMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLG----AKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ

Query:  E--KVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTW-HGNRRGTQIWE
        E  + W  TG+YG+PE   K++TW LI  L       WL  GD N+IL   EK GG  R ++     +  +    L DL  +G  FTW +G   G  +  
Subjt:  E--KVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTW-HGNRRGTQIWE

Query:  RIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKN
        R+DR + N +F   F+     +L    SDH  + IC+   + +   RR   F+FEE WTK  +C ELI  N
Subjt:  RIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKN

XP_022841874.1 uncharacterized protein LOC111365549 [Olea europaea var. sylvestris] (e value: 3.5e-69; identity: 28.7%)
Query:  IDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQRTSR-------------
        I  + + CL  KLL++++  K   K  +   W       I  +  N ++ +F++  +KD +K  GPW+F + L++ ++    ++  +             
Subjt:  IDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQRTSR-------------

Query:  -------------KIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRIGHVAKGCLEMNDSEEA
                      IG  L + I+ D +K +   G  + +RV LD+++PL RG  +   G  +  W    YERLP+FC+ CG +GH  K  +    + EA
Subjt:  -------------KIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRIGHVAKGCLEMNDSEEA

Query:  RKNS-LEYGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLN------VDLNKDSPIIED--------LINLESRSKEAKTCDDEI
           S   YG WL+    A   S    PI  +R+ N +   + + +          N      V+L     ++E         L+NL +  +  +T  + I
Subjt:  RKNS-LEYGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLN------VDLNKDSPIIED--------LINLESRSKEAKTCDDEI

Query:  DIL-RINENSE----------DWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDH-----TETPKMKLGCGTALP
            R   N E          D  T+ N   +     Q +G H  +      S+  E+   A G N  +  WKR + T+H        P+  L      P
Subjt:  DIL-RINENSE----------DWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDH-----TETPKMKLGCGTALP

Query:  S------AMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEI
        S       + L+ WNA+G  NPR   +L +L+    P VLFL ETK +       +I   F  C  V ++G  GG+ +LWK   +L+I  +S  HID +I
Subjt:  S------AMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEI

Query:  TWQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGT-QIW
              W  TG+YG PE +K+ +TW+L+ RL    +  WL+ GD NEIL +EEK GG  R R  +E+F+H    C LRDL  +G  +TW+  R  T QI+
Subjt:  TWQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGT-QIW

Query:  ERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELI
        ER+DRF+ N  +  +F  +   +    +SDH+P+ I ++  S    G +   F+FE +W   + CS+++
Subjt:  ERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELI

XP_030509295.1 uncharacterized protein LOC115723978 [Cannabis sativa] (e value: 5.3e-65; identity: 28.49%)
Query:  CLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILE------NPSA--------------------
        CL  + LTNR I    ++N L   W+      ++ +  N +LF+F ++V+   + +  PW +DR  LI+E      +P A                    
Subjt:  CLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILE------NPSA--------------------

Query:  RQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRIGHVAKGCLEMNDSEE--------
        R+ T R +GN + ++I+ D       W + +R+RV LD+T+PLRR L L  +  S   WV+ +YER P FCF CG IG+  K C ++             
Subjt:  RQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRIGHVAKGCLEMNDSEE--------

Query:  ------ARKNSLEYGA-WLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLN------------VDLNKDSPIIEDLINLESRSKEAKT
               ++N    GA WL+ +G+   ++    P  +  +TN    S+  D    K++ + L+            V   KD  +  +    +S+ K    
Subjt:  ------ARKNSLEYGA-WLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLN------------VDLNKDSPIIEDLINLESRSKEAKT

Query:  CDDEIDILRINENSEDWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMKLGCGTALPSAMNLICWNAR
         DDE D++   ++ ED     +    +   + K G+H+   G  +V S+        G       WK                     P+ M +I WN R
Subjt:  CDDEIDILRINENSEDWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMKLGCGTALPSAMNLICWNAR

Query:  GLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKI-----SCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ-EKVWRFTG
        GLG  R  + LK LV  +RP V+FLCET     +TN++K+     S  F+  FTV++ G  GG+ +LWK+++ + I SFS NHID  +T++ ++ +RFTG
Subjt:  GLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKI-----SCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ-EKVWRFTG

Query:  VYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQIW--ERIDRFLCNP
        +YG P  S ++QTW LI  L+A  ++PW + GD N +L   EK+GG P    L++ F+  +  CGL DL   G  FTW    R T  W   R+DR     
Subjt:  VYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQIW--ERIDRFLCNP

Query:  DFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKNGDWKG
            MF++ +++                          +   F+FE  W  +++C ++I  + +  G
Subjt:  DFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKNGDWKG

XP_042954615.1 uncharacterized protein LOC122291031 [Carya illinoinensis] (e value: 7.6e-64; identity: 26.91%)
Query:  DNLITEWRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFD
        + L  +W K +L + EK   ++       T  +Q   CL   ++  + I K   K+ +   W+         VG N +L +F++  +   +    PW FD
Subjt:  DNLITEWRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFD

Query:  RNLLILEN--------------------------PSARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIR
        R LL L+                               Q T   +G+ + + +    +     WG  +RIRV++D+T+ L RG  L  +G  + CWV  +
Subjt:  RNLLILEN--------------------------PSARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIR

Query:  YERLPDFCFGCGRIGHVAKGCLEMNDSEEARKNSLEYGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLNVDLNKDSPIIEDLIN
        YERLP  CF CG I H    CL    S  +R  S +YGAWL+    A     +D  + K  E  + +    D   R + +G     DL   SP I D   
Subjt:  YERLPDFCFGCGRIGHVAKGCLEMNDSEEARKNSLEYGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLNVDLNKDSPIIEDLIN

Query:  LESRSKEAKTCD--DEIDIL------------------------------------RINENSEDWTTSMNVNGTEWTVNQK-----EGNHIEVDGSNKVS
          + + +   C   D  DI                                     ++    ++ T    +  T  T  +       G  I  D   ++ 
Subjt:  LESRSKEAKTCD--DEIDIL------------------------------------RINENSEDWTTSMNVNGTEWTVNQK-----EGNHIEVDGSNKVS

Query:  SI---MEISQSATGNNSKKTSWKRKMRTDHTETPKMKLG----------------------------CGTALPSAMNLICWNARGLGNPRAFRSLKHLVI
        SI   +     A     K  +W+R+ R   TE     LG                               A P ++  I WN+RGLGNP   R L+ L  
Subjt:  SI---MEISQSATGNNSKKTSWKRKMRTDHTETPKMKLG----------------------------CGTALPSAMNLICWNARGLGNPRAFRSLKHLVI

Query:  TRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ---EKVWRFTGVYGFPEISKKKQTWDLICR
           P +LFL ET+       +IK    F +C  V +    GG+ +LWK+   L I  +S +HI  EI  Q   E VW  TGVYG PE +++  TW LI  
Subjt:  TRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ---EKVWRFTGVYGFPEISKKKQTWDLICR

Query:  LYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFT-WHGNRRGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSD
        L       WL+ GD NEIL   EK GG  R    MEDF   L  C L DL   G  FT W+G +    + ER+DRF  N ++ + F      +L   +SD
Subjt:  LYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFT-WHGNRRGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSD

Query:  HKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKNGDWKG
        H  + + ++ +       R   F+FE +WT+ E C ++++   + +G
Subjt:  HKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKNGDWKG

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2N9EVR9 F-box domain-containing protein (e value: 2.4e-71; identity: 29.53%)
Query:  WRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLI-
        W +F+L ++E       N  +  +   Q + CLA K LT R +   ++       W+T + F I  +  NT +F FE++ +++ +    PW +D++L++ 
Subjt:  WRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLI-

Query:  --LENPSARQRTSRK-----------------------IGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPD
          +E+  A      K                       +G+ L +        E    G ++RIRV +D+T+PL RG   + +   ++ W+S +YERLP+
Subjt:  --LENPSARQRTSRK-----------------------IGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPD

Query:  FCFGCGRIGHVAKGC-LEMNDSEEARKNSLEYGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLNVDLNKDSPIIEDL-------
        FC+ CG + H  K C   + + +  R    ++GAWL+     R   KT+  I  E    +      +       Q  R        SP I  +       
Subjt:  FCFGCGRIGHVAKGC-LEMNDSEEARKNSLEYGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLNVDLNKDSPIIEDL-------

Query:  -------INLESRSKEAKTCDDEIDILRINENS---------------------EDWTTSM----NVNGTEWTVNQKEGNHIEVDGSNKV-SSIMEISQS
               I + S+S    T   ++    I   S                     ED    M    N+  T   + Q  G      G       I   SQ 
Subjt:  -------INLESRSKEAKTCDDEIDILRINENS---------------------EDWTTSM----NVNGTEWTVNQKEGNHIEVDGSNKV-SSIMEISQS

Query:  ATGN---NSKKTSWKRKMRTDHTETPKMKLGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDS
         TG+    S K SWK+  R         K G  T  P AMN + WN RGLGNPR  + L  LV  + P ++FL ET  DE    R++    F++ F  +S
Subjt:  ATGN---NSKKTSWKRKMRTDHTETPKMKLGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDS

Query:  LGAKGGLCILWKDQNSLAIKSFSNNHIDTEI-TWQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDF
            GGLC+LWK   ++ + SFS +HID  +    E  WRFTG YG PE  K++++W L+ RL +  + PW   GD NE++  EEK G   R    M+ F
Subjt:  LGAKGGLCILWKDQNSLAIKSFSNNHIDTEI-TWQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDF

Query:  KHCLDYCGLRDLKPQGELFTWHGNRRGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICI--RSRSSNFGGRRTNQFKFEEVWTKYEECSE
        +  LD CG  DL   G  FTW  NR G   WER+DR +   ++   F  +   +LD  +SDHKP+ + +    R S         F+FEEVWT  + C E
Subjt:  KHCLDYCGLRDLKPQGELFTWHGNRRGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICI--RSRSSNFGGRRTNQFKFEEVWTKYEECSE

Query:  LITKNGDWK----GVSIF
        +IT    WK    GV +F
Subjt:  LITKNGDWK----GVSIF

A0A2N9HYE3 Reverse transcriptase domain-containing protein (e value: 1.0e-69; identity: 26.58%)
Query:  WRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLIL
        W KF+L + E        + +  +     +  LA K LT R +    +       W+T + F I  + +N  +F F+++ +++ +    PW +D+ L+IL
Subjt:  WRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLIL

Query:  EN------------------------PSAR--QRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPD
        +                         P  R     +  +G+ L +       +     G ++RIRV +D+T+PL RG   + +    + W+S +YERLP+
Subjt:  EN------------------------PSAR--QRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPD

Query:  FCFGCGRIGHVAKGCLE-MNDSEEARKNSLEYGAWLKFQG---FARIMSKTD-------------------NPINKERETNEQDLSTKDDIGRNKKQGER
        FC+ CG + H  K C   + + E       ++G+WL+      + ++  K D                   +PI    ++  +   T   I  +      
Subjt:  FCFGCGRIGHVAKGCLE-MNDSEEARKNSLEYGAWLKFQG---FARIMSKTD-------------------NPINKERETNEQDLSTKDDIGRNKKQGER

Query:  LNVDLNKDSPIIEDLI----NLESRSKEAKTCDDEIDILRINENS-------------EDWTTSMN--VNGTEWTVNQKEGNHIEVDGS-NKVSSIMEIS
         +       P+ +        +   +  + T DD    + + EN              +D+ + +    N   ++    E     +  S + ++S  + S
Subjt:  LNVDLNKDSPIIEDLI----NLESRSKEAKTCDDEIDILRINENS-------------EDWTTSMN--VNGTEWTVNQKEGNHIEVDGS-NKVSSIMEIS

Query:  QSATGN--------------NSKKTSWKRKMRTDHT------ETPKMKLGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEK
        + A G+              +S K +WK+  RT         E  ++  GCG ALP AMN + WN RGLGNPR  + +  L   + P V+FL ET  DE 
Subjt:  QSATGN--------------NSKKTSWKRKMRTDHT------ETPKMKLGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEK

Query:  VTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEIT-WQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEIL
           +++    FD  F V+     GGLC+ WK    L+++SFS++HID  +   Q   WRFTG YG PE  K++++WDL+ RL A    PW   GD NE++
Subjt:  VTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEIT-WQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEIL

Query:  CHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRT
          EEK G   R  S M+ F+  LD CG  DL   G  FTW  NR G   WER+DR +  PD+   F  +   +L+  +SDHKPI +   +        + 
Subjt:  CHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRT

Query:  NQFKFEEVWTKYEECSELITKNGDWKGVSIFLCIVLYAIINMKLPTTVFFYCKRAKDIWKLTFDHVFLEEDFRGSVMDRWFKLNETLSME
          F+FEEVWT  + C  +I     WK       + +Y +            C+R   +W  T    F     R   ++R  K+ E  SM+
Subjt:  NQFKFEEVWTKYEECSELITKNGDWKGVSIFLCIVLYAIINMKLPTTVFFYCKRAKDIWKLTFDHVFLEEDFRGSVMDRWFKLNETLSME

A0A2N9IIR5 Uncharacterized protein (e value: 1.4e-71; identity: 29.51%)
Query:  IDNLITEWRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLF
        ++++ + W KF+L + E     TF+ D   T        LA K  T R +   ++       WKT   F ++ +G+N  +F FE+ V+ D +  N PW +
Subjt:  IDNLITEWRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLF

Query:  DRNLLILEN--------------------------PSARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSI
        D++L+IL+                            S     +  IG+ L     + +++     G+ +RIR+ LD ++PL RG  ++  G  +  WVS 
Subjt:  DRNLLILEN--------------------------PSARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSI

Query:  RYERLPDFCFGCGRIGHVAKGCLE-MNDSEEARKNSLEYGAWLK----FQGFARIMSKTDNPINKER---ETNEQDLSTKDDIGRNKK---QGERLNVDL
        ++ERLP+FC+ CGR+ H  K C + +         S +YG WL+    F G  R++S   +     R   +TN +  STK  +  +     + + +N D+
Subjt:  RYERLPDFCFGCGRIGHVAKGCLE-MNDSEEARKNSLEYGAWLK----FQGFARIMSKTDNPINKER---ETNEQDLSTKDDIGRNKK---QGERLNVDL

Query:  ---NKDSPIIEDLINLESRSKEAKTCDDEIDIL-RINENSEDWTTSMNVNGTEWTVNQKEGNHIEVD------GSNKVSSIMEISQSATGNNSKKTSWKR
           +K++P  E        +    T    ++ + ++ E   D     N N       +K  +  +VD      GS K  + +  + SAT     K +WK+
Subjt:  ---NKDSPIIEDLINLESRSKEAKTCDDEIDIL-RINENSEDWTTSMNVNGTEWTVNQKEGNHIEVD------GSNKVSSIMEISQSATGNNSKKTSWKR

Query:  KMRTDHT---ETPKMKLGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKD
         +    +       +  GC TALP AMN I WN RGLGNPR  + L  LV  + P+ +F+ ET  D     +I+   +F +   V      GG+ + WK 
Subjt:  KMRTDHT---ETPKMKLGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKD

Query:  QNSLAIKSFSNNHIDTEIT-WQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLK
          +L IKSFS  HID+ I       WRFTG YG PE   +  +WD++  L    + PW   GD NE++   EK GG PR  + M+ F+  LD CG +DL 
Subjt:  QNSLAIKSFSNNHIDTEIT-WQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLK

Query:  PQGELFTWHGNR-RGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELI
          G  FTW  NR  G  +WER+DR + N ++   F  ++  ++    SDH P+ +   S SS     R   F+FE +W   E C   +
Subjt:  PQGELFTWHGNR-RGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELI

A0A392M033 CCHC-type domain-containing protein (Fragment) (e value: 5.9e-70; identity: 29.81%)
Query:  DEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQR
        D++   IT   +E    D+     L  K+ T         K  +T AW++R   +I+ + KN YLFKF  K E D +  NGPW FDRNLLIL   S  ++
Subjt:  DEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQR

Query:  TS--------------------------RKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRI
         S                          +K+GN +  + + D  KE    G  +R+RV +D+ +PL+RG  L   G  ++ WV  +YERLP+FCF CGRI
Subjt:  TS--------------------------RKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRI

Query:  GHVAKGCLEMNDSEEARKNSLE-----YGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRN---------KKQGERLNVDLNKDSPIIEDLIN
        GH  + C ++ D +E + + LE     +G WL+     ++  +     +    +     ST +  G+N         + + +R++ D +    I +  ++
Subjt:  GHVAKGCLEMNDSEEARKNSLE-----YGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRN---------KKQGERLNVDLNKDSPIIEDLIN

Query:  LESRSKEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMKLGCGTALPS
         ++    +K  + + ++  + E+    T S      E T   KEG         KV+       + T     K S    M TD T    MK G       
Subjt:  LESRSKEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMKLGCGTALPS

Query:  AMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLG----AKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ
         +   C      G+PRA R+L  L     PQ++FL ET+        I+    F +C  VD  G      GGL ++W +  S+ I SFS NHI      +
Subjt:  AMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLG----AKGGLCILWKDQNSLAIKSFSNNHIDTEITWQ

Query:  E--KVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTW-HGNRRGTQIWE
        E  + W  TG+YG+PE   K++TW LI  L       WL  GD N+IL   EK GG  R ++     +  +    L DL  +G  FTW +G   G  +  
Subjt:  E--KVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTW-HGNRRGTQIWE

Query:  RIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKN
        R+DR + N +F   F+     +L    SDH  + IC+   + +   RR   F+FEE WTK  +C ELI  N
Subjt:  RIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKN

A0A803PNQ8 Uncharacterized protein (e value: 1.6e-70; identity: 30%)
Query:  KDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLIL------
        ++D++  ++   R E   ID +   CL  KLLT R       +N +   W+      ++ +  N +LF+F ++V+   + +  PW +DR   I       
Subjt:  KDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEKDWIKNNGPWLFDRNLLIL------

Query:  ENP------------------SARQRTS--RKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDC---WVSIRYERLPDFCF
        ENP                  S     S    +GN +  +++ D N     W + +R+RV+LDV +P++R + +    ++ED    WV+ +YERLP FCF
Subjt:  ENP------------------SARQRTS--RKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDC---WVSIRYERLPDFCF

Query:  GCGRIGHVAKGC----------------LEMNDSEEARKNSLEYGA-WLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLNVDLNKDS
         CG IGH  K C                LE+  + + R+++  +GA WL+ +   R      +   +E + ++ D ST   I RN +    L V   +DS
Subjt:  GCGRIGHVAKGC----------------LEMNDSEEARKNSLEYGA-WLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLNVDLNKDS

Query:  PIIEDLINLESRSKEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVN-QKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMK
         I  +    E   K             I EN       M VNG E  +N    GN  +   +N + S                   ++ R D+       
Subjt:  PIIEDLINLESRSKEAKTCDDEIDILRINENSEDWTTSMNVNGTEWTVN-QKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMK

Query:  LGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDT
        L    + P AM+ + WN RGLGNPRA + L  LV  ++P +LFLCET C++     I++   FD CF V++ G  GGL +LWK+ + +AI+ FS NHID 
Subjt:  LGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKVTNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDT

Query:  EITWQEKV-WRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQ
         +  Q  V WR TGVYG P    +  TW+L   +      PW L GD+N +  ++EK GG P    L++ F    D C L +L   G  FTW   R G  
Subjt:  EITWQEKV-WRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRDRSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQ

Query:  -IWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKNGDWKG
         + ER+D+ L    + ++F  S+  NL+   SDH PI +  +  S +      N F+FE  W +   C +L+   G W+G
Subjt:  -IWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKNGDWKG

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein (e value: 1.5e-09; identity: 34.74%)
Query:  SKKKQTWDLICRLYAGG---NSPWLLGGDLNEILCHEEKLGGPPRDRSL--MEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQIWERIDRFLCN
        ++++  WD I RL A     NSPWL+ GD N+I    E     P + SL  +ED + C+    L DL  +G L+TW  +++   I  ++DR + N
Subjt:  SKKKQTWDLICRLYAGG---NSPWLLGGDLNEILCHEEKLGGPPRDRSL--MEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQIWERIDRFLCN

AT5G36228.1 nucleic acid binding;zinc ion binding (e value: 5.4e-07; identity: 27.2%)
Query:  LFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERL
        L ++E+   +D++     W+  R + +   P   +RT   I + L E +  D N+E T     IR++V++D T PLR    ++     E   +   YE+L
Subjt:  LFKFENKVEKDWIKNNGPWLFDRNLLILENPSARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERL

Query:  PDFCFGCGRIGHVAKGCLEMNDSEE
           C  C R+ H    C  +   EE
Subjt:  PDFCFGCGRIGHVAKGCLEMNDSEE


Sequences
CDS sequence
ATGATTCCCACATGTTCCCGTCACGATCCTCACGATCTAAGCTCCAAAAATCAACTCCTCACGATCTCCTCAGTATTCTGGCCGAGTTTTGCTCTCAGTAGTCGAAATTT
TCTCTCCAAAGAAGCTCCTAAAAAAAGACTGCATCGTTTAGAATTTATACTGCCCATTCTGCCGCGACAGCATCGTGACGCTAAGGACACAGCGTCGCAACGCAGTCGCG
TTGCGTGCCCTAATAATTCTGGAAAGTGCCGCAGCGTCGAGACGCTGTGGAGCAATCTGATCAATGTAGTCCTTTCGGAAGATAGGGATGTTTTTCGGATGGTCAGCACG
TTTCAAAATGGTACGTACAGGGATTTCTATGTAGTTGTCGACGTGCAGGATGTAGATACTGGTATACACAATCGGGGGCAGTTGCAAGTGAAGGGGATGAAAGCTCTGTC
GCAGCGGAAGCGCATTGATAGGACCTTGCGTCGGATCCCGCCCTCTCACTGGCCTGAGAGGGACTTTATGTTTATTGGTTGGACCATAAACAGGTTGTTCATTAGAGGAG
CACTGGTACTTAAGGAGCTAGAGGTAACTCAGAGGTCCATTAGGTCCCCTGCTAGCTCGTCAAGACCGCTTCTCGGTTCTTGGAGGAATCCCCTGAAAATCTGGGGCTAC
ATCTGCACTATTGCTAGAAGTCAAGAAGCACAACGAGTGGGAGATATGGGAACTGCAATGGAAATAGACAATCTGATTACCGAATGGAGGAAATTTAACCTGAAAGATGA
TGAGAAAACAACAAAAATTACCTTCAATCGAGATGAGGCCAAAACAATTGATAAGCAGATGGAGGTATGCCTGGCTGAAAAACTCCTGACAAACAGAAACATCACTAAAA
CAACTCTCAAGAATGCATTAACCGGAGCATGGAAAACAAGATACGATTTTGATATAGAAACAGTTGGTAAGAATACTTATCTGTTCAAATTTGAGAATAAAGTGGAGAAA
GATTGGATAAAGAACAATGGACCCTGGCTGTTTGATAGGAATTTGCTAATTCTTGAGAACCCATCTGCAAGACAAAGAACATCAAGAAAGATTGGGAACATCTTGGCAGA
GTATATCGATTGGGACAACAATAAAGAAAAGACTCCTTGGGGGAATAGCATAAGAATAAGAGTTAAATTGGATGTGACTAGACCTCTTCGAAGAGGTCTTATGCTTCAAA
CAGATGGAGTCAGTGAGGATTGCTGGGTCTCTATTAGATACGAGAGACTCCCCGACTTTTGCTTCGGATGTGGCAGAATAGGGCACGTTGCTAAAGGATGTCTCGAGATG
AATGATAGTGAGGAAGCAAGAAAAAACAGTTTAGAGTATGGGGCATGGTTAAAATTTCAAGGATTTGCCAGAATTATGAGTAAAACAGACAACCCAATCAATAAAGAAAG
AGAGACAAATGAACAAGATCTAAGCACAAAAGATGATATTGGGAGAAATAAGAAACAAGGGGAAAGGCTGAATGTGGATCTTAATAAAGACAGTCCAATAATTGAAGATT
TGATCAACCTGGAAAGCAGAAGCAAAGAAGCAAAGACATGTGATGATGAGATTGACATTCTAAGAATTAATGAGAATAGCGAAGACTGGACTACAAGCATGAATGTAAAT
GGGACTGAGTGGACTGTTAACCAGAAAGAAGGAAATCATATTGAGGTAGACGGCAGTAACAAAGTATCATCAATTATGGAAATTAGCCAAAGTGCTACGGGGAACAATTC
AAAGAAGACAAGCTGGAAAAGGAAAATGAGGACTGATCATACAGAGACCCCAAAGATGAAACTTGGCTGTGGAACAGCCCTGCCTTCAGCCATGAATCTCATTTGCTGGA
ATGCTCGGGGTTTGGGGAACCCGAGAGCATTTCGATCACTTAAACACCTTGTAATAACAAGGAGACCCCAAGTTCTCTTTCTATGTGAGACAAAGTGTGACGAGAAAGTT
ACTAATAGAATCAAGATTTCTTGCAATTTCGATGACTGCTTTACTGTTGACAGCTTAGGAGCTAAAGGAGGACTCTGCATCTTATGGAAAGATCAGAATTCATTGGCCAT
TAAATCCTTTTCAAATAACCACATAGACACAGAGATTACATGGCAAGAGAAGGTTTGGAGATTTACTGGAGTATATGGATTTCCTGAAATCAGTAAGAAAAAACAAACAT
GGGACTTGATATGTAGGCTCTATGCTGGAGGTAATTCCCCTTGGCTCTTGGGAGGGGACCTAAACGAAATTTTGTGTCATGAGGAGAAGTTGGGAGGCCCTCCAAGAGAT
AGATCATTAATGGAAGATTTCAAGCATTGCTTAGATTACTGTGGCTTGAGAGATTTAAAGCCCCAAGGTGAGCTTTTTACATGGCATGGGAATCGGAGAGGCACTCAAAT
TTGGGAAAGGATCGACCGCTTTCTTTGCAACCCCGACTTTGACACTATGTTTAATTTCTCTAGTTCGAGGAATTTGGATTGGTTATTTTCAGATCATAAACCAATTGAGA
TATGCATCAGAAGCAGAAGCAGCAACTTTGGTGGAAGAAGAACAAACCAGTTCAAATTTGAAGAAGTATGGACAAAGTATGAAGAATGTTCAGAGTTAATTACCAAGAAT
GGTGATTGGAAAGGGGTATCGATATTCCTATGCATTGTCCTCTATGCAATTATCAATATGAAACTTCCGACCACTGTCTTTTTTTATTGTAAAAGGGCAAAAGATATTTG
GAAACTAACCTTTGATCATGTTTTTTTGGAGGAGGATTTCCGTGGTAGCGTGATGGATAGATGGTTTAAACTCAACGAGACCTTGTCCATGGAGAATCTCAAGTTGGTTG
CAGTTACCTGTTGGTCCATTTGGAACAACAGAAACAAATATGTTCATGGAGAAACAATTCCAGATATCAATTTCAAAAGTCAGTGGATCGTGAAATACCTGGAGGATTTT
CGAAAGGCGAACCCTAGAAATTCGGAGAGATCGATGAACAATAGAGGATCTGAAAATACTTCTTCAAGACCAAATGAATGCTGGAAGCCTCCTCAGGTGGGATCGTGGAA
ATTGAACTGTGATGCCGCTTGTGTGGTCAATCCCCCGTCGACTGGTGTTGGAGCCATCTGTAGAAATAAAGATGGGAAAGCAATGGCAGCTGAAGCGAAATATCAAGACT
TTTATTTGGATCCTTTATCAGCAGAACTTTAG
mRNA sequence
ATGATTCCCACATGTTCCCGTCACGATCCTCACGATCTAAGCTCCAAAAATCAACTCCTCACGATCTCCTCAGTATTCTGGCCGAGTTTTGCTCTCAGTAGTCGAAATTT
TCTCTCCAAAGAAGCTCCTAAAAAAAGACTGCATCGTTTAGAATTTATACTGCCCATTCTGCCGCGACAGCATCGTGACGCTAAGGACACAGCGTCGCAACGCAGTCGCG
TTGCGTGCCCTAATAATTCTGGAAAGTGCCGCAGCGTCGAGACGCTGTGGAGCAATCTGATCAATGTAGTCCTTTCGGAAGATAGGGATGTTTTTCGGATGGTCAGCACG
TTTCAAAATGGTACGTACAGGGATTTCTATGTAGTTGTCGACGTGCAGGATGTAGATACTGGTATACACAATCGGGGGCAGTTGCAAGTGAAGGGGATGAAAGCTCTGTC
GCAGCGGAAGCGCATTGATAGGACCTTGCGTCGGATCCCGCCCTCTCACTGGCCTGAGAGGGACTTTATGTTTATTGGTTGGACCATAAACAGGTTGTTCATTAGAGGAG
CACTGGTACTTAAGGAGCTAGAGGTAACTCAGAGGTCCATTAGGTCCCCTGCTAGCTCGTCAAGACCGCTTCTCGGTTCTTGGAGGAATCCCCTGAAAATCTGGGGCTAC
ATCTGCACTATTGCTAGAAGTCAAGAAGCACAACGAGTGGGAGATATGGGAACTGCAATGGAAATAGACAATCTGATTACCGAATGGAGGAAATTTAACCTGAAAGATGA
TGAGAAAACAACAAAAATTACCTTCAATCGAGATGAGGCCAAAACAATTGATAAGCAGATGGAGGTATGCCTGGCTGAAAAACTCCTGACAAACAGAAACATCACTAAAA
CAACTCTCAAGAATGCATTAACCGGAGCATGGAAAACAAGATACGATTTTGATATAGAAACAGTTGGTAAGAATACTTATCTGTTCAAATTTGAGAATAAAGTGGAGAAA
GATTGGATAAAGAACAATGGACCCTGGCTGTTTGATAGGAATTTGCTAATTCTTGAGAACCCATCTGCAAGACAAAGAACATCAAGAAAGATTGGGAACATCTTGGCAGA
GTATATCGATTGGGACAACAATAAAGAAAAGACTCCTTGGGGGAATAGCATAAGAATAAGAGTTAAATTGGATGTGACTAGACCTCTTCGAAGAGGTCTTATGCTTCAAA
CAGATGGAGTCAGTGAGGATTGCTGGGTCTCTATTAGATACGAGAGACTCCCCGACTTTTGCTTCGGATGTGGCAGAATAGGGCACGTTGCTAAAGGATGTCTCGAGATG
AATGATAGTGAGGAAGCAAGAAAAAACAGTTTAGAGTATGGGGCATGGTTAAAATTTCAAGGATTTGCCAGAATTATGAGTAAAACAGACAACCCAATCAATAAAGAAAG
AGAGACAAATGAACAAGATCTAAGCACAAAAGATGATATTGGGAGAAATAAGAAACAAGGGGAAAGGCTGAATGTGGATCTTAATAAAGACAGTCCAATAATTGAAGATT
TGATCAACCTGGAAAGCAGAAGCAAAGAAGCAAAGACATGTGATGATGAGATTGACATTCTAAGAATTAATGAGAATAGCGAAGACTGGACTACAAGCATGAATGTAAAT
GGGACTGAGTGGACTGTTAACCAGAAAGAAGGAAATCATATTGAGGTAGACGGCAGTAACAAAGTATCATCAATTATGGAAATTAGCCAAAGTGCTACGGGGAACAATTC
AAAGAAGACAAGCTGGAAAAGGAAAATGAGGACTGATCATACAGAGACCCCAAAGATGAAACTTGGCTGTGGAACAGCCCTGCCTTCAGCCATGAATCTCATTTGCTGGA
ATGCTCGGGGTTTGGGGAACCCGAGAGCATTTCGATCACTTAAACACCTTGTAATAACAAGGAGACCCCAAGTTCTCTTTCTATGTGAGACAAAGTGTGACGAGAAAGTT
ACTAATAGAATCAAGATTTCTTGCAATTTCGATGACTGCTTTACTGTTGACAGCTTAGGAGCTAAAGGAGGACTCTGCATCTTATGGAAAGATCAGAATTCATTGGCCAT
TAAATCCTTTTCAAATAACCACATAGACACAGAGATTACATGGCAAGAGAAGGTTTGGAGATTTACTGGAGTATATGGATTTCCTGAAATCAGTAAGAAAAAACAAACAT
GGGACTTGATATGTAGGCTCTATGCTGGAGGTAATTCCCCTTGGCTCTTGGGAGGGGACCTAAACGAAATTTTGTGTCATGAGGAGAAGTTGGGAGGCCCTCCAAGAGAT
AGATCATTAATGGAAGATTTCAAGCATTGCTTAGATTACTGTGGCTTGAGAGATTTAAAGCCCCAAGGTGAGCTTTTTACATGGCATGGGAATCGGAGAGGCACTCAAAT
TTGGGAAAGGATCGACCGCTTTCTTTGCAACCCCGACTTTGACACTATGTTTAATTTCTCTAGTTCGAGGAATTTGGATTGGTTATTTTCAGATCATAAACCAATTGAGA
TATGCATCAGAAGCAGAAGCAGCAACTTTGGTGGAAGAAGAACAAACCAGTTCAAATTTGAAGAAGTATGGACAAAGTATGAAGAATGTTCAGAGTTAATTACCAAGAAT
GGTGATTGGAAAGGGGTATCGATATTCCTATGCATTGTCCTCTATGCAATTATCAATATGAAACTTCCGACCACTGTCTTTTTTTATTGTAAAAGGGCAAAAGATATTTG
GAAACTAACCTTTGATCATGTTTTTTTGGAGGAGGATTTCCGTGGTAGCGTGATGGATAGATGGTTTAAACTCAACGAGACCTTGTCCATGGAGAATCTCAAGTTGGTTG
CAGTTACCTGTTGGTCCATTTGGAACAACAGAAACAAATATGTTCATGGAGAAACAATTCCAGATATCAATTTCAAAAGTCAGTGGATCGTGAAATACCTGGAGGATTTT
CGAAAGGCGAACCCTAGAAATTCGGAGAGATCGATGAACAATAGAGGATCTGAAAATACTTCTTCAAGACCAAATGAATGCTGGAAGCCTCCTCAGGTGGGATCGTGGAA
ATTGAACTGTGATGCCGCTTGTGTGGTCAATCCCCCGTCGACTGGTGTTGGAGCCATCTGTAGAAATAAAGATGGGAAAGCAATGGCAGCTGAAGCGAAATATCAAGACT
TTTATTTGGATCCTTTATCAGCAGAACTTTAG
Protein sequence
MIPTCSRHDPHDLSSKNQLLTISSVFWPSFALSSRNFLSKEAPKKRLHRLEFILPILPRQHRDAKDTASQRSRVACPNNSGKCRSVETLWSNLINVVLSEDRDVFRMVST
FQNGTYRDFYVVVDVQDVDTGIHNRGQLQVKGMKALSQRKRIDRTLRRIPPSHWPERDFMFIGWTINRLFIRGALVLKELEVTQRSIRSPASSSRPLLGSWRNPLKIWGY
ICTIARSQEAQRVGDMGTAMEIDNLITEWRKFNLKDDEKTTKITFNRDEAKTIDKQMEVCLAEKLLTNRNITKTTLKNALTGAWKTRYDFDIETVGKNTYLFKFENKVEK
DWIKNNGPWLFDRNLLILENPSARQRTSRKIGNILAEYIDWDNNKEKTPWGNSIRIRVKLDVTRPLRRGLMLQTDGVSEDCWVSIRYERLPDFCFGCGRIGHVAKGCLEM
NDSEEARKNSLEYGAWLKFQGFARIMSKTDNPINKERETNEQDLSTKDDIGRNKKQGERLNVDLNKDSPIIEDLINLESRSKEAKTCDDEIDILRINENSEDWTTSMNVN
GTEWTVNQKEGNHIEVDGSNKVSSIMEISQSATGNNSKKTSWKRKMRTDHTETPKMKLGCGTALPSAMNLICWNARGLGNPRAFRSLKHLVITRRPQVLFLCETKCDEKV
TNRIKISCNFDDCFTVDSLGAKGGLCILWKDQNSLAIKSFSNNHIDTEITWQEKVWRFTGVYGFPEISKKKQTWDLICRLYAGGNSPWLLGGDLNEILCHEEKLGGPPRD
RSLMEDFKHCLDYCGLRDLKPQGELFTWHGNRRGTQIWERIDRFLCNPDFDTMFNFSSSRNLDWLFSDHKPIEICIRSRSSNFGGRRTNQFKFEEVWTKYEECSELITKN
GDWKGVSIFLCIVLYAIINMKLPTTVFFYCKRAKDIWKLTFDHVFLEEDFRGSVMDRWFKLNETLSMENLKLVAVTCWSIWNNRNKYVHGETIPDINFKSQWIVKYLEDF
RKANPRNSERSMNNRGSENTSSRPNECWKPPQVGSWKLNCDAACVVNPPSTGVGAICRNKDGKAMAAEAKYQDFYLDPLSAEL
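
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch of how that correspondence could be checked, assuming the standard genetic code applies to this gene; the `translate` function and `CODON_TABLE` name are illustrative and not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code (an assumption for this record), built in the
# conventional T, C, A, G base order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence pasted
    line by line from the record can be passed in directly.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS listed above.
print(translate("ATGATTCCCACATGTTCC"))
print(translate("GCAGAACTTTAG"))
```

Run on the first six codons of the CDS, this gives `MIPTCS`, and on the last four codons it gives `AEL`, which match the start and end of the protein sequence listed above. Passing the full CDS and comparing the result with the full protein sequence would extend the same check to the whole record.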