; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G12165 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G12165
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr11:20673724..20675770
RNA-Seq ExpressionClc11G12165
SyntenyClc11G12165
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]2.4e-11237.35Show/hide
Query:  LVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQN-QDEIIQERAPRNHLPQRHFEENQRPRHI-LQERHQEFQERDSSSEYDQEEV--FHEPR
        +V R++   A     +  P  +Q WGRRGR G    R  +N Q+ II E        Q    + +R R I  Q R +E +   SS E D +++  F   R
Subjt:  LVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQN-QDEIIQERAPRNHLPQRHFEENQRPRHI-LQERHQEFQERDSSSEYDQEEV--FHEPR

Query:  RVREERYEARGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERF
         V+ ER +      +YKMKID+P Y+GK +IE FLDW+KN EN  +YM T  +KKV L+A KLKGGA AW DQ+ +NRQ+QGK PIR+WEKMK+LMK+RF
Subjt:  RVREERYEARGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERF

Query:  LPPNYEQTLYNNIKTAGKALGQW--LIMRIPPIGSNQ-------------LGGKRTTPNCEVHRRPKNGHQR-----EGINDMLQEDIK-----------
        +PPNYEQTLY   +   + + +    I     +G                +GG R     +V  +P           E + +M++   K           
Subjt:  LPPNYEQTLYNNIKTAGKALGQW--LIMRIPPIGSNQ-------------LGGKRTTPNCEVHRRPKNGHQR-----EGINDMLQEDIK-----------

Query:  -KDYVGCIHKKEKETQEGAIE------KKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA------------------------------
         K       K +  T E  +E      KK     ++K +NPY RP  G  +RCGQ GH S +CPQR+ +A                              
Subjt:  -KDYVGCIHKKEKETQEGAIE------KKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA------------------------------

Query:  ----YMTLSLQDKMHNQWK-----SVQCDHR---------QWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVC
               + +  K  NQ +       +C  +           SSENFV+KKL+ ALNLK + H  PYKI W++KGG+               + KDQ+VC
Subjt:  ----YMTLSLQDKMHNQWK-----SVQCDHR---------QWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVC

Query:  DELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLE
        D +EMDVCHILLG+PWQ+  +++H+GRENTYEF WM KKV+L+PL K+ D+ +  K +    LF TISGK  L+ER+ +ILG+ ++     T+D ++   
Subjt:  DELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLE

Query:  VQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSCSTNPIHAQKR---WKL
        ++ LF ++ ++ ++P  LPPLRDI H+++L+ G + PHLPHY MSP EY+IL   IE+LLKKGHI+PS   C    +   K+   W++
Subjt:  VQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSCSTNPIHAQKR---WKL

TYK11204.1 reverse transcriptase [Cucumis melo var. makuwa]5.4e-10442.63Show/hide
Query:  METLVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRR
        +ETL RRM+ L A  +   +A     +  R GR      R + NQ    ++R   N  P R+ + +    +   E  Q  QE DSSS  +Q  ++++ R 
Subjt:  METLVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRR

Query:  VREER----YEARGD-PHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLM
        ++  +    +EA+ +  HDYKMKID+P Y GK D+E+FLDWI+N EN  +YM+T D KKV L+A KL+GGA AW DQ+EINRQR GK PI +WEKMK+LM
Subjt:  VREER----YEARGD-PHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLM

Query:  KERFLPPNYEQTLYN---NIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG-----CIHKKEKETQEGAI
        K RFLPPNYEQTLYN   N +   K++ ++ I     + +      +T       R      +     D  ++    +         + +K K+ +    
Subjt:  KERFLPPNYEQTLYN---NIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG-----CIHKKEKETQEGAI

Query:  EKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYMTLSLQDKMHNQWKSVQ-----------CD--HRQWSSENFVTKKLMAALNLKVE
         KK +   K K QN YNRPSLGK FRC QT +LS  CPQR+ +A +     D   +  K+V+           CD      SSENF+ K L+  LNLK E
Subjt:  EKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYMTLSLQDKMHNQWKSVQ-----------CD--HRQWSSENFVTKKLMAALNLKVE

Query:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN
         H +P+KI WV KGG+                 KDQI+CD ++MDVCH+LLG+PWQ+ T+TLHKGRENTYEF WMGKK+VL+PL+ KN+   ++K +G  
Subjt:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN

Query:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLI
         LFTT+ GK L+ ER+ DIL L V +KSI  Q   +  E+Q L  EF  LK++P GLP LRDIQH +DLI
Subjt:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLI

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]2.1e-9538.73Show/hide
Query:  PTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRRVREER----YEARGDP-HDYK
        P  I+ + R   +  GR R  +N      +R     +P ++ ++N +  +   E  Q  Q+ DSS   +Q  ++++    R  +     EAR +  HDYK
Subjt:  PTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRRVREER----YEARGDP-HDYK

Query:  MKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAW--SDQLEINRQ-------RQGKRPIRTW-EKMKRLMKERFLPPNYEQ
        MKID+P YNGK DIE+FLDWIKN EN   YM  PD KKV L+A KLKGGA AW  S    +NR        RQG + +  + E+  RL     L  N + 
Subjt:  MKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAW--SDQLEINRQ-------RQGKRPIRTW-EKMKRLMKERFLPPNYEQ

Query:  TLYNNIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG---------CIHKKEKETQEGAIEKKTKTTFKQ
         +   I      + + + +    + S  +    T     +  R KN ++R           KK   G          +  K K        KK ++  + 
Subjt:  TLYNNIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG---------CIHKKEKETQEGAIEKKTKTTFKQ

Query:  KSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA-------YMTLSLQDKMHNQWKSVQCD----------------HRQWSSENFVTKKLMAALNLKVE
        K+QN Y RPSLGK FRCG+ GHLS  C QR+ +A       YM+    ++   + + ++ D                  + + ++    K    +N KV 
Subjt:  KSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA-------YMTLSLQDKMHNQWKSVQCD----------------HRQWSSENFVTKKLMAALNLKVE

Query:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN
         H  PYKI WV+KGG+               + KDQIVCD +EMDVCH+LLG+PWQ+ T+TLH+GRENTYEFQWMGKKV+L+PL KKN E  +++++   
Subjt:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN

Query:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLK
         LF T+SGK+LLKER+ D+LGL V +KS       V   ++ LF EF  LK++P+GLPPLRDIQH +DL+   +LP+LPHY MSP+EY++L  HIEDLLK
Subjt:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLK

Query:  KGHIQPSLRSCS
        KGHI+PSL  C+
Subjt:  KGHIQPSLRSCS

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]2.3e-10241.4Show/hide
Query:  LVRRMELLTAAPQHNEDAPTFIQEWGR-RGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEV--FHEPRR
        L+ R E     P  NE      Q +G  RGR   G  R + N    +Q R P   + Q+  E+ Q      QE  QE QE DSSS  +Q  +  F++  R
Subjt:  LVRRMELLTAAPQHNEDAPTFIQEWGR-RGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEV--FHEPRR

Query:  V--REERYEA-RGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKE
             +R EA RG+ HDYKMKID+P Y+GK +IE FLDWIK+ EN  +YM+TP+ KKV L+A KL+ GA AW DQLEINRQR GK+P+R+WEKMK+L+K 
Subjt:  V--REERYEA-RGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKE

Query:  RFLPPNYEQTLYN---NIKTAGKALGQWL-----IMRIPPIGSNQ-------LGGKRTTPNCEVHRRP----------------------KNGHQREGIN
        RFLPPNYEQTLYN   N +   +++ +++     +     +  N+       +GG R     +V  +P                      KN ++R    
Subjt:  RFLPPNYEQTLYN---NIKTAGKALGQWL-----IMRIPPIGSNQ-------LGGKRTTPNCEVHRRP----------------------KNGHQREGIN

Query:  DMLQEDIKKDYVGCIHK---KEKETQEGAIEKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYM---------TLSLQDKMH------
            +    D      K   KE + QE A+E+K + TFK   QN Y+RPSLGK FRCGQTGHLS  CPQR+ +A           ++  +++        
Subjt:  DMLQEDIKKDYVGCIHK---KEKETQEGAIEKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYM---------TLSLQDKMH------

Query:  ---------------NQWKSVQ---------------CD--HRQWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQ
                        + K++Q               CD      SSENFV KKL+  LNLK EAH +PYKI WVRKGG+               A KDQ
Subjt:  ---------------NQWKSVQ---------------CD--HRQWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQ

Query:  IVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEV
        IVCD +EMDVCH+LLG+PWQY T++LHKGRENTYEFQWMG+KVVL+P+ KK +EG+     G   LF T+SGK++LKER+ DILGL V EK+ + Q  ++
Subjt:  IVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEV

Query:  SLEVQTLFDEFTQLKE
          ++Q L  EF  +K+
Subjt:  SLEVQTLFDEFTQLKE

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]5.7e-13047.06Show/hide
Query:  RGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERFLPPNYEQTL
        RG+ HDYKMKID+P Y+GK +IE FLDWIK+ EN  +YM+TP+ KKV L+A KL+ GA AW DQLEINRQR GK+PIR+WEKMK+L+K RFLPPNYEQTL
Subjt:  RGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERFLPPNYEQTL

Query:  YN---NIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHR---------------RPKNGHQREGINDMLQEDIKKDYVGCIHK---KEKETQEGAI
        YN   N +   +++  ++         ++L  +      E H+               R KN ++R        +    D      K   KE + QE A+
Subjt:  YN---NIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHR---------------RPKNGHQREGINDMLQEDIKKDYVGCIHK---KEKETQEGAI

Query:  EKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYM---------TLSLQDK---------------------MHNQWKSVQ--------
        E+K + TFK   QN Y+RPSLGK FRCGQTGHLS  CPQR+ +A           ++  +++                     M  + K++Q        
Subjt:  EKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYM---------TLSLQDK---------------------MHNQWKSVQ--------

Query:  -------CD--HRQWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGR
               CD      SSENFV KKL+  LNLK EAH +PYKI WVRKGG+               A KDQIVCD +EMDVCH+LLG+PWQY T++LHKGR
Subjt:  -------CD--HRQWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGR

Query:  ENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHH
        ENTYEFQWMG+KVVL+P+ KK +EG+     G   LF T+SGK +LKER+  ILGL V EK+ + Q  ++  ++Q L  EF  +KE+PKGLPPLRDIQHH
Subjt:  ENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHH

Query:  VDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSCSTNPIHAQKR---WKL
        +DLI G +LP+L HY MSP+EY+IL  HIE+LLKKGHI+PSL  C+   +   K+   W++
Subjt:  VDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSCSTNPIHAQKR---WKL

TrEMBL top hitse value%identityAlignment
A0A5A7T256 Reverse transcriptase1.8e-7647.04Show/hide
Query:  KKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA-------YMTLSLQDKMHNQWKSVQCD----------------HRQWSSENFVTKKL
        KK ++  + K+QN Y RPSLGK FRCG+ GHLS  C QR+ +A       YM+ + +++   + + ++ D                  + + ++    K 
Subjt:  KKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA-------YMTLSLQDKMHNQWKSVQCD----------------HRQWSSENFVTKKL

Query:  MAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEG
           +N KV  H  PYKI WV+KGG+               + KDQIVCD +EMDVCH+LLG+PWQ+ T+TLH+GRENTYEFQWMGKKV+L+PL KKN E 
Subjt:  MAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEG

Query:  VNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEIL
         +++++    LF T+SGK+LLKER+ D+LGL V +KS       V   ++ LF EF  LK++P+GLPPLRDIQH +DL+   +LP+LPHY MSP+EY++L
Subjt:  VNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEIL

Query:  RQHIEDLLKKGHIQPSLRSCS
          HIEDLLKKGHI+PSL  C+
Subjt:  RQHIEDLLKKGHIQPSLRSCS

A0A5B7BER3 Uncharacterized protein2.5e-8334.25Show/hide
Query:  DQEEVFHEPRRVREERYEARG-DPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTW
        D +E F +    R    + RG +  +Y+MKID+P +NG   IE+FLDWI   E     M   D K+V+L+A+KLKGGA AW DQ++ NR+RQGK+P+RTW
Subjt:  DQEEVFHEPRRVREERYEARG-DPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTW

Query:  EKMKRLMKERFLPPNYEQTL---YNNIKTAGKALGQWLIMRIPPIGSNQL------------GGKRTTPNCEVH-------------------------R
        +KM+RL++ERFLP +YEQ L   Y N +  G+++ ++          N L            GG R T   +++                          
Subjt:  EKMKRLMKERFLPPNYEQTL---YNNIKTAGKALGQWLIMRIPPIGSNQL------------GGKRTTPNCEVH-------------------------R

Query:  RPKNGHQREGINDMLQEDIKKDYVGCIHKKEKET-QEGAIEKKTKTT---FKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYMTLS--------
        R +N  +    +   Q++  K   G + + +K T ++ A   K + T     QKS NPY RP  GK FRC Q GH S ECP RR V  + ++        
Subjt:  RPKNGHQREGINDMLQEDIKKDYVGCIHKKEKET-QEGAIEKKTKTT---FKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYMTLS--------

Query:  ---------------------------------------LQDKMHNQWKSVQCDHRQ---------WSSENFVTKKLMAALNLKVEAHSSPYKISWVRKG
                                               +  + HN +++ +C   Q          SSEN V+K L+ AL LK E H +PYKI W++KG
Subjt:  ---------------------------------------LQDKMHNQWKSVQCDHRQ---------WSSENFVTKKLMAALNLKVEAHSSPYKISWVRKG

Query:  ---------------GKAQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKE
                       GK  KD++ CD ++MD CH+LLG+PWQ+     HKG++NTY F W  KKVVL+P  K ++     K EG + L  T++G   +++
Subjt:  ---------------GKAQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKE

Query:  RK-ADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQL--KEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSC
         K A  + + + +     +  +V   +Q L  EF  +   E P  LPP+RDIQHH+DL+ G +LP+LPHY MSPKE EIL+Q +EDL+ KG IQ S+  C
Subjt:  RK-ADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQL--KEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSC

Query:  STNPIHAQKR---WKL
        +   +   K+   W++
Subjt:  STNPIHAQKR---WKL

A0A5D3CJ99 Reverse transcriptase2.6e-10442.63Show/hide
Query:  METLVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRR
        +ETL RRM+ L A  +   +A     +  R GR      R + NQ    ++R   N  P R+ + +    +   E  Q  QE DSSS  +Q  ++++ R 
Subjt:  METLVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRR

Query:  VREER----YEARGD-PHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLM
        ++  +    +EA+ +  HDYKMKID+P Y GK D+E+FLDWI+N EN  +YM+T D KKV L+A KL+GGA AW DQ+EINRQR GK PI +WEKMK+LM
Subjt:  VREER----YEARGD-PHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLM

Query:  KERFLPPNYEQTLYN---NIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG-----CIHKKEKETQEGAI
        K RFLPPNYEQTLYN   N +   K++ ++ I     + +      +T       R      +     D  ++    +         + +K K+ +    
Subjt:  KERFLPPNYEQTLYN---NIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG-----CIHKKEKETQEGAI

Query:  EKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYMTLSLQDKMHNQWKSVQ-----------CD--HRQWSSENFVTKKLMAALNLKVE
         KK +   K K QN YNRPSLGK FRC QT +LS  CPQR+ +A +     D   +  K+V+           CD      SSENF+ K L+  LNLK E
Subjt:  EKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVAYMTLSLQDKMHNQWKSVQ-----------CD--HRQWSSENFVTKKLMAALNLKVE

Query:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN
         H +P+KI WV KGG+                 KDQI+CD ++MDVCH+LLG+PWQ+ T+TLHKGRENTYEF WMGKK+VL+PL+ KN+   ++K +G  
Subjt:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN

Query:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLI
         LFTT+ GK L+ ER+ DIL L V +KSI  Q   +  E+Q L  EF  LK++P GLP LRDIQH +DLI
Subjt:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLI

A0A5D3DGR0 Reverse transcriptase1.2e-11237.35Show/hide
Query:  LVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQN-QDEIIQERAPRNHLPQRHFEENQRPRHI-LQERHQEFQERDSSSEYDQEEV--FHEPR
        +V R++   A     +  P  +Q WGRRGR G    R  +N Q+ II E        Q    + +R R I  Q R +E +   SS E D +++  F   R
Subjt:  LVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQN-QDEIIQERAPRNHLPQRHFEENQRPRHI-LQERHQEFQERDSSSEYDQEEV--FHEPR

Query:  RVREERYEARGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERF
         V+ ER +      +YKMKID+P Y+GK +IE FLDW+KN EN  +YM T  +KKV L+A KLKGGA AW DQ+ +NRQ+QGK PIR+WEKMK+LMK+RF
Subjt:  RVREERYEARGDPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERF

Query:  LPPNYEQTLYNNIKTAGKALGQW--LIMRIPPIGSNQ-------------LGGKRTTPNCEVHRRPKNGHQR-----EGINDMLQEDIK-----------
        +PPNYEQTLY   +   + + +    I     +G                +GG R     +V  +P           E + +M++   K           
Subjt:  LPPNYEQTLYNNIKTAGKALGQW--LIMRIPPIGSNQ-------------LGGKRTTPNCEVHRRPKNGHQR-----EGINDMLQEDIK-----------

Query:  -KDYVGCIHKKEKETQEGAIE------KKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA------------------------------
         K       K +  T E  +E      KK     ++K +NPY RP  G  +RCGQ GH S +CPQR+ +A                              
Subjt:  -KDYVGCIHKKEKETQEGAIE------KKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA------------------------------

Query:  ----YMTLSLQDKMHNQWK-----SVQCDHR---------QWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVC
               + +  K  NQ +       +C  +           SSENFV+KKL+ ALNLK + H  PYKI W++KGG+               + KDQ+VC
Subjt:  ----YMTLSLQDKMHNQWK-----SVQCDHR---------QWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGK---------------AQKDQIVC

Query:  DELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLE
        D +EMDVCHILLG+PWQ+  +++H+GRENTYEF WM KKV+L+PL K+ D+ +  K +    LF TISGK  L+ER+ +ILG+ ++     T+D ++   
Subjt:  DELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLE

Query:  VQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSCSTNPIHAQKR---WKL
        ++ LF ++ ++ ++P  LPPLRDI H+++L+ G + PHLPHY MSP EY+IL   IE+LLKKGHI+PS   C    +   K+   W++
Subjt:  VQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLKKGHIQPSLRSCSTNPIHAQKR---WKL

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X11.0e-9538.73Show/hide
Query:  PTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRRVREER----YEARGDP-HDYK
        P  I+ + R   +  GR R  +N      +R     +P ++ ++N +  +   E  Q  Q+ DSS   +Q  ++++    R  +     EAR +  HDYK
Subjt:  PTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRRVREER----YEARGDP-HDYK

Query:  MKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAW--SDQLEINRQ-------RQGKRPIRTW-EKMKRLMKERFLPPNYEQ
        MKID+P YNGK DIE+FLDWIKN EN   YM  PD KKV L+A KLKGGA AW  S    +NR        RQG + +  + E+  RL     L  N + 
Subjt:  MKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAW--SDQLEINRQ-------RQGKRPIRTW-EKMKRLMKERFLPPNYEQ

Query:  TLYNNIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG---------CIHKKEKETQEGAIEKKTKTTFKQ
         +   I      + + + +    + S  +    T     +  R KN ++R           KK   G          +  K K        KK ++  + 
Subjt:  TLYNNIKTAGKALGQWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVG---------CIHKKEKETQEGAIEKKTKTTFKQ

Query:  KSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA-------YMTLSLQDKMHNQWKSVQCD----------------HRQWSSENFVTKKLMAALNLKVE
        K+QN Y RPSLGK FRCG+ GHLS  C QR+ +A       YM+    ++   + + ++ D                  + + ++    K    +N KV 
Subjt:  KSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMVA-------YMTLSLQDKMHNQWKSVQCD----------------HRQWSSENFVTKKLMAALNLKVE

Query:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN
         H  PYKI WV+KGG+               + KDQIVCD +EMDVCH+LLG+PWQ+ T+TLH+GRENTYEFQWMGKKV+L+PL KKN E  +++++   
Subjt:  AHSSPYKISWVRKGGK---------------AQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVLIPLNKKNDEGVNVKKEGNN

Query:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLK
         LF T+SGK+LLKER+ D+LGL V +KS       V   ++ LF EF  LK++P+GLPPLRDIQH +DL+   +LP+LPHY MSP+EY++L  HIEDLLK
Subjt:  HLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEILRQHIEDLLK

Query:  KGHIQPSLRSCS
        KGHI+PSL  C+
Subjt:  KGHIQPSLRSCS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15180.1 Zinc knuckle (CCHC-type) family protein9.5e-0634.78Show/hide
Query:  FLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERFLP
        +L W  N      + +T    K+ +   +LKG A  W DQ E NR  + + PIRTWE++K  M  ++ P
Subjt:  FLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERFLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACACTGGTAAGAAGAATGGAATTACTTACTGCAGCCCCTCAACACAATGAAGATGCCCCGACATTCATTCAAGAATGGGGGAGGAGGGGAAGACATGGTGCTGG
GCGGCCAAGATTCGTGCAGAATCAAGATGAGATAATCCAAGAAAGGGCCCCAAGAAATCATTTGCCTCAGAGACATTTTGAAGAAAATCAAAGACCACGACATATTCTTC
AAGAAAGACATCAAGAATTCCAAGAACGGGATTCTTCGAGTGAATATGATCAAGAAGAAGTGTTTCATGAGCCAAGAAGAGTACGAGAAGAACGATATGAAGCTAGAGGC
GACCCGCATGATTACAAAATGAAAATCGACATGCCATTCTACAATGGTAAAGGTGACATTGAAACATTCCTTGATTGGATCAAGAATGCCGAGAATTTATGCAGCTATAT
GAATACACCGGATCACAAAAAGGTGCGTTTGTTGGCCTTTAAGTTGAAGGGGGGAGCTTTCGCTTGGTCGGATCAATTAGAGATTAACCGTCAAAGACAAGGCAAGAGAC
CAATTCGAACATGGGAGAAAATGAAGCGGTTGATGAAAGAAAGGTTCCTCCCTCCCAATTACGAGCAGACACTGTACAATAATATCAAAACTGCAGGCAAGGCCCTAGGA
CAGTGGCTTATTATGAGAATTCCACCGATTGGGAGCAACCAACTTGGCGGAAAACGAACAACACCTAACTGCGAGGTTCATCGGCGACCTAAGAATGGTCATCAAAGAGA
AGGAATTAATGATATGCTCCAAGAAGATATCAAGAAAGACTATGTCGGGTGCATCCACAAAAAAGAGAAGGAAACTCAAGAAGGAGCTATTGAGAAGAAAACCAAGACAA
CATTCAAACAGAAGTCTCAAAATCCTTACAATCGGCCCTCATTGGGCAAAGGCTTCCGATGTGGCCAAACTGGCCATCTATCCTATGAGTGCCCTCAAAGGAGAATGGTG
GCTTATATGACATTGTCTCTTCAAGACAAGATGCACAATCAATGGAAAAGTGTGCAATGTGATCACAGACAGTGGAGTAGTGAGAATTTCGTCACCAAGAAGCTAATGGC
AGCCTTGAATCTCAAGGTAGAAGCTCATTCGAGTCCTTACAAGATCAGTTGGGTGAGAAAAGGTGGCAAAGCCCAAAAAGACCAAATAGTGTGTGATGAACTAGAAATGG
ATGTATGTCACATACTATTGGGAAAGCCATGGCAATATGGCACTCGAACCTTGCATAAGGGGAGAGAGAATACATATGAATTTCAATGGATGGGCAAGAAAGTGGTATTA
ATCCCATTAAACAAGAAGAATGATGAAGGAGTGAATGTGAAAAAGGAAGGCAACAACCATTTATTCACTACAATCAGTGGCAAATCACTTTTAAAGGAAAGAAAGGCTGA
TATTTTAGGACTTGCTGTTGCTGAAAAGTCCATAGATACCCAAGACAATGAAGTTTCTTTAGAAGTACAAACGTTGTTTGATGAATTTACACAACTTAAAGAAAAACCCA
AGGGACTACCTCCCTTACGAGACATTCAACATCACGTTGATTTGATTCTGGGAGATACACTACCTCATCTTCCTCACTATGGAATGAGTCCTAAAGAATACGAAATACTG
CGTCAACATATTGAAGACTTGCTTAAAAAAGGACACATCCAACCGAGCTTGAGGTCATGCAGTACCAACCCTATTCACGCCCAAAAAAGATGGAAGTTGGAGGATGTGCG
TGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAACACTGGTAAGAAGAATGGAATTACTTACTGCAGCCCCTCAACACAATGAAGATGCCCCGACATTCATTCAAGAATGGGGGAGGAGGGGAAGACATGGTGCTGG
GCGGCCAAGATTCGTGCAGAATCAAGATGAGATAATCCAAGAAAGGGCCCCAAGAAATCATTTGCCTCAGAGACATTTTGAAGAAAATCAAAGACCACGACATATTCTTC
AAGAAAGACATCAAGAATTCCAAGAACGGGATTCTTCGAGTGAATATGATCAAGAAGAAGTGTTTCATGAGCCAAGAAGAGTACGAGAAGAACGATATGAAGCTAGAGGC
GACCCGCATGATTACAAAATGAAAATCGACATGCCATTCTACAATGGTAAAGGTGACATTGAAACATTCCTTGATTGGATCAAGAATGCCGAGAATTTATGCAGCTATAT
GAATACACCGGATCACAAAAAGGTGCGTTTGTTGGCCTTTAAGTTGAAGGGGGGAGCTTTCGCTTGGTCGGATCAATTAGAGATTAACCGTCAAAGACAAGGCAAGAGAC
CAATTCGAACATGGGAGAAAATGAAGCGGTTGATGAAAGAAAGGTTCCTCCCTCCCAATTACGAGCAGACACTGTACAATAATATCAAAACTGCAGGCAAGGCCCTAGGA
CAGTGGCTTATTATGAGAATTCCACCGATTGGGAGCAACCAACTTGGCGGAAAACGAACAACACCTAACTGCGAGGTTCATCGGCGACCTAAGAATGGTCATCAAAGAGA
AGGAATTAATGATATGCTCCAAGAAGATATCAAGAAAGACTATGTCGGGTGCATCCACAAAAAAGAGAAGGAAACTCAAGAAGGAGCTATTGAGAAGAAAACCAAGACAA
CATTCAAACAGAAGTCTCAAAATCCTTACAATCGGCCCTCATTGGGCAAAGGCTTCCGATGTGGCCAAACTGGCCATCTATCCTATGAGTGCCCTCAAAGGAGAATGGTG
GCTTATATGACATTGTCTCTTCAAGACAAGATGCACAATCAATGGAAAAGTGTGCAATGTGATCACAGACAGTGGAGTAGTGAGAATTTCGTCACCAAGAAGCTAATGGC
AGCCTTGAATCTCAAGGTAGAAGCTCATTCGAGTCCTTACAAGATCAGTTGGGTGAGAAAAGGTGGCAAAGCCCAAAAAGACCAAATAGTGTGTGATGAACTAGAAATGG
ATGTATGTCACATACTATTGGGAAAGCCATGGCAATATGGCACTCGAACCTTGCATAAGGGGAGAGAGAATACATATGAATTTCAATGGATGGGCAAGAAAGTGGTATTA
ATCCCATTAAACAAGAAGAATGATGAAGGAGTGAATGTGAAAAAGGAAGGCAACAACCATTTATTCACTACAATCAGTGGCAAATCACTTTTAAAGGAAAGAAAGGCTGA
TATTTTAGGACTTGCTGTTGCTGAAAAGTCCATAGATACCCAAGACAATGAAGTTTCTTTAGAAGTACAAACGTTGTTTGATGAATTTACACAACTTAAAGAAAAACCCA
AGGGACTACCTCCCTTACGAGACATTCAACATCACGTTGATTTGATTCTGGGAGATACACTACCTCATCTTCCTCACTATGGAATGAGTCCTAAAGAATACGAAATACTG
CGTCAACATATTGAAGACTTGCTTAAAAAAGGACACATCCAACCGAGCTTGAGGTCATGCAGTACCAACCCTATTCACGCCCAAAAAAGATGGAAGTTGGAGGATGTGCG
TGGATAG
Protein sequenceShow/hide protein sequence
METLVRRMELLTAAPQHNEDAPTFIQEWGRRGRHGAGRPRFVQNQDEIIQERAPRNHLPQRHFEENQRPRHILQERHQEFQERDSSSEYDQEEVFHEPRRVREERYEARG
DPHDYKMKIDMPFYNGKGDIETFLDWIKNAENLCSYMNTPDHKKVRLLAFKLKGGAFAWSDQLEINRQRQGKRPIRTWEKMKRLMKERFLPPNYEQTLYNNIKTAGKALG
QWLIMRIPPIGSNQLGGKRTTPNCEVHRRPKNGHQREGINDMLQEDIKKDYVGCIHKKEKETQEGAIEKKTKTTFKQKSQNPYNRPSLGKGFRCGQTGHLSYECPQRRMV
AYMTLSLQDKMHNQWKSVQCDHRQWSSENFVTKKLMAALNLKVEAHSSPYKISWVRKGGKAQKDQIVCDELEMDVCHILLGKPWQYGTRTLHKGRENTYEFQWMGKKVVL
IPLNKKNDEGVNVKKEGNNHLFTTISGKSLLKERKADILGLAVAEKSIDTQDNEVSLEVQTLFDEFTQLKEKPKGLPPLRDIQHHVDLILGDTLPHLPHYGMSPKEYEIL
RQHIEDLLKKGHIQPSLRSCSTNPIHAQKRWKLEDVRG