CuGenDBv2

Lag0008151 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008151
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:13284459..13285566
RNA-Seq Expression: Lag0008151
Synteny: Lag0008151
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
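The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this record states), the span can be parsed and its length computed as follows. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(text):
    """Split a 'chrom:start..end' string into its parts and a span length.

    Assumes 1-based, inclusive coordinates (an assumption for this sketch).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Inclusive coordinates: both endpoints count toward the length.
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr9:13284459..13285566")
print(chrom, start, end, length)  # chr9 13284459 13285566 1108
```

Under that assumption the gene spans 1108 bp on chr9.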


Homology
GenBank top hits    e value    % identity    Alignment
CAN65867.1 hypothetical protein VITISV_034935 [Vitis vinifera]    9.8e-41    35.62
Query:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT------------------NSW
        I KFDG +F YW+MQ++DYL  +K+H   L  KP+ M  ++W  LD + +  IR+ LSR VA  V  E T   LM+AL+                  NSW
Subjt:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT------------------NSW

Query:  KTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---R
        + M+  VSNSTG   LK+ ++ DL +AEEI R+ + + S  GSAL + T+G+      N+  S       +R K ++  +V+C+ C K GHFK QC   +
Subjt:  KTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---R

Query:  KFKEAQKRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPN
        K  E          +QD +L+ VDS         DW+LDS AS H    R +  ++  G  G V + +G      G+GDV +    G   +L  VR +P+
Subjt:  KFKEAQKRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPN

Query:  IKMNLISIGKLANDGYMCEF
        ++ NLIS+G+L ++G+   F
Subjt:  IKMNLISIGKLANDGYMCEF

RVX04667.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]    4.9e-40    36.33
Query:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------NSWKTMKTVVSN
        I KFDG +F YW+MQ++DYL  +K+H   L  KP+ M  ++W  LD + +  IR+ LSR VA  V  E T   LM+AL+         N    M+  VSN
Subjt:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------NSWKTMKTVVSN

Query:  STGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---RKFKEAQKRK
        STG   LK+ ++ DL +AEEI ++ + + S  GSAL + T+G+      N+  S       +R K ++  +V+C+ C K GHFK QC   +K  E     
Subjt:  STGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---RKFKEAQKRK

Query:  PEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIG
             +QD +L+ VDS         DW+LDS AS H  S R +  ++  G  G V + +G      G+GDV +    G   +L  VR++PN++ NLIS+G
Subjt:  PEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIG

Query:  KLANDGYMCEF
        +L ++G+   F
Subjt:  KLANDGYMCEF

TKS15174.1 hypothetical protein D5086_0000036030 [Populus alba]    3.0e-42    35.71
Query:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------------------
        I KFDG +FGYWKMQ++DYL  KK+H   L  KP+ M  ++W+ LD + +  I++ LSR VA  V  E +  KLMEAL+                     
Subjt:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------------------

Query:  ----------------NSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGK--DKVDEENELSS---SRKKWKNKNEVEC
                        +SW+ M+T VSNS G + LK+ ++ DL +AEE+ R+ S + S+ GSAL + T+G+  D+        S   S+ K+ ++ +VEC
Subjt:  ----------------NSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGK--DKVDEENELSS---SRKKWKNKNEVEC

Query:  FYCHKKGHFKSQCRKFKEAQKRKPEANI--MQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT
        + C K GHF   C K K+ +     A I  +QD +++ V S         +WILDS AS H      +  ++ GG HG+V + +G      GIGDV +KT
Subjt:  FYCHKKGHFKSQCRKFKEAQKRKPEANI--MQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT

Query:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF
          G    L++VR VP +K  LIS+G+L + G+   F
Subjt:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF

VFQ62075.1 unnamed protein product [Cuscuta campestris]    1.3e-40    35.87
Query:  KSLDEIMKFDGKNFGYWKMQVKDYLTCKKVHKALK-EKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALTN---------------
        KS   I KFDG +FG+WKMQ++DYL  K +H+ L   KP  MT++ W+  D +A+  I + L+++VA  +  E T   L++AL+N               
Subjt:  KSLDEIMKFDGKNFGYWKMQVKDYLTCKKVHKALK-EKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALTN---------------

Query:  -----SWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENE----LSSSRKKWKNKNEVECFYCHKKGHFKSQC
             SW T+   +S+S G+  LKF E+ D+ ++E I ++     S  GSAL V  +G+ K   +++     S +R K  N++ + C+ C  KGHFK+ C
Subjt:  -----SWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENE----LSSSRKKWKNKNEVECFYCHKKGHFKSQC

Query:  RKFKEAQKRKP--------EANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLV
        +K K+ Q +K          A  + D +++ VDS          WILDS AS H +S    F +F  G+ G V + + +     G GDVS+KT  G++  
Subjt:  RKFKEAQKRKP--------EANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLV

Query:  LRDVRFVPNIKMNLISIGKLANDGYMCEF
        L+DVR++P +K NLISIG+L N GY  EF
Subjt:  LRDVRFVPNIKMNLISIGKLANDGYMCEF

XP_034902342.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118039689 [Populus alba]    3.0e-42    35.71
Query:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------------------
        I KFDG +FGYWKMQ++DYL  KK+H   L  KP+ M  ++W+ LD + +  I++ LSR VA  V  E +  KLMEAL+                     
Subjt:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------------------

Query:  ----------------NSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGK--DKVDEENELSS---SRKKWKNKNEVEC
                        +SW+ M+T VSNS G + LK+ ++ DL +AEE+ R+ S + S+ GSAL + T+G+  D+        S   S+ K+ ++ +VEC
Subjt:  ----------------NSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGK--DKVDEENELSS---SRKKWKNKNEVEC

Query:  FYCHKKGHFKSQCRKFKEAQKRKPEANI--MQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT
        + C K GHF   C K K+ +     A I  +QD +++ V S         +WILDS AS H      +  ++ GG HG+V + +G      GIGDV +KT
Subjt:  FYCHKKGHFKSQCRKFKEAQKRKPEANI--MQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT

Query:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF
          G    L++VR VP +K  LIS+G+L + G+   F
Subjt:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF

TrEMBL top hits    e value    % identity    Alignment
A0A438J6T4 Retrovirus-related Pol polyprotein from transposon TNT 1-94    2.4e-40    36.33
Query:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------NSWKTMKTVVSN
        I KFDG +F YW+MQ++DYL  +K+H   L  KP+ M  ++W  LD + +  IR+ LSR VA  V  E T   LM+AL+         N    M+  VSN
Subjt:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------NSWKTMKTVVSN

Query:  STGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---RKFKEAQKRK
        STG   LK+ ++ DL +AEEI ++ + + S  GSAL + T+G+      N+  S       +R K ++  +V+C+ C K GHFK QC   +K  E     
Subjt:  STGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---RKFKEAQKRK

Query:  PEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIG
             +QD +L+ VDS         DW+LDS AS H  S R +  ++  G  G V + +G      G+GDV +    G   +L  VR++PN++ NLIS+G
Subjt:  PEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIG

Query:  KLANDGYMCEF
        +L ++G+   F
Subjt:  KLANDGYMCEF

A0A484KC47 CCHC-type domain-containing protein    6.2e-41    35.87
Query:  KSLDEIMKFDGKNFGYWKMQVKDYLTCKKVHKALK-EKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALTN---------------
        KS   I KFDG +FG+WKMQ++DYL  K +H+ L   KP  MT++ W+  D +A+  I + L+++VA  +  E T   L++AL+N               
Subjt:  KSLDEIMKFDGKNFGYWKMQVKDYLTCKKVHKALK-EKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALTN---------------

Query:  -----SWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENE----LSSSRKKWKNKNEVECFYCHKKGHFKSQC
             SW T+   +S+S G+  LKF E+ D+ ++E I ++     S  GSAL V  +G+ K   +++     S +R K  N++ + C+ C  KGHFK+ C
Subjt:  -----SWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENE----LSSSRKKWKNKNEVECFYCHKKGHFKSQC

Query:  RKFKEAQKRKP--------EANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLV
        +K K+ Q +K          A  + D +++ VDS          WILDS AS H +S    F +F  G+ G V + + +     G GDVS+KT  G++  
Subjt:  RKFKEAQKRKP--------EANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLV

Query:  LRDVRFVPNIKMNLISIGKLANDGYMCEF
        L+DVR++P +K NLISIG+L N GY  EF
Subjt:  LRDVRFVPNIKMNLISIGKLANDGYMCEF

A0A4U5QVL9 CCHC-type domain-containing protein    1.5e-42    35.71
Query:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------------------
        I KFDG +FGYWKMQ++DYL  KK+H   L  KP+ M  ++W+ LD + +  I++ LSR VA  V  E +  KLMEAL+                     
Subjt:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT---------------------

Query:  ----------------NSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGK--DKVDEENELSS---SRKKWKNKNEVEC
                        +SW+ M+T VSNS G + LK+ ++ DL +AEE+ R+ S + S+ GSAL + T+G+  D+        S   S+ K+ ++ +VEC
Subjt:  ----------------NSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGK--DKVDEENELSS---SRKKWKNKNEVEC

Query:  FYCHKKGHFKSQCRKFKEAQKRKPEANI--MQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT
        + C K GHF   C K K+ +     A I  +QD +++ V S         +WILDS AS H      +  ++ GG HG+V + +G      GIGDV +KT
Subjt:  FYCHKKGHFKSQCRKFKEAQKRKPEANI--MQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT

Query:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF
          G    L++VR VP +K  LIS+G+L + G+   F
Subjt:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF

A0A803L6Q2 Uncharacterized protein    1.1e-40    35.35
Query:  EIMKFDGKNFGYWKMQVKDYLTCKKVHKALKE-KPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALTN-------------------
        ++ KFDGK+FG+WKMQ++DYL  KK+++ L+E KP GM + +W+ LD +A+  IR+ LSR+VA  +A ETT   LM+AL+N                   
Subjt:  EIMKFDGKNFGYWKMQVKDYLTCKKVHKALKE-KPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALTN-------------------

Query:  ------SWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSALVMTKGKDKVDEENELSSS---------RKKWKNKNE------VECFY
              SW    T VS+S+GNN LKF +  DL ++EEI R+ S + S+       ++G++ + + NE   S         R K +N N       VEC+ 
Subjt:  ------SWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSALVMTKGKDKVDEENELSSS---------RKKWKNKNE------VECFY

Query:  CHKKGHFKSQCRKFKEAQKRKPEANIMQ----DVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT
        C K GH+K+QC+   + ++ K EAN+      D  L+C        +    W+LDS AS H  S+++ F  +   + G V +G+ +     G G+V +K 
Subjt:  CHKKGHFKSQCRKFKEAQKRKPEANIMQ----DVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKT

Query:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDG
          G    L+DV+ VP+++ NLIS+G+LA +G
Subjt:  ECGDKLVLRDVRFVPNIKMNLISIGKLANDG

A5BPB3 Uncharacterized protein    4.7e-41    35.62
Query:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT------------------NSW
        I KFDG +F YW+MQ++DYL  +K+H   L  KP+ M  ++W  LD + +  IR+ LSR VA  V  E T   LM+AL+                  NSW
Subjt:  IMKFDGKNFGYWKMQVKDYLTCKKVH-KALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALT------------------NSW

Query:  KTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---R
        + M+  VSNSTG   LK+ ++ DL +AEEI R+ + + S  GSAL + T+G+      N+  S       +R K ++  +V+C+ C K GHFK QC   +
Subjt:  KTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSAL-VMTKGKDKVDEENELSS-------SRKKWKNKNEVECFYCHKKGHFKSQC---R

Query:  KFKEAQKRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPN
        K  E          +QD +L+ VDS         DW+LDS AS H    R +  ++  G  G V + +G      G+GDV +    G   +L  VR +P+
Subjt:  KFKEAQKRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPN

Query:  IKMNLISIGKLANDGYMCEF
        ++ NLIS+G+L ++G+   F
Subjt:  IKMNLISIGKLANDGYMCEF

SwissProt top hits    e value    % identity    Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    9.0e-21    32.24
Query:  VAHETTEVKLMEALTNSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSALVMT-KGKDKVDEENELSSSRKKWKNKNEVE-----CF
        +  E   + L+ +L +S+  + T + +  G  T++  +V    +  E  R+   K    G AL+   +G+      N    S  + K+KN  +     C+
Subjt:  VAHETTEVKLMEALTNSWKTMKTVVSNSTGNNTLKFLEVCDLAIAEEICRQGSNKESTVGSALVMT-KGKDKVDEENELSSSRKKWKNKNEVE-----CF

Query:  YCHKKGHFKSQC---RKFK-EAQKRKPEANIM-----QDVVLVCVDSD---TRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTR
         C++ GHFK  C   RK K E   +K + N        D V++ ++ +      S   S+W++D+AAS H    R LF  +  G  G V+MGN   SK  
Subjt:  YCHKKGHFKSQC---RKFK-EAQKRKPEANIM-----QDVVLVCVDSD---TRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTR

Query:  GIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF
        GIGD+ +KT  G  LVL+DVR VP+++MNLIS   L  DGY   F
Subjt:  GIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEF

P25601 Putative transposon Ty5-1 protein YCL075W    6.2e-06    35.23
Query:  VCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI
        +C+ S T  +  SS+WI D+  + H+  DRS+F+SFT         G G +    G G V++ T     + L DV +VP++ +NLIS+
Subjt:  VCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    2.0e-04    26.61
Query:  NNTLKFLEVCDLAIAEEICRQGSNKESTVGSALVMTKGKDKVDEENELSSSRKKWK--------NKNEV-----ECFYCHKKGHFKSQCRKFK----EAQ
        N+  K L V    +        S++ +T  +        ++ D  N  ++S K W+        N N+      +C  C  +GH   +C + +       
Subjt:  NNTLKFLEVCDLAIAEEICRQGSNKESTVGSALVMTKGKDKVDEENELSSSRKKWK--------NKNEV-----ECFYCHKKGHFKSQCRKFK----EAQ

Query:  KRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKM
         ++P +          +   + YS  S++W+LDS A+ HI SD    SL   +TGG    V + +G T      G  SL T+    L L ++ +VPNI  
Subjt:  KRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKM

Query:  NLISIGKLAN-DGYMCEF
        NLIS+ +L N +G   EF
Subjt:  NLISIGKLAN-DGYMCEF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    3.4e-04    31.25
Query:  CFYCHKKGHFKSQCRKFKEAQKR-------------KPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNG
        C  C  +GH   +C +  + Q               +P AN+        V+S    +N    W+LDS A+ HI SD    S    +TGG    V + +G
Subjt:  CFYCHKKGHFKSQCRKFKEAQKR-------------KPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNG

Query:  RTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLAN
         T      G  SL T     L L  V +VPNI  NLIS+ +L N
Subjt:  RTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLAN

Arabidopsis top hits    e value    % identity    Alignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein    2.3e-08    26.81
Query:  KNKNEVECFYCHKKGHFKSQCRKFKEAQKRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIG
        K+K+E  C  C+K  H +  C+      K + E  I+ D  L  V +    +     WI+   A +++      FT+        V   +G      G G
Subjt:  KNKNEVECFYCHKKGHFKSQCRKFKEAQKRKPEANIMQDVVLVCVDSDTRYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIG

Query:  DVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLANDGY
        DV ++ + G K  +R+V FVP +  N++S GK+ +  Y
Subjt:  DVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLANDGY


Sequences
CDS sequence
ATGGATTTTGTAGAGCCAAAAAGTTTGGACGAAATCATGAAGTTTGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAAACCGAAAGGGATGACAGACAAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATAAGGATGTGTTTGTCAAGGGATGTGGCAAGTC
TAGTAGCCCATGAGACAACTGAAGTCAAATTGATGGAAGCGCTTACAAACAGTTGGAAAACGATGAAGACAGTAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTT
TTAGAAGTTTGTGATTTAGCCATAGCTGAGGAAATTTGTAGGCAGGGTAGTAATAAAGAGTCTACGGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGA
TGAAGAAAATGAATTGAGTAGCAGTAGAAAAAAGTGGAAAAATAAGAATGAGGTAGAATGTTTTTACTGTCATAAGAAAGGTCACTTCAAGAGTCAGTGTAGGAAATTTA
AAGAGGCTCAGAAAAGAAAACCAGAGGCAAATATAATGCAGGATGTTGTCTTAGTTTGTGTTGATAGTGACACAAGGTATAGTAACCATTCTTCAGATTGGATATTAGAT
AGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGG
GATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGATAAATTGGTACTGCGAGATGTCAGGTTCGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAA
ATGATGGTTACATGTGTGAGTTTGATAGTCGCTAG
mRNA sequence
ATGGATTTTGTAGAGCCAAAAAGTTTGGACGAAATCATGAAGTTTGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAAACCGAAAGGGATGACAGACAAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATAAGGATGTGTTTGTCAAGGGATGTGGCAAGTC
TAGTAGCCCATGAGACAACTGAAGTCAAATTGATGGAAGCGCTTACAAACAGTTGGAAAACGATGAAGACAGTAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTT
TTAGAAGTTTGTGATTTAGCCATAGCTGAGGAAATTTGTAGGCAGGGTAGTAATAAAGAGTCTACGGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGA
TGAAGAAAATGAATTGAGTAGCAGTAGAAAAAAGTGGAAAAATAAGAATGAGGTAGAATGTTTTTACTGTCATAAGAAAGGTCACTTCAAGAGTCAGTGTAGGAAATTTA
AAGAGGCTCAGAAAAGAAAACCAGAGGCAAATATAATGCAGGATGTTGTCTTAGTTTGTGTTGATAGTGACACAAGGTATAGTAACCATTCTTCAGATTGGATATTAGAT
AGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGG
GATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGATAAATTGGTACTGCGAGATGTCAGGTTCGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAA
ATGATGGTTACATGTGTGAGTTTGATAGTCGCTAG
Protein sequence
MDFVEPKSLDEIMKFDGKNFGYWKMQVKDYLTCKKVHKALKEKPKGMTDKDWEALDEEAVATIRMCLSRDVASLVAHETTEVKLMEALTNSWKTMKTVVSNSTGNNTLKF
LEVCDLAIAEEICRQGSNKESTVGSALVMTKGKDKVDEENELSSSRKKWKNKNEVECFYCHKKGHFKSQCRKFKEAQKRKPEANIMQDVVLVCVDSDTRYSNHSSDWILD
SAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLANDGYMCEFDSR
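
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene (an assumption; the record does not name a translation table). The helper name `translate` is hypothetical, and the two short inputs are the first 30 and last 12 nucleotides of the CDS listed above.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS into a protein string, dropping one terminal stop.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Characters other than
    A, C, G and T raise KeyError in this sketch.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein

print(translate("ATGGATTTTGTAGAGCCAAAAAGTTTGGAC"))  # MDFVEPKSLD
print(translate("GATAGTCGCTAG"))                    # DSR
```

Both fragments agree with the start (`MDFVEPKSLD`) and end (`DSR`) of the listed protein sequence; passing the full CDS would give the complete comparison.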