CuGenDBv2

Lag0036528 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036528
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:47949940..47952061
RNA-Seq Expression: Lag0036528
Synteny: Lag0036528
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
TKR74765.1 hypothetical protein D5086_0000292320 [Populus alba]; e value 2.5e-67; identity 42.18%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M +++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ASV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K  +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         +G      G GDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba]; e value 8.6e-68; identity 41.97%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK
         +G      GIGDV +KT  G    L+++R VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL K  LD +K
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba]; e value 7.8e-69; identity 41.31%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI V       +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK
         +G   K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL     ++   R+  +W  +K
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba]; e value 5.0e-68; identity 42.44%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         +G   K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

TKS13843.1 hypothetical protein D5086_0000049350 [Populus alba]; e value 3.3e-67; identity 41.91%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +M+++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG++ +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         +G      GIGDV +KT  G    L++VR VP +K  LIS+G+L D GY   F     K++ G+  +A G +  TL
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

TrEMBL top hits (e value, % identity, alignment)
A0A4U5P1P0 CCHC-type domain-containing protein; e value 4.2e-68; identity 41.97%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK
         +G      GIGDV +KT  G    L+++R VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL K  LD +K
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK

A0A4U5PY83 CCHC-type domain-containing protein; e value 3.8e-69; identity 41.31%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI V       +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK
         +G   K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL     ++   R+  +W  +K
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK

A0A4U5QGR0 Uncharacterized protein; e value 2.4e-68; identity 42.44%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         +G   K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

A0A4U5QS59 CCHC-type domain-containing protein; e value 1.6e-67; identity 41.91%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +M+++DYL  +KLH   L  KP++M  ++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV+I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG++ +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         +G      GIGDV +KT  G    L++VR VP +K  LIS+G+L D GY   F     K++ G+  +A G +  TL
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

A0A4V6XW18 CCHC-type domain-containing protein; e value 1.2e-67; identity 42.18%
Query:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV
        F   +MQ++DYL  +KLH   L  KP++M +++W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ASV
Subjt:  FWILEMQVKDYLTCRKLH-KALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASV

Query:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE
          ++N   T+ NQL SV I F D++ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + TRG+  D+      
Subjt:  NSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNE

Query:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM
          S   S+ K+ +R +VEC+ C K GHF   C K K  +     A T  V DALI         +   +WILDS AS H      +  ++ GG HG+V +
Subjt:  PSS---SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMM

Query:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         +G      G GDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  GNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 3.3e-46; identity 33.5%
Query:  EMQVKDYLTCRKLHKAL---KEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNS
        + +++D L  + LHK L    +KP  M  +DW  LDE A ++IR+ LS DV + +  E T   +   L + Y   +  NK+YL K+ + + M+E  +  S
Subjt:  EMQVKDYLTCRKLHKAL---KEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNS

Query:  YINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LAIAEEIRRQGSNKESTVGSALAMTRGK----DKIDEDNE
        ++N    LI QL ++ +   ++  AI LL SLP S++ + T + +  G  T++  +V   L + E++R++  N+    G AL +T G+     +   +  
Subjt:  YINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LAIAEEIRRQGSNKESTVGSALAMTRGK----DKIDEDNE

Query:  PSSSRKKWKNRNEV---ECFYCHKKGHFKSQC---RKLK-EDQKRKHEANTVY------DALICVESDTKC---SNHSSDWILDSAASVHIASDRSLFTS
         S +R K KNR++     C+ C++ GHFK  C   RK K E   +K++ NT        + ++ +  + +C   S   S+W++D+AAS H    R LF  
Subjt:  PSSSRKKWKNRNEV---ECFYCHKKGHFKSQC---RKLK-EDQKRKHEANTVY------DALICVESDTKC---SNHSSDWILDSAASVHIASDRSLFTS

Query:  FTGGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK
        +  G  G V MGN   SK +GIGD+ +KT  G  LVL+DVR VP+++MNLIS   L  DGY   F +++ +LT GS  +A G  + TLY+   ++ +
Subjt:  FTGGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK

P25601 Putative transposon Ty5-1 protein YCL075W; e value 2.0e-06; identity 35.23%
Query:  ICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI
        +C+ S T  +  SS+WI D+  + H+  DRS+F+SFT       + G G +    G G V++ T     + L DV +VP++ +NLIS+
Subjt:  ICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 6.1e-08; identity 23.26%
Query:  WEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDKVNAIKLLTS
        W+  D+   +++   +SM V   V+  TT  ++ E L   Y  PS  + V  ++           +++ Y+  + T  +QL  +           ++L +
Subjt:  WEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDKVNAIKLLTS

Query:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWK--------NRNEV--
        LP+ ++ +             T +     N+  K   V    +        S++ +T  +        ++ D  N  ++S K W+        N N+   
Subjt:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWK--------NRNEV--

Query:  ---ECFYCHKKGHFKSQCRKLKE-----DQKRKHEANTVYDALICVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVMMGNGRTSKT
           +C  C  +GH   +C +L+      + ++     T +     +   +  S  S++W+LDS A+ HI SD    SL   +TGG    VM+ +G T   
Subjt:  ---ECFYCHKKGHFKSQCRKLKE-----DQKRKHEANTVYDALICVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVMMGNGRTSKT

Query:  SGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLAD-DGYRCEF--GSRQCKLTFGSQEVAVGHRKSTLYK
        S  G  SL T+    L L ++ +VPNI  NLIS+ +L + +G   EF   S Q K       +  G  K  LY+
Subjt:  SGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLAD-DGYRCEF--GSRQCKLTFGSQEVAVGHRKSTLYK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 8.8e-07; identity 24.53%
Query:  WEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYL--VKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDKVNAIKLL
        W   D+   ++I   +SM V   V+  TT  ++ E L   Y  PS  +   L  + +F   Q+A       +  +V  ++  L        D++ A    
Subjt:  WEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKVYL--VKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDKVNAIKLL

Query:  TSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWKNRNEV----ECFYCHKKGHFKSQCR
         SL +  E +    S     N+ +   +    +    R   +N+              +      +PSSS  +  NR        C  C  +GH   +C 
Subjt:  TSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWKNRNEV----ECFYCHKKGHFKSQCR

Query:  KLKEDQKRKHEANTVYD-------ALICVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLV
        +L + Q   ++  +          A + V S    +N    W+LDS A+ HI SD    S    +TGG    VM+ +G T   +  G  SL T     L 
Subjt:  KLKEDQKRKHEANTVYD-------ALICVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLV

Query:  LRDVRFVPNIKMNLISIGKLAD
        L  V +VPNI  NLIS+ +L +
Subjt:  LRDVRFVPNIKMNLISIGKLAD

Arabidopsis top hits (e value, % identity, alignment)
AT3G21000.1 Gag-Pol-related retrotransposon family protein; e value 1.4e-07; identity 20.83%
Query:  LVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALA
        L K+  +++M +  S +SY+++   ++ +L   K+  +D      + T+L  S++ + + +      + +    + +       R   S+ E  +   L 
Subjt:  LVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALA

Query:  MTRGKDKIDEDNEPSSSRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANTVYDALICVESDTKCSNHSSD-WILDSAASVHIASDRSLFTSFT
          R K K           +KW       C  C+K  H +  C+      K + E   V D  +    +     +  D WI+   A +++      FT+  
Subjt:  MTRGKDKIDEDNEPSSSRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANTVYDALICVESDTKCSNHSSD-WILDSAASVHIASDRSLFTSFT

Query:  GGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFG
              V   +G      G GDV ++ + G K  +R+V FVP +  N++S GK+    Y    G
Subjt:  GGHHGLVMMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFG

AT3G29785.1 unknown protein; e value 6.7e-10; identity 41.56%
Query:  MQVKDYLTCRKLHKALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKV
        M+++DYL  +KLH+ L +K + M+ DDW  L  + +  IR+ +S ++A  VA E +P  LM+ L++ Y+KPS NN V
Subjt:  MQVKDYLTCRKLHKALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRYEKPSANNKV


Sequences
CDS sequence
ATGCGGCAAGAAATTCATGCGGCGAGAATGCATGCGGCGAGAAATGCATGCGACGAGAAAGTCATGCGAGCCAAGAAATTTCGATGGAGTCACGAAGTTCGATGGGAAAA
ATTTTGGATATTGGAAATGCAAGTCAAGGATTATTTAACTTGCAGGAAATTGCATAAGGCACTGAAGGAGAAACCGAAAGAGATGACAGACGACGATTGGGAAGCTCTGG
ATGAAGAGGCAGTTGCAAGCATAAGGATGTGCTTATCAATGGATGTGGCAAGTCTAGTGGCCCATGAGACAACTCCAGTTAAATTGATGGAAGCACTTACAAACAGGTAT
GAAAAACCCTCCGCGAATAATAAAGTCTACCTAGTAAAGAAGTTCTTCAACATGCAAATGGCTGAGGATGCTTCTGTAAATTCCTATATTAATGAAGTTACCACTCTGAT
TAATCAGTTAAAATCTGTTAAGATAGGATTTACTGATAAGGTGAATGCTATTAAGTTATTAACATCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGTCTAATT
CGACTGGAAATAATACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGACAGGGTAGTAATAAGGAGTCTACGGTAGGGTCAGCTTTGGCT
ATGACTAGAGGTAAAGATAAGATTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAATAGGAATGAGGTAGAATGTTTTTACTGCCACAAGAAAGGTCA
CTTCAAGAGTCAGTGTAGAAAACTTAAAGAGGATCAGAAAAGAAAACACGAGGCAAATACAGTGTATGATGCCTTAATTTGTGTTGAGAGTGACACAAAGTGTAGTAACC
ATTCATCAGATTGGATATTAGACAGTGCAGCGTCTGTACACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGATGATGGGGAAT
GGTAGAACCTCCAAAACCAGTGGGATTGGAGATGTTAGTCTGAAGACAGAGTGTGGAGATAAATTAGTACTGCGAGATGTCAGATTTGTGCCTAATATCAAGATGAATCT
TATTTCTATTGGTAAATTGGCAGATGATGGTTACAGGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCACGTTCGGATCCCAGGAAGTGGCAGTCGGTCACAGGAAATCTA
CACTGTACAAATGTAAGTTGGATGTTGCCAAAAGATCAAGGAGACAGTGGATGCCGGTTAAAGCTGCAGATGGTTGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATA
GCCAATTTCAATCAGTCCGATCACGATCCTTCAATTCAGAAACAATTGGGAAGTCCAGGAGATGGCTATCGTGAATCCCAGTTGTCAGACGCTCGAATGAATTGA
mRNA sequence
ATGCGGCAAGAAATTCATGCGGCGAGAATGCATGCGGCGAGAAATGCATGCGACGAGAAAGTCATGCGAGCCAAGAAATTTCGATGGAGTCACGAAGTTCGATGGGAAAA
ATTTTGGATATTGGAAATGCAAGTCAAGGATTATTTAACTTGCAGGAAATTGCATAAGGCACTGAAGGAGAAACCGAAAGAGATGACAGACGACGATTGGGAAGCTCTGG
ATGAAGAGGCAGTTGCAAGCATAAGGATGTGCTTATCAATGGATGTGGCAAGTCTAGTGGCCCATGAGACAACTCCAGTTAAATTGATGGAAGCACTTACAAACAGGTAT
GAAAAACCCTCCGCGAATAATAAAGTCTACCTAGTAAAGAAGTTCTTCAACATGCAAATGGCTGAGGATGCTTCTGTAAATTCCTATATTAATGAAGTTACCACTCTGAT
TAATCAGTTAAAATCTGTTAAGATAGGATTTACTGATAAGGTGAATGCTATTAAGTTATTAACATCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGTCTAATT
CGACTGGAAATAATACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGACAGGGTAGTAATAAGGAGTCTACGGTAGGGTCAGCTTTGGCT
ATGACTAGAGGTAAAGATAAGATTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAATAGGAATGAGGTAGAATGTTTTTACTGCCACAAGAAAGGTCA
CTTCAAGAGTCAGTGTAGAAAACTTAAAGAGGATCAGAAAAGAAAACACGAGGCAAATACAGTGTATGATGCCTTAATTTGTGTTGAGAGTGACACAAAGTGTAGTAACC
ATTCATCAGATTGGATATTAGACAGTGCAGCGTCTGTACACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGATGATGGGGAAT
GGTAGAACCTCCAAAACCAGTGGGATTGGAGATGTTAGTCTGAAGACAGAGTGTGGAGATAAATTAGTACTGCGAGATGTCAGATTTGTGCCTAATATCAAGATGAATCT
TATTTCTATTGGTAAATTGGCAGATGATGGTTACAGGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCACGTTCGGATCCCAGGAAGTGGCAGTCGGTCACAGGAAATCTA
CACTGTACAAATGTAAGTTGGATGTTGCCAAAAGATCAAGGAGACAGTGGATGCCGGTTAAAGCTGCAGATGGTTGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATA
GCCAATTTCAATCAGTCCGATCACGATCCTTCAATTCAGAAACAATTGGGAAGTCCAGGAGATGGCTATCGTGAATCCCAGTTGTCAGACGCTCGAATGAATTGA
Protein sequence
MRQEIHAARMHAARNACDEKVMRAKKFRWSHEVRWEKFWILEMQVKDYLTCRKLHKALKEKPKEMTDDDWEALDEEAVASIRMCLSMDVASLVAHETTPVKLMEALTNRY
EKPSANNKVYLVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDKVNAIKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALA
MTRGKDKIDEDNEPSSSRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANTVYDALICVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVMMGN
GRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAKRSRRQWMPVKAADGCCRGTVEPAARI
ANFNQSDHDPSIQKQLGSPGDGYRESQLSDARMN
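
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code applies to this gene and that line breaks are stripped before translation. The name `translate` and the `"X"` placeholder for unrecognized codons are illustrative choices, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code is the right table for this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove whitespace and line breaks, normalize case.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous or unexpected characters become "X".
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed in this record.
print(translate("ATGCGGCAAGAAATTCATGCGGCG"))  # MRQEIHAA
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("GCTCGAATGAATTGA"))           # ARMN
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence above, after removing the protein's line breaks, gives a quick consistency check between the two listings.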