; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036131 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036131
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:39921285..39922718
RNA-Seq ExpressionLag0036131
SyntenyLag0036131
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TKR74765.1 hypothetical protein D5086_0000292320 [Populus alba]1.1e-6742.74Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M E+EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ASV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K  +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
            G GDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba]8.6e-6842.26Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK
            GIGDV +KT  G    L+++R VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL K  LD +K
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba]7.8e-6941.58Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+ V       +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK
         K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL     ++   R+  +W  +K
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba]3.9e-6842.74Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

TKS13843.1 hypothetical protein D5086_0000049350 [Populus alba]2.5e-6742.2Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        M+++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG++ + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
            GIGDV +KT  G    L++VR VP +K  LIS+G+L D GY   F     K++ G+  +A G +  TL
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

TrEMBL top hitse value%identityAlignment
A0A4U5P1P0 CCHC-type domain-containing protein4.2e-6842.26Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK
            GIGDV +KT  G    L+++R VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL K  LD +K
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK

A0A4U5PY83 CCHC-type domain-containing protein3.8e-6941.58Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+ V       +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK
         K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL     ++   R+  +W  +K
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDV-AKRSRRQWMPVK

A0A4U5QGR0 Uncharacterized protein1.9e-6842.74Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
         K  GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

A0A4U5QS59 CCHC-type domain-containing protein1.2e-6742.2Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        M+++DYL  +KLH   L  KP++M  +EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV+I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG++ + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
            GIGDV +KT  G    L++VR VP +K  LIS+G+L D GY   F     K++ G+  +A G +  TL
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

A0A4V6XW18 CCHC-type domain-containing protein5.5e-6842.74Show/hide
Query:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN
        MQ++DYL  +KLH   L  KP++M E+EW+ LD + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ASV  ++N
Subjt:  MQVKDYLTCRKLH-KALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYIN

Query:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--
           T+ NQL SV I F DE+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + TRG+  D+        S  
Subjt:  EVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAM-TRGK--DKIDEDNEPSS--

Query:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT
         S+ K+ +R +VEC+ C K GHF   C K K  +     A T  V DAL+         +   +WILDS AS H      +  ++ GG HG+V + +G  
Subjt:  -SRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANT--VYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRT

Query:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL
            G GDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K++ G+  +A G +  TL
Subjt:  SKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-4633.67Show/hide
Query:  QVKDYLTCRKLHKAL---KEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYI
        +++D L  + LHK L    +KP  M  ++W  LDE A ++IR+ LS DV + +  E TA  +   L + Y   +  NK+YL K+ + + M+E  +  S++
Subjt:  QVKDYLTCRKLHKAL---KEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYI

Query:  NEVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LVIAEEIRRQGSNKESTVGSALAMTRGK----DKIDEDNEPS
        N    LI QL ++ +   +E   I LL SLP S++ + T + +  G  T++  +V   L++ E++R++  N+    G AL +T G+     +   +   S
Subjt:  NEVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LVIAEEIRRQGSNKESTVGSALAMTRGK----DKIDEDNEPS

Query:  SSRKKWKNRNEV---ECFYCHKKGHFKSQC---RKLK-EDQKRKHEANTVY------DALVCVESDTKC---SNHSSDWILDSAASVHIASDRSLFTSFT
         +R K KNR++     C+ C++ GHFK  C   RK K E   +K++ NT        + ++ +  + +C   S   S+W++D+AAS H    R LF  + 
Subjt:  SSRKKWKNRNEV---ECFYCHKKGHFKSQC---RKLK-EDQKRKHEANTVY------DALVCVESDTKC---SNHSSDWILDSAASVHIASDRSLFTSFT

Query:  GGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK
         G  G V+MGN   SK +GIGD+ +KT  G  LVL+DVR VP+++MNLIS   L  DGY   F +++ +LT GS  +A G  + TLY+   ++ +
Subjt:  GGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAK

P25601 Putative transposon Ty5-1 protein YCL075W7.5e-0635.23Show/hide
Query:  VCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI
        +C+ S T  +  SS+WI D+  + H+  DRS+F+SFT         G G +    G G V++ T     + L DV +VP++ +NLIS+
Subjt:  VCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-0823.53Show/hide
Query:  WEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDEVNVIQLLTS
        W+  D+   +++   +SM V   V+  TTA ++ E L   Y  PS  + V  ++           +++ Y+  + T  +QL  +      +  V ++L +
Subjt:  WEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDEVNVIQLLTS

Query:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWK--------NRNEV--
        LP+ ++ +             T +     N+  K   V    +        S++ +T  +        ++ D  N  ++S K W+        N N+   
Subjt:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWK--------NRNEV--

Query:  ---ECFYCHKKGHFKSQCRKLKE-----DQKRKHEANTVYDALVCVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKT
           +C  C  +GH   +C +L+      + ++     T +     +   +  S  S++W+LDS A+ HI SD    SL   +TGG    V + +G T   
Subjt:  ---ECFYCHKKGHFKSQCRKLKE-----DQKRKHEANTVYDALVCVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKT

Query:  SGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLAD-DGYRCEF--GSRQCKLTFGSQEVAVGHRKSTLYK
        S  G  SL T+    L L ++ +VPNI  NLIS+ +L + +G   EF   S Q K       +  G  K  LY+
Subjt:  SGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLAD-DGYRCEF--GSRQCKLTFGSQEVAVGHRKSTLYK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-0624.53Show/hide
Query:  WEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYL--VKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDEVNVIQLL
        W   D+   ++I   +SM V   V+  TTA ++ E L   Y  PS  +   L  + +F   Q+A       +  +V  ++  L        D++      
Subjt:  WEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYL--VKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDEVNVIQLL

Query:  TSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWKNRNEV----ECFYCHKKGHFKSQCR
         SL +  E +    S     N+ +   +   V+    R   +N+              +      +PSSS  +  NR        C  C  +GH   +C 
Subjt:  TSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWKNRNEV----ECFYCHKKGHFKSQCR

Query:  KLKEDQKRKHEANTVYD-------ALVCVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLV
        +L + Q   ++  +          A + V S    +N    W+LDS A+ HI SD    S    +TGG    V + +G T   +  G  SL T     L 
Subjt:  KLKEDQKRKHEANTVYD-------ALVCVESDTKCSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLV

Query:  LRDVRFVPNIKMNLISIGKLAD
        L  V +VPNI  NLIS+ +L +
Subjt:  LRDVRFVPNIKMNLISIGKLAD

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein1.4e-0720.83Show/hide
Query:  LVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALA
        L K+  +++M +  S +SY+++   ++ +L   K+  +D      + T+L  S++ + + +      + +    + +       R   S+ E  +   L 
Subjt:  LVKKFFNMQMAEDASVNSYINEVTTLINQLKSVKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALA

Query:  MTRGKDKIDEDNEPSSSRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANTVYDALVCVESDTKCSNHSSD-WILDSAASVHIASDRSLFTSFT
          R K K           +KW       C  C+K  H +  C+      K + E   V D  +    +     +  D WI+   A +++      FT+  
Subjt:  MTRGKDKIDEDNEPSSSRKKWKNRNEVECFYCHKKGHFKSQCRKLKEDQKRKHEANTVYDALVCVESDTKCSNHSSD-WILDSAASVHIASDRSLFTSFT

Query:  GGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFG
              V   +G      G GDV ++ + G K  +R+V FVP +  N++S GK+    Y    G
Subjt:  GGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLADDGYRCEFG

AT3G29785.1 unknown protein2.8e-0838.96Show/hide
Query:  MQVKDYLTCRKLHKALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKV
        M+++DYL  +KLH+ L +K + M++D+W  L  + +  IR+ +S ++A  VA E +   LM+ L++ Y+KPS NN V
Subjt:  MQVKDYLTCRKLHKALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTCAAGGATTATTTAACTTGCAGGAAATTGCATAAGGCACTGAAGGAGAAACCGAAAGAGATGACAGAAGACGAGTGGGAAGCTCTGGATGAAGAGGCAGTTGC
AAGCATAAGGATGTGCTTATCAATGGATGTGGCAAGTCTAGTGGCTCATGAGACAACTGCGGTTAAATTGATGGAAGCACTTACAAACAGGTATGAAAAACCCTCTGCGA
ATAATAAAGTCTACCTAGTTAAGAAGTTCTTCAACATGCAAATGGCTGAGGATGCTTCTGTGAATTCCTATATTAATGAAGTTACCACTTTGATTAATCAGTTAAAATCT
GTTAAGATAGGATTTACTGATGAGGTGAATGTTATTCAGCTATTAACATCTTTACCTGATAGTTGGGAAACAATGAAGACAGCAGTGTCTAATTCGACTGGAAATAATAC
TTTAAAATTTTCAGAAGTTTGTGATTTAGTCATAGCTGAGGAGATTCGTAGACAGGGTAGTAATAAGGAGTCTACAGTAGGGTCAGCTTTGGCTATGACTAGAGGTAAAG
ATAAGATTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAATGGAAAAATAGGAATGAGGTAGAATGTTTTTACTGCCACAAGAAAGGTCACTTCAAGAGTCAGTGT
AGAAAACTTAAAGAGGATCAGAAAAGAAAACACGAGGCAAATACAGTGTATGATGCCTTAGTTTGTGTTGAGAGTGACACAAAGTGTAGTAACCATTCATCAGATTGGAT
ATTAGACAGTGCAGCGTCTGTACACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGAGGATGGGGAATGGTAGAACCTCCAAAA
CCAGTGGGATTGGAGATGTTAGTCTGAAGACAGAGTGTGGAGATAAATTAGTACTGCGAGATGTCAGATTTGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAA
TTGGCAGATGATGGTTACAGGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCACGTTCGGATCCCAGGAAGTGGCAGTCGGTCACAGGAAATCTACATTGTACAAATGTAA
GTTGGATGTTGCCAAAAGATCAAGGAGACAGTGGATGCCGGTTAAAGCTGCAGATGATCCAGCAGCAAAGATAGCCAATTTCGATCAGTCCGATCACGATCCTTCAATTC
AGAAACAATTGGGAAGTCCAGGAGATGGCTATCGTGAATCCCCAGTTGTCAGACGCTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGTC
AAAGCAGTTGGTCAGGTCTCTAGCTTGGCAACAGGTTTGAATAGAGGATTCAAGCCATTCTTCTTCGGGAACAGTCGTTCAAGTTGGAAGAAGATAACAGGAGTCACCAC
TTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTCAAGGATTATTTAACTTGCAGGAAATTGCATAAGGCACTGAAGGAGAAACCGAAAGAGATGACAGAAGACGAGTGGGAAGCTCTGGATGAAGAGGCAGTTGC
AAGCATAAGGATGTGCTTATCAATGGATGTGGCAAGTCTAGTGGCTCATGAGACAACTGCGGTTAAATTGATGGAAGCACTTACAAACAGGTATGAAAAACCCTCTGCGA
ATAATAAAGTCTACCTAGTTAAGAAGTTCTTCAACATGCAAATGGCTGAGGATGCTTCTGTGAATTCCTATATTAATGAAGTTACCACTTTGATTAATCAGTTAAAATCT
GTTAAGATAGGATTTACTGATGAGGTGAATGTTATTCAGCTATTAACATCTTTACCTGATAGTTGGGAAACAATGAAGACAGCAGTGTCTAATTCGACTGGAAATAATAC
TTTAAAATTTTCAGAAGTTTGTGATTTAGTCATAGCTGAGGAGATTCGTAGACAGGGTAGTAATAAGGAGTCTACAGTAGGGTCAGCTTTGGCTATGACTAGAGGTAAAG
ATAAGATTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAATGGAAAAATAGGAATGAGGTAGAATGTTTTTACTGCCACAAGAAAGGTCACTTCAAGAGTCAGTGT
AGAAAACTTAAAGAGGATCAGAAAAGAAAACACGAGGCAAATACAGTGTATGATGCCTTAGTTTGTGTTGAGAGTGACACAAAGTGTAGTAACCATTCATCAGATTGGAT
ATTAGACAGTGCAGCGTCTGTACACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGAGGATGGGGAATGGTAGAACCTCCAAAA
CCAGTGGGATTGGAGATGTTAGTCTGAAGACAGAGTGTGGAGATAAATTAGTACTGCGAGATGTCAGATTTGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAA
TTGGCAGATGATGGTTACAGGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCACGTTCGGATCCCAGGAAGTGGCAGTCGGTCACAGGAAATCTACATTGTACAAATGTAA
GTTGGATGTTGCCAAAAGATCAAGGAGACAGTGGATGCCGGTTAAAGCTGCAGATGATCCAGCAGCAAAGATAGCCAATTTCGATCAGTCCGATCACGATCCTTCAATTC
AGAAACAATTGGGAAGTCCAGGAGATGGCTATCGTGAATCCCCAGTTGTCAGACGCTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGTC
AAAGCAGTTGGTCAGGTCTCTAGCTTGGCAACAGGTTTGAATAGAGGATTCAAGCCATTCTTCTTCGGGAACAGTCGTTCAAGTTGGAAGAAGATAACAGGAGTCACCAC
TTAG
Protein sequenceShow/hide protein sequence
MQVKDYLTCRKLHKALKEKPKEMTEDEWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMAEDASVNSYINEVTTLINQLKS
VKIGFTDEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALAMTRGKDKIDEDNEPSSSRKKWKNRNEVECFYCHKKGHFKSQC
RKLKEDQKRKHEANTVYDALVCVESDTKCSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGK
LADDGYRCEFGSRQCKLTFGSQEVAVGHRKSTLYKCKLDVAKRSRRQWMPVKAADDPAAKIANFDQSDHDPSIQKQLGSPGDGYRESPVVRRSNELKKSLRRVEASKWKV
KAVGQVSSLATGLNRGFKPFFFGNSRSSWKKITGVTT