; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G020780 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G020780
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr04:11892204..11893663
RNA-Seq ExpressionCmoCh04G020780
SyntenyCmoCh04G020780
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3643966.1 Pleiotropic drug resistance protein 1 [Capsicum annuum]3.4e-12960.09Show/hide
Query:  QKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI----------------------------------------
        +K LHEPL+GVK + M  + WKLKDR+ LG+I+L+LSRNVAFNI+KEKTTSDL+KALSN+                                        
Subjt:  QKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI----------------------------------------

Query:  --------------------------------SRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPN-KGRSKSKSREKSPNRPNVT
                                        S GS+KLKF +IRDVV S+SI KRE G+SSG+ALSVDRRGR + +G N   RSKSK+R KSP + NVT
Subjt:  --------------------------------SRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPN-KGRSKSKSREKSPNRPNVT

Query:  CWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIK
        CWNCGEKGHF T C +PK+ +N KSGDD+DS+NS EDIG+ALILSVDS +ES ILD GASFH  P+KELFQNFK GNF+KVYL DNK L I+ KGDVCIK
Subjt:  CWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIK

Query:  TPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAK
        TPAGN+WTL+D+RYIP LK+NLISIGQLDST YAT+ GKGSWKIMKGA VVARG+KSGTL TT  C+N+A VAE AS+  L HNRL  MSAK MK LAAK
Subjt:  TPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAK

Query:  GVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVKKVRLEM
        G LEG+K VD+G CE+ VM KQKRVSFT+TA+  KKVRLEM
Subjt:  GVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVKKVRLEM

VFQ62075.1 unnamed protein product [Cuscuta campestris]2.6e-12171.6Show/hide
Query:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR
        MQIEDYLYQKDLHEPL GVKPD+MT EQWKLKDR+ALGMI L+L++NVAFNI+KE TT+ L+KALSN+                              SR
Subjt:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR

Query:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINS
        GS+KLKF EI DVVLSESI KRE GDSSG+ALSVDRRGRSK KG ++ GRSKSK+R KSPNR N+TCWNCG+KGHF+  C +PK+KQN KSGDD DS+NS
Subjt:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINS

Query:  AEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYA
        AEDIGDALILSVDS +ESWILDSGASFHSSP+KE FQNFKSGNF KVYLADNK L I+GKGDV IKTPAGN+WTLKD+RYIP LK+NLISIGQLD+  YA
Subjt:  AEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYA

Query:  TKLGKGSWKIMKGATVVARGSKSGTLDTTTG
         + GKGSWKI+KGA VVARG+K GTL TT G
Subjt:  TKLGKGSWKIMKGATVVARGSKSGTLDTTTG

VFQ69914.1 unnamed protein product [Cuscuta campestris]5.7e-10075Show/hide
Query:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH
        D + A  + SRGS+KL+F EIRDVVLSESI KRE GDSSG+ALSVDR+GRSK KG ++ GRSKSK+R KSPNR N+TCWNCG+KGHF+  C +PK+KQN 
Subjt:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH

Query:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI
        KSGDD DS+NSAEDIGDALILSVDS +ESWILDSGASFHSSP+KELFQNFKSGNF KVYLADNK L I+GKGDV IKTP GN+WTLKD+RYIP LK+NLI
Subjt:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI

Query:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRH
        SIGQLD+T YA K GKGSWKI+KGA  VARG+K GTL TT GC+N+AA A+  S SSL H
Subjt:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRH

VFQ92713.1 unnamed protein product [Cuscuta campestris]2.8e-9953.94Show/hide
Query:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR
        MQIEDYLYQKDLHEPL GVKPD+MT EQWKLKDR+ALGMI+L+L++NVAFNI+KE TT+ LMKALSN+                              SR
Subjt:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR

Query:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNKGRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSA
        G++KL+F EIRDVVLSE                                                       GHF+  C +PK+KQN KSGDD DS+NSA
Subjt:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNKGRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSA

Query:  EDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYAT
        EDIGDALILSVDS +ESWILDSGASFHSSP+KELFQNFKSGNF KVYLADNK L I+GKGDV IKTP GN+WTLKD RYIP LK+NLISI          
Subjt:  EDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYAT

Query:  KLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVK
                                     GC+N+AA A+  S+SSL H+RL  MS KGM+ LAAKG LEGL SVD+G CE+ VM KQKRVSFT+T RE K
Subjt:  KLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVK

Query:  KVRLEM
        KVRLEM
Subjt:  KVRLEM

VFR02734.1 unnamed protein product [Cuscuta campestris]3.9e-10163.29Show/hide
Query:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH
        D + A  + SRGS+KL+F EIRDVVLSESI KRE GDSSG+ALSVDRRGRSK KG ++ GRSKSK+R KSPN  N++CWNCG+KGHF+  C +PK+KQN 
Subjt:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH

Query:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI
        KSGDD DS+NSAEDIGDALILSVDS +ESWILDSGASFHSSP+KELFQNFKSGNF KVYLADNK L I+GKGDV IKTP GN+WTLKD+RYIP LK+NLI
Subjt:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI

Query:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQK
        SIG+LD+T YA + GKGSWKI+KGA VVARG+K GTL TTTGC+N+AA A+  S SS + N+L+  + K    +     L G +  D    +N  + +  
Subjt:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQK

Query:  RVSFTRTAREVKKVRLEMEPDVEQGSKTTKQVGVEQVGVELEDSTP
         V+F  +   + K R +  P+      TTKQVGVE   VELE STP
Subjt:  RVSFTRTAREVKKVRLEMEPDVEQGSKTTKQVGVEQVGVELEDSTP

TrEMBL top hitse value%identityAlignment
A0A151TNK0 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-9443.95Show/hide
Query:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI--------------------------------
        MQIEDYLYQK L++PL+G KPD M  E+W L DR+ALG+I+L+L++NVAFNI+ EKTT+ LMK LS++                                
Subjt:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI--------------------------------

Query:  ------------------------------------------SRGSDKLKFGEIRDVVLSESIHKRE----TGDSSGNALSVDRRGRSKPKGPN-KGRSK
                                                  S   +KLK  +IRD++LSE + +R+    +  +S +AL+ + RGR+  KG N +GRSK
Subjt:  ------------------------------------------SRGSDKLKFGEIRDVVLSESIHKRE----TGDSSGNALSVDRRGRSKPKGPN-KGRSK

Query:  SKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSAED-IGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLAD
        S+++ +   R ++ CWNC ++GHF   C  PK+ +NHK  DDD+S N+A D I DALI S+DS IESWI+DSGASFH++P+ EL  N+ SG F KVYLAD
Subjt:  SKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSAED-IGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLAD

Query:  NKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNR
         K L I GKGD+ I+T +G+ WTLK++R+IPALKRNLIS+GQLD   + T  G G+WK+ KG  +VARG K G+L       N+ AV E+A+NS L H R
Subjt:  NKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNR

Query:  LEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVKKVRLEMEPDVEQGSKTTKQVG
        L  MS KGMK +A KG L  LK VDVG CE+ ++ KQ+++SF+R  + +K  RLE+      G    K +G
Subjt:  LEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVKKVRLEMEPDVEQGSKTTKQVG

A0A484KC47 CCHC-type domain-containing protein1.3e-12171.6Show/hide
Query:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR
        MQIEDYLYQKDLHEPL GVKPD+MT EQWKLKDR+ALGMI L+L++NVAFNI+KE TT+ L+KALSN+                              SR
Subjt:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR

Query:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINS
        GS+KLKF EI DVVLSESI KRE GDSSG+ALSVDRRGRSK KG ++ GRSKSK+R KSPNR N+TCWNCG+KGHF+  C +PK+KQN KSGDD DS+NS
Subjt:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINS

Query:  AEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYA
        AEDIGDALILSVDS +ESWILDSGASFHSSP+KE FQNFKSGNF KVYLADNK L I+GKGDV IKTPAGN+WTLKD+RYIP LK+NLISIGQLD+  YA
Subjt:  AEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYA

Query:  TKLGKGSWKIMKGATVVARGSKSGTLDTTTG
         + GKGSWKI+KGA VVARG+K GTL TT G
Subjt:  TKLGKGSWKIMKGATVVARGSKSGTLDTTTG

A0A484KZ82 CCHC-type domain-containing protein2.7e-10075Show/hide
Query:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH
        D + A  + SRGS+KL+F EIRDVVLSESI KRE GDSSG+ALSVDR+GRSK KG ++ GRSKSK+R KSPNR N+TCWNCG+KGHF+  C +PK+KQN 
Subjt:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH

Query:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI
        KSGDD DS+NSAEDIGDALILSVDS +ESWILDSGASFHSSP+KELFQNFKSGNF KVYLADNK L I+GKGDV IKTP GN+WTLKD+RYIP LK+NLI
Subjt:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI

Query:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRH
        SIGQLD+T YA K GKGSWKI+KGA  VARG+K GTL TT GC+N+AA A+  S SSL H
Subjt:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRH

A0A484MUU4 gag_pre-integrs domain-containing protein1.4e-9953.94Show/hide
Query:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR
        MQIEDYLYQKDLHEPL GVKPD+MT EQWKLKDR+ALGMI+L+L++NVAFNI+KE TT+ LMKALSN+                              SR
Subjt:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI------------------------------SR

Query:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNKGRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSA
        G++KL+F EIRDVVLSE                                                       GHF+  C +PK+KQN KSGDD DS+NSA
Subjt:  GSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNKGRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSA

Query:  EDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYAT
        EDIGDALILSVDS +ESWILDSGASFHSSP+KELFQNFKSGNF KVYLADNK L I+GKGDV IKTP GN+WTLKD RYIP LK+NLISI          
Subjt:  EDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYAT

Query:  KLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVK
                                     GC+N+AA A+  S+SSL H+RL  MS KGM+ LAAKG LEGL SVD+G CE+ VM KQKRVSFT+T RE K
Subjt:  KLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVK

Query:  KVRLEM
        KVRLEM
Subjt:  KVRLEM

A0A484NNM3 CCHC-type domain-containing protein1.9e-10163.29Show/hide
Query:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH
        D + A  + SRGS+KL+F EIRDVVLSESI KRE GDSSG+ALSVDRRGRSK KG ++ GRSKSK+R KSPN  N++CWNCG+KGHF+  C +PK+KQN 
Subjt:  DLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNH

Query:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI
        KSGDD DS+NSAEDIGDALILSVDS +ESWILDSGASFHSSP+KELFQNFKSGNF KVYLADNK L I+GKGDV IKTP GN+WTLKD+RYIP LK+NLI
Subjt:  KSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLI

Query:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQK
        SIG+LD+T YA + GKGSWKI+KGA VVARG+K GTL TTTGC+N+AA A+  S SS + N+L+  + K    +     L G +  D    +N  + +  
Subjt:  SIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQK

Query:  RVSFTRTAREVKKVRLEMEPDVEQGSKTTKQVGVEQVGVELEDSTP
         V+F  +   + K R +  P+      TTKQVGVE   VELE STP
Subjt:  RVSFTRTAREVKKVRLEMEPDVEQGSKTTKQVGVEQVGVELEDSTP

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-3331.36Show/hide
Query:  NVAFNIIKEKTTSDLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNV-TCWNCGEKGH
        N+A  I+  KTT +L    S +               +L+E + K+   ++ G AL  + RGRS  +  N  GRS ++ + K+ ++  V  C+NC + GH
Subjt:  NVAFNIIKEKTTSDLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRSKPKGPNK-GRSKSKSREKSPNRPNV-TCWNCGEKGH

Query:  FRTGCTRPKRKQNHKSGD-DDDSINSAEDIGDALILSVDSSIE---------SWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCI
        F+  C  P++ +   SG  +DD+  +     D ++L ++   E          W++D+ AS H++P ++LF  + +G+F  V + +    +I G GD+CI
Subjt:  FRTGCTRPKRKQNHKSGD-DDDSINSAEDIGDALILSVDSSIE---------SWILDSGASFHSSPNKELFQNFKSGNFEKVYLADNKDLEIKGKGDVCI

Query:  KTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTG--CMNIAAVAESASNSSLRHNRLEPMSAKGMKRL
        KT  G    LKD+R++P L+ NLIS   LD   Y +      W++ KG+ V+A+G   GTL  T    C      A+   +  L H R+  MS KG++ L
Subjt:  KTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTG--CMNIAAVAESASNSSLRHNRLEPMSAKGMKRL

Query:  AAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVK
        A K ++   K   V  C+  +  KQ RVSF +T+ E K
Subjt:  AAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVK

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein2.8e-0424.52Show/hide
Query:  RSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYL
        R KSKS +         C  C +  H +  C         +  D+       E + +    + D  I  WI+   A  + +P  + F          V  
Subjt:  RSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYL

Query:  ADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLG
         D   L ++GKGDV I+   G + T++++ ++P L RN++S G++ S  Y+   G
Subjt:  ADNKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLG

AT3G29785.1 unknown protein3.6e-1250Show/hide
Query:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI
        M+IEDYLY K LH+P LG K +TM+ + W +  R+ L +I+L++S+N+A N+ KEK+   LMK LS+I
Subjt:  MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNI

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.1e-0431.58Show/hide
Query:  KGSWKIMKGATVVARGSKSGTL-----DTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFT
        +G  K++KG   + +G++  +L        TG  N+A  A+    + L H+RL  MS +GM+ L  KG L+  K   +  CE+ +  K  RV+F+
Subjt:  KGSWKIMKGATVVARGSKSGTL-----DTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGATTGAAGATTATCTGTACCAGAAAGATCTTCACGAACCTCTATTGGGGGTGAAGCCGGATACCATGACCACGGAACAGTGGAAGCTTAAGGATCGAAAAGCCTT
AGGGATGATCCAGTTGTCGCTATCCAGAAACGTGGCGTTCAACATTATCAAGGAGAAGACAACGTCAGATCTAATGAAGGCGCTGTCGAATATTTCCCGAGGATCTGATA
AACTGAAGTTTGGTGAAATTCGAGATGTAGTTCTCAGCGAAAGTATTCACAAACGAGAAACTGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGT
AAGCCGAAGGGCCCAAACAAAGGGCGATCAAAATCAAAGAGCCGAGAAAAATCTCCAAATAGACCAAACGTAACGTGTTGGAATTGTGGAGAAAAAGGTCACTTTCGGAC
AGGTTGTACAAGACCAAAGAGAAAGCAGAATCACAAATCTGGAGATGATGATGATTCTATAAATTCAGCAGAAGACATTGGGGATGCTCTAATCCTCAGCGTGGACAGTT
CGATTGAATCCTGGATTTTGGATTCAGGTGCATCTTTTCATTCGTCTCCAAATAAAGAGTTGTTCCAAAATTTCAAGTCTGGAAATTTCGAGAAGGTGTATCTTGCCGAC
AACAAAGATTTGGAGATTAAAGGAAAAGGAGATGTCTGCATAAAAACTCCGGCAGGAAATCGGTGGACATTAAAGGATATCAGATATATTCCTGCTCTCAAAAGGAACCT
GATCTCTATTGGTCAATTGGATAGCACTAGTTATGCAACAAAGTTAGGGAAGGGTTCGTGGAAGATTATGAAGGGTGCTACGGTGGTAGCACGTGGCTCAAAATCTGGAA
CCCTAGACACCACTACAGGGTGTATGAACATAGCTGCTGTTGCTGAGAGTGCTTCAAATTCAAGTCTACGGCACAATAGACTTGAACCTATGAGCGCGAAAGGAATGAAG
AGGCTGGCTGCGAAAGGAGTTTTAGAAGGTCTGAAATCTGTTGATGTGGGTCGTTGTGAGAACTACGTTATGAGCAAGCAGAAACGAGTTAGCTTCACAAGGACCGCCAG
AGAAGTGAAGAAAGTGCGGTTGGAAATGGAACCAGATGTGGAGCAAGGTTCCAAGACCACGAAACAAGTGGGAGTTGAACAAGTGGGAGTTGAACTTGAAGATTCTACCC
CTACTCCTTCTCTCTCACAGGCCCGATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGATTGAAGATTATCTGTACCAGAAAGATCTTCACGAACCTCTATTGGGGGTGAAGCCGGATACCATGACCACGGAACAGTGGAAGCTTAAGGATCGAAAAGCCTT
AGGGATGATCCAGTTGTCGCTATCCAGAAACGTGGCGTTCAACATTATCAAGGAGAAGACAACGTCAGATCTAATGAAGGCGCTGTCGAATATTTCCCGAGGATCTGATA
AACTGAAGTTTGGTGAAATTCGAGATGTAGTTCTCAGCGAAAGTATTCACAAACGAGAAACTGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGT
AAGCCGAAGGGCCCAAACAAAGGGCGATCAAAATCAAAGAGCCGAGAAAAATCTCCAAATAGACCAAACGTAACGTGTTGGAATTGTGGAGAAAAAGGTCACTTTCGGAC
AGGTTGTACAAGACCAAAGAGAAAGCAGAATCACAAATCTGGAGATGATGATGATTCTATAAATTCAGCAGAAGACATTGGGGATGCTCTAATCCTCAGCGTGGACAGTT
CGATTGAATCCTGGATTTTGGATTCAGGTGCATCTTTTCATTCGTCTCCAAATAAAGAGTTGTTCCAAAATTTCAAGTCTGGAAATTTCGAGAAGGTGTATCTTGCCGAC
AACAAAGATTTGGAGATTAAAGGAAAAGGAGATGTCTGCATAAAAACTCCGGCAGGAAATCGGTGGACATTAAAGGATATCAGATATATTCCTGCTCTCAAAAGGAACCT
GATCTCTATTGGTCAATTGGATAGCACTAGTTATGCAACAAAGTTAGGGAAGGGTTCGTGGAAGATTATGAAGGGTGCTACGGTGGTAGCACGTGGCTCAAAATCTGGAA
CCCTAGACACCACTACAGGGTGTATGAACATAGCTGCTGTTGCTGAGAGTGCTTCAAATTCAAGTCTACGGCACAATAGACTTGAACCTATGAGCGCGAAAGGAATGAAG
AGGCTGGCTGCGAAAGGAGTTTTAGAAGGTCTGAAATCTGTTGATGTGGGTCGTTGTGAGAACTACGTTATGAGCAAGCAGAAACGAGTTAGCTTCACAAGGACCGCCAG
AGAAGTGAAGAAAGTGCGGTTGGAAATGGAACCAGATGTGGAGCAAGGTTCCAAGACCACGAAACAAGTGGGAGTTGAACAAGTGGGAGTTGAACTTGAAGATTCTACCC
CTACTCCTTCTCTCTCACAGGCCCGATAA
Protein sequenceShow/hide protein sequence
MQIEDYLYQKDLHEPLLGVKPDTMTTEQWKLKDRKALGMIQLSLSRNVAFNIIKEKTTSDLMKALSNISRGSDKLKFGEIRDVVLSESIHKRETGDSSGNALSVDRRGRS
KPKGPNKGRSKSKSREKSPNRPNVTCWNCGEKGHFRTGCTRPKRKQNHKSGDDDDSINSAEDIGDALILSVDSSIESWILDSGASFHSSPNKELFQNFKSGNFEKVYLAD
NKDLEIKGKGDVCIKTPAGNRWTLKDIRYIPALKRNLISIGQLDSTSYATKLGKGSWKIMKGATVVARGSKSGTLDTTTGCMNIAAVAESASNSSLRHNRLEPMSAKGMK
RLAAKGVLEGLKSVDVGRCENYVMSKQKRVSFTRTAREVKKVRLEMEPDVEQGSKTTKQVGVEQVGVELEDSTPTPSLSQAR