; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G30800 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G30800
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase-like protein
Genome locationChr1:25290010..25291851
RNA-Seq ExpressionCSPI01G30800
SyntenyCSPI01G30800
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035681.1 reverse transcriptase [Cucumis melo var. makuwa]1.3e-14262.74Show/hide
Query:  YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTK-----S
        Y+QYQ CRQG + VA+YIEEFHRLSAR NLSENEQHQ+ARF+GGLRFDIK+KVKL  FR LSEAI+ AETVEEM+ +R KN NRR AWET+ +K      
Subjt:  YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTK-----S

Query:  KTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGG--QTSEDSIEADEETELVEADDGERV
        KT++QPS+S   KGK +D QE    +KKE   +   QN+Y+RPSLGKCFRCG+ GHLSNNC QRKTIA+AE+     +  D  E +EETEL+EADDG+R+
Subjt:  KTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGG--QTSEDSIEADEETELVEADDGERV

Query:  YCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCD
         C++QRVLITPKEE N Q H LFKTRCTING+V                            HP+PYKIGW++KGGE  ++EICT+PLSIGN+YKDQIVCD
Subjt:  YCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCD

Query:  VIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQ
        VIEMDVCHLLLGRPWQ+DTQ+LH+GRENTYEFQWMG+KV+LLP+ KK  E +R   ++QLFITVSGKN+LKEREQD+LGL+V +K++    E +EP+L++
Subjt:  VIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQ

Query:  LLHEFPHIKEEPKGLP
        L  EFPH+K+EP+GLP
Subjt:  LLHEFPHIKEEPKGLP

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]4.3e-14661.25Show/hide
Query:  MDTPERKKVHLMKKLLK--ARFLPLNYEQTL---YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIA
        M  P+RKKVHL+   LK  A   P++Y Q +   Y+QYQ CRQG + VA+YIEEFHRL AR NLSENEQHQ+ARF+GGLRFDIK+KVKL  FR LSEAI+
Subjt:  MDTPERKKVHLMKKLLK--ARFLPLNYEQTL---YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIA

Query:  FAETVEEMIAIRSKNLNRRPAWETSSTK-----SKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKT
         AETVEEM+ +R KN NRR AWET+ +K      KT++QPS+S   KGK +D QE    +KKE   +   QN+Y+RPSLGKCFRCG+ GHLSNNC QRKT
Subjt:  FAETVEEMIAIRSKNLNRRPAWETSSTK-----SKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKT

Query:  IAIAE-EGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYK
        IA+AE E    S    E +EETEL+EADDG+R+ C++QRVLITPKEE N Q H LFKTRCTING+V                            HP+PYK
Subjt:  IAIAE-EGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYK

Query:  IGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGK
        IGW++KGGE  ++EICT+PLSIGN+YKDQIVCDVIEMDVCHLLLGRPWQ+DTQ+LH+GRENTYEFQWMG+KV+LLP+ KK  E +R   ++QLFITVSGK
Subjt:  IGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGK

Query:  NMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP
        N+LKEREQD+LGL+V +K++    E +EP+L++L  EFPH+K+EP+GLP
Subjt:  NMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]1.2e-22888.34Show/hide
Query:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV
        MDTPERKKVHL                                 MKKLLKARFLP NYEQTLYNQYQ CRQGVRSVA+YIEEFHRLSARTNLSENEQHQV
Subjt:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV

Query:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRP
        ARFVGGLRFDIK+KV+LQPFRFLSEAI+FAETVEEMIAIRSKNLNRR AWET+STKSKTNDQPS+STK KGKE+DNQEV  ERKKEQTFKPSGQNSYSRP
Subjt:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRP

Query:  SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN
        SLGKCFRCGQTGHLS+NCPQRKTIAIAEEGGQ SEDSIEA+EETEL+EADDGERV CVIQR+LITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN
Subjt:  SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN

Query:  FVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPIT
        FVAKKLVTVLNLKAEAHPNPYKIGW+RKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPIT
Subjt:  FVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPIT

Query:  KKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKE
        KKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIK+
Subjt:  KKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKE

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]3.5e-14083.6Show/hide
Query:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV
        MDTPERKKVHL                                 MKKLLKARFLP NYEQTLYNQYQ CRQGVR+VA+YIEEFHRLSARTNLSENEQHQV
Subjt:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV

Query:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRP
        ARFVGGLRFDIK+KV+LQPFRFLSEAI+FAETVEEMIAIRSKNLNRR AWET+STKSKTNDQPS+STK KGKE+DNQEV  ERKKEQTFKPSGQN+YSRP
Subjt:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRP

Query:  SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN
        SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEA+EETEL+EADDGERV CVIQR+LITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN
Subjt:  SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN

Query:  FVAKKLVTVLNLKAEAH
        FVAKKLVTVLNLKAEAH
Subjt:  FVAKKLVTVLNLKAEAH

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]4.8e-21484.01Show/hide
Query:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV
        MDTPERKKVHL                                 MKKLLKARFLP NYEQTLYNQYQ CRQGVRSVADYIEEFHRLSARTNLSENEQHQV
Subjt:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV

Query:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRP
        ARFVG                         ETVEEMIAIRSKNLNRR AWET+STKSKTNDQPS+STKGKGKEVDNQEV  ERKKEQTFKPSGQNSYSRP
Subjt:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRP

Query:  SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN
        SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEA+EETEL+EADDGERV C IQRVLI PKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN
Subjt:  SLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSEN

Query:  FVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPIT
        FVAKKLV VLNLKAEAHP PYKIGW+RKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPIT
Subjt:  FVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPIT

Query:  KKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP
        KKINEGLRGEKQLFITVSGK MLKEREQ ILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP
Subjt:  KKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP

TrEMBL top hitse value%identityAlignment
A0A5A7T256 Reverse transcriptase6.3e-14362.74Show/hide
Query:  YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTK-----S
        Y+QYQ CRQG + VA+YIEEFHRLSAR NLSENEQHQ+ARF+GGLRFDIK+KVKL  FR LSEAI+ AETVEEM+ +R KN NRR AWET+ +K      
Subjt:  YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTK-----S

Query:  KTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGG--QTSEDSIEADEETELVEADDGERV
        KT++QPS+S   KGK +D QE    +KKE   +   QN+Y+RPSLGKCFRCG+ GHLSNNC QRKTIA+AE+     +  D  E +EETEL+EADDG+R+
Subjt:  KTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGG--QTSEDSIEADEETELVEADDGERV

Query:  YCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCD
         C++QRVLITPKEE N Q H LFKTRCTING+V                            HP+PYKIGW++KGGE  ++EICT+PLSIGN+YKDQIVCD
Subjt:  YCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCD

Query:  VIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQ
        VIEMDVCHLLLGRPWQ+DTQ+LH+GRENTYEFQWMG+KV+LLP+ KK  E +R   ++QLFITVSGKN+LKEREQD+LGL+V +K++    E +EP+L++
Subjt:  VIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQ

Query:  LLHEFPHIKEEPKGLP
        L  EFPH+K+EP+GLP
Subjt:  LLHEFPHIKEEPKGLP

A0A5D3C3X9 Reverse transcriptase5.0e-12458.25Show/hide
Query:  EQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSK
        + TLY+QYQ CRQG R+VADYI+EFH L AR NLSENEQHQ+ARF+GGLRFDIK+K+KLQPFRFLSEAI+FAETVEEM AIR+KN               
Subjt:  EQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSK

Query:  TNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAE-EGGQTSEDSIEADEETELVEADDGERVYC
            PS+ST+GKGKEV+ Q++  ++K+E   K   QN Y+RPSLGKCFRCGQ GH SN CPQRKTIA+A+ E    SE S E +EE +L+EADDG RV C
Subjt:  TNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAE-EGGQTSEDSIEADEETELVEADDGERVYC

Query:  VIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVI
        VIQRVL+ PKEE N Q H LFKTRCTING+VCDVIID+GSSENFVAKKLVT LNLKAE HPNPYKIGW++KGGE T+SEICTVPLSIGN YKDQIVCDVI
Subjt:  VIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVI

Query:  EMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHE
        EMD                                                                EREQD+LGL++++K+ E+Q+E ++ +LQQL  E
Subjt:  EMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHE

Query:  FPHIKEEPKGLP
        FPH+K+EP+GLP
Subjt:  FPHIKEEPKGLP

A0A5D3CJ99 Reverse transcriptase2.8e-11952.2Show/hide
Query:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV
        MDT +RKKVHL                                 MKKL+KARFLP NYEQTLYNQYQ CRQG +SVA+YI+EFH+LSARTNL        
Subjt:  MDTPERKKVHL---------------------------------MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQV

Query:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTK-----SKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQN
                                      +TVEEM+A R +N NR+  WET  +K     +KTN+QPS+S   K K+V+ QE    +KKE   K   QN
Subjt:  ARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTK-----SKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQN

Query:  SYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDS
        +Y+RPSLGKCFRC QT +LSN CPQRKTIA+AEE   ++ D  +  EE                                   +T+CTING+VCDVIIDS
Subjt:  SYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDS

Query:  GSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVV
        GSSENF+AK LVT LNLKAE HPNP+KIGW+ KGGE  VSEICTVPLSIG+ YKDQI+CDVI+MDVCHLLLGRPWQ+DTQ+LHKGRENTYEF WMG+K+V
Subjt:  GSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVV

Query:  LLPITKKINEGLRGEK---QLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP
        LLP++K  N G +  K   QLF TV GK ++ ERE+DIL LVV++K+  +Q   +EP+LQQLL EFPH+K+EP GLP
Subjt:  LLPITKKINEGLRGEK---QLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP

A0A5D3DGR0 Reverse transcriptase3.2e-13958.93Show/hide
Query:  MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNL
        MKKL+K RF+P NYEQTLY QYQ CRQG+R  A+YIEEFHRL  RTNL E E+H ++ FVGGLRFD+K+KVKLQPF+ LSEAI +AETVEEMI  R+K+ 
Subjt:  MKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIAFAETVEEMIAIRSKNL

Query:  NRRPAWETSSTKSKT--NDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSI-EAD
         +RP WE S++K  T  N +  ++T  K  E   QE    +K+    +  G+N Y RP  G C+RCGQ GH SN CPQRKTIA+A++    S  S+ E D
Subjt:  NRRPAWETSSTKSKT--NDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDSI-EAD

Query:  EETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVP
        EETE++EAD+G+ + C++QRVLI+PKEE  LQRH LFKTRCTI G+VC+VIIDSGSSENFV+KKLVT LNLK + H  PYKIGWI+KGGE  +SEIC VP
Subjt:  EETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEICTVP

Query:  LSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQ---LFITVSGKNMLKEREQDILGLVVIEK
        LSIGN+YKDQ+VCDVIEMDVCH+LLGRPWQ+D QS+H+GRENTYEF WM +KV+LLP+ K+ ++ +   ++   LF+T+SGK  L+ERE +ILG+V+   
Subjt:  LSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQ---LFITVSGKNMLKEREQDILGLVVIEK

Query:  TKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP
            + E I   +++L  ++P I +EP  LP
Subjt:  TKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X12.1e-14661.25Show/hide
Query:  MDTPERKKVHLMKKLLK--ARFLPLNYEQTL---YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIA
        M  P+RKKVHL+   LK  A   P++Y Q +   Y+QYQ CRQG + VA+YIEEFHRL AR NLSENEQHQ+ARF+GGLRFDIK+KVKL  FR LSEAI+
Subjt:  MDTPERKKVHLMKKLLK--ARFLPLNYEQTL---YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKLQPFRFLSEAIA

Query:  FAETVEEMIAIRSKNLNRRPAWETSSTK-----SKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKT
         AETVEEM+ +R KN NRR AWET+ +K      KT++QPS+S   KGK +D QE    +KKE   +   QN+Y+RPSLGKCFRCG+ GHLSNNC QRKT
Subjt:  FAETVEEMIAIRSKNLNRRPAWETSSTK-----SKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKT

Query:  IAIAE-EGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYK
        IA+AE E    S    E +EETEL+EADDG+R+ C++QRVLITPKEE N Q H LFKTRCTING+V                            HP+PYK
Subjt:  IAIAE-EGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYK

Query:  IGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGK
        IGW++KGGE  ++EICT+PLSIGN+YKDQIVCDVIEMDVCHLLLGRPWQ+DTQ+LH+GRENTYEFQWMG+KV+LLP+ KK  E +R   ++QLFITVSGK
Subjt:  IGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLR--GEKQLFITVSGK

Query:  NMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP
        N+LKEREQD+LGL+V +K++    E +EP+L++L  EFPH+K+EP+GLP
Subjt:  NMLKEREQDILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G13320.1 unknown protein3.2e-1433.33Show/hide
Query:  LFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKA-EAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEM--DVCHLLLGRPWQYD
        +F+T+C IN   C +++  G+  N ++K LV  L LK  + +P+   +   R+  +    E C VP+SIG+ YKD++ C V+ M  +   LL G PW Y 
Subjt:  LFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKA-EAHPNPYKIGWIRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEM--DVCHLLLGRPWQYD

Query:  TQSLHKGRENTYEFQWMGRKVVL
         Q+ H GR+++    W    ++L
Subjt:  TQSLHKGRENTYEFQWMGRKVVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGAAACGCCACCCCTTTGCCTTTATTTTCTACTTTTTCGGTAAGTGAAATTTCAAATGTGAGCAATGAACCAATCATTTCATCAAGCGAGAGAGTAGTGACATC
ATGTGCTCCTTCAATCGCGATGACTTTCATATCGAATCTCTTTGGTAAAGAGCGTAACACTTTCCTTACCAATCTTGCTTCTGAGATTCCTTCACCAAGATTAAATGCAT
CATTGGCTATTTCCAGTTCTCGAACATTATTCTTTGTGATAGTTTCTTCTTCCATCATTCTTAGTGATTCGAATTTAGAAGTTAACAACAGTACACGAGATGTTTTGACT
TTGGAAGTTCCTTCAAATGTTATCTCCATTTTTCTCCATGCTTCTTTCGCAGTAGCACACAAGCTGATTAGCCTTCAAATGGATACACCTGAACGCAAGAAAGTCCACCT
AATGAAAAAGTTGCTGAAAGCAAGATTCCTACCCCTAAACTATGAACAAACACTGTACAATCAGTACCAAAAATGCCGCCAAGGTGTCCGATCAGTAGCTGACTACATTG
AAGAATTCCACCGCTTGAGTGCAAGAACAAACCTAAGCGAAAATGAACAACATCAGGTTGCAAGATTTGTGGGAGGTCTCCGATTCGACATCAAGGACAAGGTCAAACTA
CAACCATTCCGTTTCCTGTCTGAAGCAATAGCCTTTGCAGAAACAGTGGAGGAGATGATTGCGATTCGATCCAAGAACCTAAATAGAAGACCAGCATGGGAAACAAGTTC
AACAAAAAGCAAAACAAACGACCAACCTTCATCCTCAACAAAAGGAAAAGGGAAGGAAGTTGATAACCAAGAAGTAGGCTTTGAAAGAAAGAAAGAACAAACGTTCAAAC
CTAGTGGTCAGAACAGCTACTCCCGCCCGTCATTAGGAAAATGCTTCCGATGTGGCCAAACTGGCCACCTCTCCAACAACTGCCCGCAAAGAAAAACCATTGCAATAGCC
GAAGAAGGAGGACAGACGAGTGAAGACAGTATAGAAGCAGATGAAGAAACTGAACTGGTTGAAGCAGATGATGGGGAAAGGGTCTATTGCGTTATCCAACGGGTATTAAT
TACGCCCAAGGAAGAAAAGAACCTGCAACGCCACTGTCTTTTTAAGACAAGATGCACCATAAACGGAAGAGTATGTGATGTAATCATAGACAGCGGCAGCAGTGAAAACT
TCGTAGCAAAGAAATTAGTAACAGTCTTGAATCTAAAGGCTGAAGCACATCCAAACCCTTACAAGATAGGTTGGATAAGAAAAGGAGGAGAAGCCACGGTCAGCGAAATC
TGTACAGTCCCTCTCTCCATTGGAAACGCCTACAAAGACCAAATTGTTTGTGATGTCATTGAGATGGATGTATGCCATCTCCTACTAGGAAGACCTTGGCAGTATGATAC
TCAATCATTACACAAAGGGAGAGAAAATACATATGAATTTCAATGGATGGGAAGAAAGGTAGTTTTACTCCCAATAACAAAAAAGATCAACGAAGGATTAAGAGGTGAGA
AACAGTTATTCATTACTGTTAGTGGGAAAAATATGCTTAAAGAAAGGGAACAAGACATCCTAGGACTGGTTGTTATTGAAAAAACAAAGGAAAAACAGGTCGAAGACATA
GAACCCAAATTACAACAGCTCCTTCATGAGTTCCCACATATAAAGGAAGAACCAAAGGGACTCCCCCACTTCGAGACATACAGCACCACATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGGAAACGCCACCCCTTTGCCTTTATTTTCTACTTTTTCGGTAAGTGAAATTTCAAATGTGAGCAATGAACCAATCATTTCATCAAGCGAGAGAGTAGTGACATC
ATGTGCTCCTTCAATCGCGATGACTTTCATATCGAATCTCTTTGGTAAAGAGCGTAACACTTTCCTTACCAATCTTGCTTCTGAGATTCCTTCACCAAGATTAAATGCAT
CATTGGCTATTTCCAGTTCTCGAACATTATTCTTTGTGATAGTTTCTTCTTCCATCATTCTTAGTGATTCGAATTTAGAAGTTAACAACAGTACACGAGATGTTTTGACT
TTGGAAGTTCCTTCAAATGTTATCTCCATTTTTCTCCATGCTTCTTTCGCAGTAGCACACAAGCTGATTAGCCTTCAAATGGATACACCTGAACGCAAGAAAGTCCACCT
AATGAAAAAGTTGCTGAAAGCAAGATTCCTACCCCTAAACTATGAACAAACACTGTACAATCAGTACCAAAAATGCCGCCAAGGTGTCCGATCAGTAGCTGACTACATTG
AAGAATTCCACCGCTTGAGTGCAAGAACAAACCTAAGCGAAAATGAACAACATCAGGTTGCAAGATTTGTGGGAGGTCTCCGATTCGACATCAAGGACAAGGTCAAACTA
CAACCATTCCGTTTCCTGTCTGAAGCAATAGCCTTTGCAGAAACAGTGGAGGAGATGATTGCGATTCGATCCAAGAACCTAAATAGAAGACCAGCATGGGAAACAAGTTC
AACAAAAAGCAAAACAAACGACCAACCTTCATCCTCAACAAAAGGAAAAGGGAAGGAAGTTGATAACCAAGAAGTAGGCTTTGAAAGAAAGAAAGAACAAACGTTCAAAC
CTAGTGGTCAGAACAGCTACTCCCGCCCGTCATTAGGAAAATGCTTCCGATGTGGCCAAACTGGCCACCTCTCCAACAACTGCCCGCAAAGAAAAACCATTGCAATAGCC
GAAGAAGGAGGACAGACGAGTGAAGACAGTATAGAAGCAGATGAAGAAACTGAACTGGTTGAAGCAGATGATGGGGAAAGGGTCTATTGCGTTATCCAACGGGTATTAAT
TACGCCCAAGGAAGAAAAGAACCTGCAACGCCACTGTCTTTTTAAGACAAGATGCACCATAAACGGAAGAGTATGTGATGTAATCATAGACAGCGGCAGCAGTGAAAACT
TCGTAGCAAAGAAATTAGTAACAGTCTTGAATCTAAAGGCTGAAGCACATCCAAACCCTTACAAGATAGGTTGGATAAGAAAAGGAGGAGAAGCCACGGTCAGCGAAATC
TGTACAGTCCCTCTCTCCATTGGAAACGCCTACAAAGACCAAATTGTTTGTGATGTCATTGAGATGGATGTATGCCATCTCCTACTAGGAAGACCTTGGCAGTATGATAC
TCAATCATTACACAAAGGGAGAGAAAATACATATGAATTTCAATGGATGGGAAGAAAGGTAGTTTTACTCCCAATAACAAAAAAGATCAACGAAGGATTAAGAGGTGAGA
AACAGTTATTCATTACTGTTAGTGGGAAAAATATGCTTAAAGAAAGGGAACAAGACATCCTAGGACTGGTTGTTATTGAAAAAACAAAGGAAAAACAGGTCGAAGACATA
GAACCCAAATTACAACAGCTCCTTCATGAGTTCCCACATATAAAGGAAGAACCAAAGGGACTCCCCCACTTCGAGACATACAGCACCACATAG
Protein sequenceShow/hide protein sequence
MDGNATPLPLFSTFSVSEISNVSNEPIISSSERVVTSCAPSIAMTFISNLFGKERNTFLTNLASEIPSPRLNASLAISSSRTLFFVIVSSSIILSDSNLEVNNSTRDVLT
LEVPSNVISIFLHASFAVAHKLISLQMDTPERKKVHLMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKDKVKL
QPFRFLSEAIAFAETVEEMIAIRSKNLNRRPAWETSSTKSKTNDQPSSSTKGKGKEVDNQEVGFERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIA
EEGGQTSEDSIEADEETELVEADDGERVYCVIQRVLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWIRKGGEATVSEI
CTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKQVEDI
EPKLQQLLHEFPHIKEEPKGLPHFETYSTT