CuGenDBv2

Lag0022862 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022862
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr7:39491563..39502470
RNA-Seq Expression: Lag0022862
Synteny: Lag0022862
Gene Ontology terms: NA
InterPro domains:
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily
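
The genome location above uses a `seqid:start..end` notation. The following is a minimal sketch of how such a string could be parsed and the locus span computed. It assumes the coordinates are 1-based and inclusive, which the page does not state; the names `GenomicInterval` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class GenomicInterval(NamedTuple):
    """A locus on one sequence; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings of the form "chr7:39491563..39502470".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomicInterval:
    """Parse a 'seqid:start..end' location string into a GenomicInterval."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomicInterval(match["seqid"], start, end)


if __name__ == "__main__":
    locus = parse_location("chr7:39491563..39502470")
    print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the locus for Lag0022862 spans 10,908 bp on chr7.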


Homology
GenBank top hits | e-value | % identity | Alignment
KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis] | 1.3e-49 | 37.19
Query:  TSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETKLA--EEKKTT-------------EPEELTGGVEEGTTSNEAEKLNPE
        +++ A +KN+E Q+GQL + +N+  +G  P   E +  E CKA+++   +E + A  +E K+T             E EE+     E T           
Subjt:  TSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETKLA--EEKKTT-------------EPEELTGGVEEGTTSNEAEKLNPE

Query:  PSIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG
        P I +P +   +  +K+K   +F KFLD F  +H+NIPF+DALEQMP+Y KF+K+ ++KK++ ++ ETV L+  CSA +Q+ +P+KL DPGSFT+PC  G
Subjt:  PSIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG

Query:  D-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERR
        D                             +K T++ LQLAD+S+  P GI+E++L+KV +F  P DF VLD++E+  +P+ILGRPFLATGR +ID+++ 
Subjt:  D-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERR

Query:  ELIIRVQQEKEVLKAFEDPK
        EL +RV +E+ +   ++  K
Subjt:  ELIIRVQQEKEVLKAFEDPK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis] | 8.9e-51 | 37.22
Query:  TSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEE---------------TKLAEEKKTTEPEELTGGVEEGTTSNEAEKLNPE
        +++ AAIKNIE Q+GQL + +N+  +G  P   E +  E CKA+++   +E                 + + K   E +E+     E T           
Subjt:  TSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEE---------------TKLAEEKKTTEPEELTGGVEEGTTSNEAEKLNPE

Query:  PSIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG
        P I +P +   +  +K+K   +F KFLD F  +H+NIPF+DALEQMP+Y KF+K+ ++KK++ ++ ETV L+  CSA +Q+ +P+KL DPGSFT+PC  G
Subjt:  PSIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG

Query:  D-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERR
        D                             +K T++ LQLAD+S+  P GI+E++L+KV +F  P DF VLD++E+  +P+ILGRPFLATGR +ID+++ 
Subjt:  D-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERR

Query:  ELIIRVQQEKEVLKAFE
        EL +RV +E+ + K ++
Subjt:  ELIIRVQQEKEVLKAFE

XP_010668059.1 PREDICTED: uncharacterized protein LOC104885048 [Beta vulgaris subsp. vulgaris] | 4.0e-51 | 39.37
Query:  HLKMESMHFNSIWDLANQ------SYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETKLAEEKKTTEPEE
        H K         W+LA +      S  I+K   K   I+ + +N+E QLGQL + +NS ++G  P + E +  ++C  V++   +E  L+  K T +  E
Subjt:  HLKMESMHFNSIWDLANQ------SYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETKLAEEKKTTEPEE

Query:  LTGGVEEGTTSNEAEK---LNPEPSIPS--PTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTC
        +    E+   + EAEK   L P P +    P I   +  K+ KS  +F KFL  F  LH+NIPF DAL Q+P Y KF+KE +++KKK K+ ET+ L+  C
Subjt:  LTGGVEEGTTSNEAEK---LNPEPSIPS--PTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTC

Query:  SARVQQGVPEKLSDPGSFTIPCNFGD-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKE
        SA +Q+ +P KL DPGSF+IPC  GD                             IKST+V LQL D+S+  P GI+EN+LIKV +F +PVDF +LD+ E
Subjt:  SARVQQGVPEKLSDPGSFTIPCNFGD-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKE

Query:  NPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTS
        +  IPIILGRPFLAT   IID++   L   + +EK     F  PKN S
Subjt:  NPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTS

XP_010694411.1 PREDICTED: uncharacterized protein LOC104907215 [Beta vulgaris subsp. vulgaris] | 1.3e-49 | 36.97
Query:  ENKEFVPTNATFSEEDHMRNNKPQSKDQWIKAMHLKMESMHFNSIWDLANQSYE-IDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSS
        + ++ V  N+    + H +  + +++  W  A            I  LAN + E I+K   K    + + +N+E QLGQL + +NS ++G  P + E + 
Subjt:  ENKEFVPTNATFSEEDHMRNNKPQSKDQWIKAMHLKMESMHFNSIWDLANQSYE-IDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSS

Query:  LEYCKAVSVHYEEETKLAEEKKTTEPEELTGGVEEGTTSNEAEK---LNPEPSIPS--PTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMP
         ++C AV++   +E  L+  K T +  E+    E+ T + EAEK   L P P +    P I   +  K+ K   +F+KFL  F  LH+NIPF DAL Q+P
Subjt:  LEYCKAVSVHYEEETKLAEEKKTTEPEELTGGVEEGTTSNEAEK---LNPEPSIPS--PTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMP

Query:  HYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFGD-----------------------------IKSTSVRLQLADQSVVS
         Y KF+KE ++KKKK ++ ET+ L+  CSA +Q+ +P KL DPGSF+IPC  GD                             IKST+V LQLAD+S+  
Subjt:  HYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFGD-----------------------------IKSTSVRLQLADQSVVS

Query:  PYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTS
        P GI+EN+LIKV +F +PVDF +LD+ E+  +PIILGRPFLAT   IID++   L   + +EK     F   K+ S
Subjt:  PYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTS

XP_019108345.1 PREDICTED: uncharacterized protein LOC104908276 [Beta vulgaris subsp. vulgaris] | 1.7e-49 | 37.93
Query:  HLKMESMHFNSIWDLANQ------SYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETKLAEEKKTTEPEE
        H K         W+LA +      S  I+K   K   I+ + +N+E QLGQL + +NS ++G  P   E +  ++C AV++   +E  +   K T +  E
Subjt:  HLKMESMHFNSIWDLANQ------SYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETKLAEEKKTTEPEE

Query:  LTGGVEEGTTSNEAEKLNPEPSIPS-----PTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTC
        +    E+   + EAEK      +P      P I   +  K+ K   +F+KFL  F  LH+NIPF DAL Q+P Y KF+KE ++KKKK ++ ET+ L+  C
Subjt:  LTGGVEEGTTSNEAEKLNPEPSIPS-----PTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTC

Query:  SARVQQGVPEKLSDPGSFTIPCNFGD-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKE
        SA +Q+ +P KL DPGSF+IPC  GD                             IKST+V LQLAD+S+  P GI+EN+LIKV +F +PVDF +LD+ E
Subjt:  SARVQQGVPEKLSDPGSFTIPCNFGD-----------------------------IKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKE

Query:  NPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTS
        +  +PIILGRPFLAT   IID++   L   + +EK     F   K+ S
Subjt:  NPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTS

TrEMBL top hits | e-value | % identity | Alignment
A0A5N6LUB5 Retrotrans_gag domain-containing protein | 1.2e-45 | 33.68
Query:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE
        N  + +++    ++   +    +N  +   Q++   +A H K E+ H  +  D  +Q +E      +  S  +AI+ IE Q+GQ+  ++    KGK P  
Subjt:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE

Query:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD
         E +  E+CKAV++   + TK  +   T++P   EE+    E   T  ++    P  EP  +  PTI      K +  +  + KFLD F  LH+N+PF +
Subjt:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD

Query:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA
        AL QMP Y KF+K+ L  K+K ++L  V L   CSA +Q  +PEK+ DPGSFTIPC                             + G+ K T + +QLA
Subjt:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA

Query:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN
        D+SV  P GIVEN+L+K+G+F  PVDF +LD+ E+  +P+ILGRPFLAT R ++D+   +L +RV +E+ V +  +  ++T +
Subjt:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN

A0A5N6MBJ1 Reverse transcriptase | 1.2e-45 | 33.68
Query:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE
        N  + +++    ++   +    +N  +   Q++   +A H K E+ H  +  D  +Q +E      +  S  +AI+ IE Q+GQ+  ++    KGK P  
Subjt:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE

Query:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD
         E +  E+CKAV++   + TK  +   T++P   EE+    E   T  ++    P  EP  +  PTI      K +  +  + KFLD F  LH+N+PF +
Subjt:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD

Query:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA
        AL QMP Y KF+K+ L  K+K ++L  V L   CSA +Q  +PEK+ DPGSFTIPC                             + G+ K T + +QLA
Subjt:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA

Query:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN
        D+SV  P GIVEN+L+K+G+F  PVDF +LD+ E+  +P+ILGRPFLAT R ++D+   +L +RV +E+ V +  +  ++T +
Subjt:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN

A0A5N6N4K2 Reverse transcriptase | 3.2e-46 | 33.94
Query:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE
        N  + +++    ++   +    +N  +   Q++   +A H K E+ H  +  D  +Q +E      +  S  +AI+ IE Q+GQ+  ++    KGK P  
Subjt:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE

Query:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD
         E +  E+CKAV++   + TK  +   T++P   EE+    E   T  ++    P  EP  +  PTI      K +K +  + KFLD F  LH+N+PF +
Subjt:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD

Query:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA
        AL QMP Y KF+K+ L  K+K ++L  V L   CSA +Q  +PEK+ DPGSFTIPC                             + G+ K T + +QLA
Subjt:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA

Query:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN
        D+SV  P GIVEN+L+K+G+F  PVDF +LD+ E+  +P+ILGRPFLAT R ++D+   +L +RV +E+ V +  +  ++T +
Subjt:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN

A0A5N6N9T2 Reverse transcriptase | 3.8e-47 | 36.89
Query:  KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETK---LAEEKKTTEPEE
        +A H K E+ H  +  D  +Q +E      +  S  + I+ IE Q+GQ+  ++    KGK P   E +  E+CKAV++   + TK   LA   K T  EE
Subjt:  KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAVSVHYEEETK---LAEEKKTTEPEE

Query:  LTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSA
        +    E   T  ++    P  EP  +  PTI      K +K +  + KFLD F  LH+N+PF +AL QMP Y KF+K++L  K+K ++L  V L   CSA
Subjt:  LTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSA

Query:  RVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENP
         +Q  +PEK+ DPGSFTIPC                             + G+ K T + +QLAD+SV  P GIVEN+L+K+G+F  PVDF +LD+ E+ 
Subjt:  RVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENP

Query:  VIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN
         +P+ILGRPFLAT R ++D+   +L +RV +E+ V +  +  ++T +
Subjt:  VIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN

A0A5N6P787 Retrotrans_gag domain-containing protein | 1.2e-45 | 33.68
Query:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE
        N  + +++    ++   +    +N  +   Q++   +A H K E+ H  +  D  +Q +E      +  S  +AI+ IE Q+GQ+  ++    KGK P  
Subjt:  NPQENKEFVPTNATFSEEDHMRNNKPQSKDQWI---KAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVE

Query:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD
         E +  E+CKAV++   + TK  +   T++P   EE+    E   T  ++    P  EP  +  PTI      K +  +  + KFLD F  LH+N+PF +
Subjt:  QEKSSLEYCKAVSVHYEEETKLAEEKKTTEP---EELTGGVEEGTTSNEAEKLNP--EP-SIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSD

Query:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA
        AL QMP Y KF+K+ L  K+K ++L  V L   CSA +Q  +PEK+ DPGSFTIPC                             + G+ K T + +QLA
Subjt:  ALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGDIKSTSVRLQLA

Query:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN
        D+SV  P GIVEN+L+K+G+F  PVDF +LD+ E+  +P+ILGRPFLAT R ++D+   +L +RV +E+ V +  +  ++T +
Subjt:  DQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSN

SwissProt top hits | e-value | % identity | Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 2.8e-07 | 42.67
Query:  PIIVAPNWNQPFDIMCDASDYALGVVLGQLRDKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLG
        PI+  P++ + F +  DASD ALG VL Q        + Y SRTL+  + NY+ I+KE+  +V+A   FR YLLG
Subjt:  PIIVAPNWNQPFDIMCDASDYALGVVLGQLRDKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLG

P0CT41 Transposon Tf2-12 polyprotein | 5.0e-04 | 32.5
Query:  SAPIIVAPNWNQPFDIMCDASDYALGVVLGQLR--DKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLGT
        S P++   ++++   +  DASD A+G VL Q    DK++   YY+++ +  A+ NY++ DKEM  ++ +   +R YL  T
Subjt:  SAPIIVAPNWNQPFDIMCDASDYALGVVLGQLR--DKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLGT

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 1.8e-06 | 41.33
Query:  PIIVAPNWNQPFDIMCDASDYALGVVLGQLRDKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLG
        PI+  P++ + F +  DAS+ ALG VL Q        I + SRTL+  + NY+ I+KE+  +V+A   FR YLLG
Subjt:  PIIVAPNWNQPFDIMCDASDYALGVVLGQLRDKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | 1.9e-08 | 37.66
Query:  SAPIIVAPNWNQPFDIMCDASDYALGVVLGQLRDKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLG
        S+ I+  P + +PF +  DAS++A+G VL Q      + I Y SR+L+  ++NY  I+KEM  ++++ D  R YL G
Subjt:  SAPIIVAPNWNQPFDIMCDASDYALGVVLGQLRDKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLG

Q9UR07 Transposon Tf2-11 polyprotein | 5.0e-04 | 32.5
Query:  SAPIIVAPNWNQPFDIMCDASDYALGVVLGQLR--DKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLGT
        S P++   ++++   +  DASD A+G VL Q    DK++   YY+++ +  A+ NY++ DKEM  ++ +   +R YL  T
Subjt:  SAPIIVAPNWNQPFDIMCDASDYALGVVLGQLR--DKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLGT

Arabidopsis top hits | e-value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAAGCACGCAAAGGAGAGGAAATCGAGAGAGAGAGAGAGACGCCGGAGATGGAGATTCGTGTGGGTCTTGCTGCTTGAGCCGAACGTCAATGCGAGAGAGAGAGAAGA
CGTATGCCGCGGGGAGGAGAAACCAAGAGAGAGGGAGCTGCGGCTAGACACGCGAGAGAGGAAGAGATCTGCGGTTGGCGGTTTCAACGCGAGGAAGAATGAGAGAGAGA
TATGGGTCTTCGGGGGGCTTGGCTGCACAAGAAGAAGAAACTCAGAGAGGACAGGTGCAGACAGCAAGGTAAGCAAGTTTAAATCTGAACTGTCTGAGCTTAAGGCTCTC
ATTAATCAGTTGGCTGCAGAGAAGGAATCCTCGTCTGAATGGAATACATATGAAGGATTTCCAGCAATGCGAAGTCAACTCAAATTCCAAGAGGAGACTAGAGCTGGGAT
GCAACAATTGGGAGACCAAGTTTCTCAATTGGCTGATACTGTCAGAAAATTGGAGGCCAAAATTGATAAACTACCTATTCATCCTGAGATCTCGAGAAGAGAAAAAGTTA
GCCCTGCAACCTACGCCTCCCAAGGAAAAGGAAAAAGAAGTCAACATCCCGCTTCTGCACCTATCATTGTTGCGCCTAATTGGAATCAACCATTTGATATCATGTGTGAT
GCTAGTGACTATGCTTTAGGAGTTGTTTTAGGGCAACTTCGTGATAAATTCTTTAAAGCTATATATTATGCAAGTAGAACTCTAGATAGTGCTAAACAGAATTACACCAT
TATTGACAAAGAGATGTTTGTTGTCGTATTTGCATTTGATAAATTTAGGCCCTATTTGCTTGGAACAAAGGACGAGAATGTTGGGTTTTATGCCCTAAAACTCGTAGGTA
GTGAATGTAAACAAATTTGCGCCGACTCAATAAGCCTATCATTTTGGGGACAAAGCCGAGTGGGGAGCTGGGAACATAATCACACAAGATGGAATTCACTCCTTCCCGAC
TTTGAGGAAGTAGACCAGTGTTCCCTTAAGTGGTGTCTATGGGTCTTAAACAAAGGGCTGGGGTTTCTGTTTAGTGGTTGGATCACAAACAGAGTCTGTAAGTCCGGGTC
TAACATAGGGATGCCGATTGTTGAAATCACGGATGAATTTGAGGCCGAAAACATGAAGATCTGTCGCGGATTACGCCCAGAATGTGCTCCAGACGCAATAGAACGAGACG
TTGTCGCGATTCCAAGAGGACTGCAAGCGGGCGACGAAACGAAGACGACCGTGGTTCAGATCTGGCACAAAATCGAGGAGAAAACCCATATCTACCGAAGAAATCGAATG
AGGGCAAAAGACTACGAACGGGCTACAAATGAGCAGCGAACGGACTGCAGTCGAGCGGTGAACGGACTACAAGTGGGCAAACGAGCAGAGCGATTGCAAACGAGCGACGA
CTGCGAGCTACAAACGAGCGACGAACGGACTAAGAAGAAAGTCGGAAAGGAGCGACCGACGGTGGAGAAGAAGAGGGTGTCGTGGAGAAGAAGGAGAAAAGAGAAAGTTG
ACTGTCTGGCGAATTTTGAACCACGTTTTATCAAAAGTGTTTCTGAAACATTTTTTGAGTTGTGGAGAGGGCGTAAAGGATACCCTATGGAAATGAGATGTGGATACTTT
TTCAATCCACAAGAGAATAAAGAATTTGTACCGACAAATGCTACATTCTCAGAAGAAGACCACATGAGAAATAATAAGCCACAAAGTAAGGACCAGTGGATTAAAGCCAT
GCACTTGAAAATGGAATCTATGCATTTCAATTCTATCTGGGATCTTGCAAATCAATCTTATGAGATTGATAAGGATTTGAGGAAATTTACATCAATATCTGCAGCAATCA
AGAATATAGAGACTCAACTGGGACAACTGGTAAGTGTAGTCAACTCTATGAATAAAGGTAAAGCCCCAGTTGAACAGGAGAAATCTTCTTTGGAGTACTGCAAGGCTGTA
TCTGTGCATTATGAGGAGGAGACTAAATTAGCTGAAGAGAAGAAAACTACTGAACCAGAGGAACTCACAGGAGGAGTTGAAGAAGGCACCACCTCAAACGAAGCTGAAAA
GCTTAATCCTGAGCCTTCTATCCCTTCTCCTACTATTTTAGTCCATAAATCAAAGAAAAAGAAAAAATCTAAGGTTAAGTTTGACAAATTTTTAGATGCTTTTATGGGTT
TGCATGTTAATATTCCTTTTTCAGATGCTCTGGAGCAGATGCCTCATTACAGAAAATTCATGAAGGAATGGCTCAACAAGAAGAAAAAGGAAAAGCAGTTGGAGACTGTA
TATCTTGCATCGACGTGCAGTGCTCGTGTCCAACAGGGAGTACCAGAGAAATTGTCTGACCCGGGGAGTTTTACTATTCCTTGTAATTTTGGAGACATAAAATCTACTTC
TGTTAGACTTCAACTGGCTGACCAATCTGTGGTTAGTCCATATGGGATTGTTGAGAATATTCTTATTAAAGTAGGTAGATTTTTCCTTCCTGTTGATTTCTTTGTGCTAG
ATATTAAAGAGAATCCTGTTATACCTATCATATTAGGGAGACCATTCCTTGCTACAGGAAGGGTTATAATTGATATTGAACGGAGGGAGCTAATCATAAGAGTCCAACAG
GAGAAGGAAGTTTTAAAAGCTTTTGAGGACCCCAAGAACACATCAAATACAATGATGGAAATGGTTGATTGGAGCTATAATGCAATATTAGAGTTAATCGGGTGCTCGGG
ACGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGAGGAAAAAAGTCAAATTTCAGTCAACAGCAGGCTAGCGTCGAGACGGTAGCTCTTGAGCGTCGAGACGCTCACA
TTCCATATCTGATTAGGCGCGTAAAGGTCAAAGCGTCGAGACGCTGCGACCTTAGCGTCCCGACGCTGTGTTATTTCGCTTACTGA
mRNA sequence
ATGAAGCACGCAAAGGAGAGGAAATCGAGAGAGAGAGAGAGACGCCGGAGATGGAGATTCGTGTGGGTCTTGCTGCTTGAGCCGAACGTCAATGCGAGAGAGAGAGAAGA
CGTATGCCGCGGGGAGGAGAAACCAAGAGAGAGGGAGCTGCGGCTAGACACGCGAGAGAGGAAGAGATCTGCGGTTGGCGGTTTCAACGCGAGGAAGAATGAGAGAGAGA
TATGGGTCTTCGGGGGGCTTGGCTGCACAAGAAGAAGAAACTCAGAGAGGACAGGTGCAGACAGCAAGGTAAGCAAGTTTAAATCTGAACTGTCTGAGCTTAAGGCTCTC
ATTAATCAGTTGGCTGCAGAGAAGGAATCCTCGTCTGAATGGAATACATATGAAGGATTTCCAGCAATGCGAAGTCAACTCAAATTCCAAGAGGAGACTAGAGCTGGGAT
GCAACAATTGGGAGACCAAGTTTCTCAATTGGCTGATACTGTCAGAAAATTGGAGGCCAAAATTGATAAACTACCTATTCATCCTGAGATCTCGAGAAGAGAAAAAGTTA
GCCCTGCAACCTACGCCTCCCAAGGAAAAGGAAAAAGAAGTCAACATCCCGCTTCTGCACCTATCATTGTTGCGCCTAATTGGAATCAACCATTTGATATCATGTGTGAT
GCTAGTGACTATGCTTTAGGAGTTGTTTTAGGGCAACTTCGTGATAAATTCTTTAAAGCTATATATTATGCAAGTAGAACTCTAGATAGTGCTAAACAGAATTACACCAT
TATTGACAAAGAGATGTTTGTTGTCGTATTTGCATTTGATAAATTTAGGCCCTATTTGCTTGGAACAAAGGACGAGAATGTTGGGTTTTATGCCCTAAAACTCGTAGGTA
GTGAATGTAAACAAATTTGCGCCGACTCAATAAGCCTATCATTTTGGGGACAAAGCCGAGTGGGGAGCTGGGAACATAATCACACAAGATGGAATTCACTCCTTCCCGAC
TTTGAGGAAGTAGACCAGTGTTCCCTTAAGTGGTGTCTATGGGTCTTAAACAAAGGGCTGGGGTTTCTGTTTAGTGGTTGGATCACAAACAGAGTCTGTAAGTCCGGGTC
TAACATAGGGATGCCGATTGTTGAAATCACGGATGAATTTGAGGCCGAAAACATGAAGATCTGTCGCGGATTACGCCCAGAATGTGCTCCAGACGCAATAGAACGAGACG
TTGTCGCGATTCCAAGAGGACTGCAAGCGGGCGACGAAACGAAGACGACCGTGGTTCAGATCTGGCACAAAATCGAGGAGAAAACCCATATCTACCGAAGAAATCGAATG
AGGGCAAAAGACTACGAACGGGCTACAAATGAGCAGCGAACGGACTGCAGTCGAGCGGTGAACGGACTACAAGTGGGCAAACGAGCAGAGCGATTGCAAACGAGCGACGA
CTGCGAGCTACAAACGAGCGACGAACGGACTAAGAAGAAAGTCGGAAAGGAGCGACCGACGGTGGAGAAGAAGAGGGTGTCGTGGAGAAGAAGGAGAAAAGAGAAAGTTG
ACTGTCTGGCGAATTTTGAACCACGTTTTATCAAAAGTGTTTCTGAAACATTTTTTGAGTTGTGGAGAGGGCGTAAAGGATACCCTATGGAAATGAGATGTGGATACTTT
TTCAATCCACAAGAGAATAAAGAATTTGTACCGACAAATGCTACATTCTCAGAAGAAGACCACATGAGAAATAATAAGCCACAAAGTAAGGACCAGTGGATTAAAGCCAT
GCACTTGAAAATGGAATCTATGCATTTCAATTCTATCTGGGATCTTGCAAATCAATCTTATGAGATTGATAAGGATTTGAGGAAATTTACATCAATATCTGCAGCAATCA
AGAATATAGAGACTCAACTGGGACAACTGGTAAGTGTAGTCAACTCTATGAATAAAGGTAAAGCCCCAGTTGAACAGGAGAAATCTTCTTTGGAGTACTGCAAGGCTGTA
TCTGTGCATTATGAGGAGGAGACTAAATTAGCTGAAGAGAAGAAAACTACTGAACCAGAGGAACTCACAGGAGGAGTTGAAGAAGGCACCACCTCAAACGAAGCTGAAAA
GCTTAATCCTGAGCCTTCTATCCCTTCTCCTACTATTTTAGTCCATAAATCAAAGAAAAAGAAAAAATCTAAGGTTAAGTTTGACAAATTTTTAGATGCTTTTATGGGTT
TGCATGTTAATATTCCTTTTTCAGATGCTCTGGAGCAGATGCCTCATTACAGAAAATTCATGAAGGAATGGCTCAACAAGAAGAAAAAGGAAAAGCAGTTGGAGACTGTA
TATCTTGCATCGACGTGCAGTGCTCGTGTCCAACAGGGAGTACCAGAGAAATTGTCTGACCCGGGGAGTTTTACTATTCCTTGTAATTTTGGAGACATAAAATCTACTTC
TGTTAGACTTCAACTGGCTGACCAATCTGTGGTTAGTCCATATGGGATTGTTGAGAATATTCTTATTAAAGTAGGTAGATTTTTCCTTCCTGTTGATTTCTTTGTGCTAG
ATATTAAAGAGAATCCTGTTATACCTATCATATTAGGGAGACCATTCCTTGCTACAGGAAGGGTTATAATTGATATTGAACGGAGGGAGCTAATCATAAGAGTCCAACAG
GAGAAGGAAGTTTTAAAAGCTTTTGAGGACCCCAAGAACACATCAAATACAATGATGGAAATGGTTGATTGGAGCTATAATGCAATATTAGAGTTAATCGGGTGCTCGGG
ACGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGAGGAAAAAAGTCAAATTTCAGTCAACAGCAGGCTAGCGTCGAGACGGTAGCTCTTGAGCGTCGAGACGCTCACA
TTCCATATCTGATTAGGCGCGTAAAGGTCAAAGCGTCGAGACGCTGCGACCTTAGCGTCCCGACGCTGTGTTATTTCGCTTACTGA
Protein sequence
MKHAKERKSRERERRRRWRFVWVLLLEPNVNAREREDVCRGEEKPRERELRLDTRERKRSAVGGFNARKNEREIWVFGGLGCTRRRNSERTGADSKVSKFKSELSELKAL
INQLAAEKESSSEWNTYEGFPAMRSQLKFQEETRAGMQQLGDQVSQLADTVRKLEAKIDKLPIHPEISRREKVSPATYASQGKGKRSQHPASAPIIVAPNWNQPFDIMCD
ASDYALGVVLGQLRDKFFKAIYYASRTLDSAKQNYTIIDKEMFVVVFAFDKFRPYLLGTKDENVGFYALKLVGSECKQICADSISLSFWGQSRVGSWEHNHTRWNSLLPD
FEEVDQCSLKWCLWVLNKGLGFLFSGWITNRVCKSGSNIGMPIVEITDEFEAENMKICRGLRPECAPDAIERDVVAIPRGLQAGDETKTTVVQIWHKIEEKTHIYRRNRM
RAKDYERATNEQRTDCSRAVNGLQVGKRAERLQTSDDCELQTSDERTKKKVGKERPTVEKKRVSWRRRRKEKVDCLANFEPRFIKSVSETFFELWRGRKGYPMEMRCGYF
FNPQENKEFVPTNATFSEEDHMRNNKPQSKDQWIKAMHLKMESMHFNSIWDLANQSYEIDKDLRKFTSISAAIKNIETQLGQLVSVVNSMNKGKAPVEQEKSSLEYCKAV
SVHYEEETKLAEEKKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSPTILVHKSKKKKKSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETV
YLASTCSARVQQGVPEKLSDPGSFTIPCNFGDIKSTSVRLQLADQSVVSPYGIVENILIKVGRFFLPVDFFVLDIKENPVIPIILGRPFLATGRVIIDIERRELIIRVQQ
EKEVLKAFEDPKNTSNTMMEMVDWSYNAILELIGCSGREKMQRNEKSKRGKKSNFSQQQASVETVALERRDAHIPYLIRRVKVKASRRCDLSVPTLCYFAY
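
The CDS and protein sequences listed above can be cross-checked by translating the CDS and comparing the result with the protein. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the function names `translate` and `matches_protein` are hypothetical and not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; line breaks and other whitespace are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' marks codons with ambiguous bases
        for i in range(0, len(seq), 3)
    )
    # Drop a single terminal stop codon, as protein records usually omit it.
    return protein[:-1] if protein.endswith("*") else protein


def matches_protein(cds: str, protein: str) -> bool:
    """Return True if the translated CDS equals the given protein sequence."""
    return translate(cds) == "".join(protein.split()).upper()


if __name__ == "__main__":
    # First 33 nucleotides of the CDS listed on this page.
    print(translate("ATGAAGCACGCAAAGGAGAGGAAATCGAGAGAG"))  # MKHAKERKSRE
```

Passing the full CDS and protein text from this page (with line breaks left in place) to `matches_protein` would indicate whether the two records agree.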