; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g14840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g14840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:9267266..9268054
RNA-Seq ExpressionMoc01g14840
SyntenyMoc01g14840
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa]4.9e-10567.64Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        +MEGP+LGIADVTKPFEVE DASD+ LGGVLLQ+GHP+AY++RKLN A++RY  SEKEMLAVVHCLRA RQYLLGS FV             + +L+SKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE+LAE DF+FEH+ G SNQAAD LSRK EHAA+C+LAHL+ S++  SVR+ ++  L+ D   + ++ L +  KTRQFWVE+DLL TKGNRLYVPRA
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        G LRK L+ ECHDT+WAGH GW RTYALLKKGY+W N+RDDVMQYTKTCLICQQDKV+++K+A LL+P PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

KAA0059106.1 reverse transcriptase [Cucumis melo var. makuwa]1.3e-10568Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        +MEGP+LGIADVTKPFEVE DASD+ LGGVLLQ+GHP+AY++RKLN A++RY  SEKEMLAVVHCLRA RQYLLGS FV             + +L+SKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE+LAE DF+FEH+ G SNQAAD LSRK EHAA+C+LAHL+ S++  SVR+ ++  L+ D   + ++ L +  KTRQFWVEDDL  TKGNRLYVPRA
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        GNLRK L+ ECHDT+WAGH GW RTYALLKKGY+W N+RDDVMQYTKTCLICQQDKV+++K+A LL+P PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

KAA0067557.1 reverse transcriptase [Cucumis melo var. makuwa]4.9e-10567.64Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        +MEGP+LGIADVTKPFEVE DASD+ LGGVLLQ+GHP+AY++RKLN A++RY  SEKEMLAVVHCLRA RQYLLGS FV             + +L+SKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE+LAE DF+FEH+ G SNQAAD LSRK EHAA+C+LAHL+ S++  SVR+ ++  L+ D   + ++ L +  KTRQFWVE+DLL TKGNRLYVPRA
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        G LRK L+ ECHDT+WAGH GW RTYALLKKGY+W N+RDDVMQYTKTCLICQQDKV+++K+A LL+P PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]4.0e-11576.64Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        MMEG +LGIADVT+PFEVE DASDF LGGVLLQDGHP+AY+++KLNDA++RYAASEKEMLAVVHCLRA RQYLLG+KFV             + +LSSKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQEYLAE DFQFEH+PGR+NQAAD LSRKSE AALCMLAHLKASKLT S+REAI+ +L++DP  + IIQL  +  TRQF VE+DL FTKGN LYVPR+
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSR
        GNLRKLL+GECHDTMWAGHAGW RTYALLKKGYYW +LRDDVMQYTKTCLICQQDKV+R KIA LLEP P+PSR
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSR

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]9.8e-10667.27Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        M  GP+LG+ DVTKPFEVE DASDF LGGVL+Q+GHP+AY++RKLNDA++RY  SEKEMLAVVHCLR  RQYLLGS+FV             + +L++KQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE LAE DF+FEH+ G+SNQAAD LSRK EHAALCMLAH+ +SK+  S+R+ IK HL  DP+ K +++L +  KTRQFWVE DLL TKGNRLYVPR 
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        G LRK L+ ECHDT+WAGH GW RTYAL+KKGY+W N+RDD+MQYTKTCLICQQDKV++ K++ LLEP PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

TrEMBL top hitse value%identityAlignment
A0A5A7UXR6 Reverse transcriptase2.4e-10567.64Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        +MEGP+LGIADVTKPFEVE DASD+ LGGVLLQ+GHP+AY++RKLN A++RY  SEKEMLAVVHCLRA RQYLLGS FV             + +L+SKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE+LAE DF+FEH+ G SNQAAD LSRK EHAA+C+LAHL+ S++  SVR+ ++  L+ D   + ++ L +  KTRQFWVE+DLL TKGNRLYVPRA
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        G LRK L+ ECHDT+WAGH GW RTYALLKKGY+W N+RDDVMQYTKTCLICQQDKV+++K+A LL+P PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

A0A5A7UY33 Reverse transcriptase6.2e-10668Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        +MEGP+LGIADVTKPFEVE DASD+ LGGVLLQ+GHP+AY++RKLN A++RY  SEKEMLAVVHCLRA RQYLLGS FV             + +L+SKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE+LAE DF+FEH+ G SNQAAD LSRK EHAA+C+LAHL+ S++  SVR+ ++  L+ D   + ++ L +  KTRQFWVEDDL  TKGNRLYVPRA
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        GNLRK L+ ECHDT+WAGH GW RTYALLKKGY+W N+RDDVMQYTKTCLICQQDKV+++K+A LL+P PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

A0A5D3BRZ6 Reverse transcriptase2.4e-10567.64Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        +MEGP+LGIADVTKPFEVE DASD+ LGGVLLQ+GHP+AY++RKLN A++RY  SEKEMLAVVHCLRA RQYLLGS FV             + +L+SKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE+LAE DF+FEH+ G SNQAAD LSRK EHAA+C+LAHL+ S++  SVR+ ++  L+ D   + ++ L +  KTRQFWVE+DLL TKGNRLYVPRA
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        G LRK L+ ECHDT+WAGH GW RTYALLKKGY+W N+RDDVMQYTKTCLICQQDKV+++K+A LL+P PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

A0A5D3C4R1 Reverse transcriptase2.4e-10567.64Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        +MEGP+LGIADVTKPFEVE DASD+ LGGVLLQ+GHP+AY++RKLN A++RY  SEKEMLAVVHCLRA RQYLLGS FV             + +L+SKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQE+LAE DF+FEH+ G SNQAAD LSRK EHAA+C+LAHL+ S++  SVR+ ++  L+ D   + ++ L +  KTRQFWVE+DLL TKGNRLYVPRA
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
        G LRK L+ ECHDT+WAGH GW RTYALLKKGY+W N+RDDVMQYTKTCLICQQDKV+++K+A LL+P PVP+RP
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

A0A6J1DLQ6 uncharacterized protein LOC1110223201.9e-11576.64Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ
        MMEG +LGIADVT+PFEVE DASDF LGGVLLQDGHP+AY+++KLNDA++RYAASEKEMLAVVHCLRA RQYLLG+KFV             + +LSSKQ
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFV-------------EIELSSKQ

Query:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA
        ARWQEYLAE DFQFEH+PGR+NQAAD LSRKSE AALCMLAHLKASKLT S+REAI+ +L++DP  + IIQL  +  TRQF VE+DL FTKGN LYVPR+
Subjt:  ARWQEYLAELDFQFEHRPGRSNQAADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRA

Query:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSR
        GNLRKLL+GECHDTMWAGHAGW RTYALLKKGYYW +LRDDVMQYTKTCLICQQDKV+R KIA LLEP P+PSR
Subjt:  GNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSR

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.9e-2024.83Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV
        ++  P+L   D +K   +E DASD  +G VL Q       +PV Y + K++ A+  Y+ S+KEMLA++  L+  R Y                L+G    
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV

Query:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF
        E E  +K+ ARWQ +L + +F+  +RPG +N  AD LSR           SE  ++  +  +    +T   +  +     +D  +  ++   +       
Subjt:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF

Query:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
         ++D LL    +++ +P    L + ++ + H+     H G      ++ + + W  +R  + +Y + C  CQ +K +  K    L+P P   RP
Subjt:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

P0CT35 Transposon Tf2-2 polyprotein2.9e-2024.83Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV
        ++  P+L   D +K   +E DASD  +G VL Q       +PV Y + K++ A+  Y+ S+KEMLA++  L+  R Y                L+G    
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV

Query:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF
        E E  +K+ ARWQ +L + +F+  +RPG +N  AD LSR           SE  ++  +  +    +T   +  +     +D  +  ++   +       
Subjt:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF

Query:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
         ++D LL    +++ +P    L + ++ + H+     H G      ++ + + W  +R  + +Y + C  CQ +K +  K    L+P P   RP
Subjt:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

P0CT36 Transposon Tf2-3 polyprotein2.9e-2024.83Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV
        ++  P+L   D +K   +E DASD  +G VL Q       +PV Y + K++ A+  Y+ S+KEMLA++  L+  R Y                L+G    
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV

Query:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF
        E E  +K+ ARWQ +L + +F+  +RPG +N  AD LSR           SE  ++  +  +    +T   +  +     +D  +  ++   +       
Subjt:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF

Query:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
         ++D LL    +++ +P    L + ++ + H+     H G      ++ + + W  +R  + +Y + C  CQ +K +  K    L+P P   RP
Subjt:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

P0CT41 Transposon Tf2-12 polyprotein2.9e-2024.83Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV
        ++  P+L   D +K   +E DASD  +G VL Q       +PV Y + K++ A+  Y+ S+KEMLA++  L+  R Y                L+G    
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV

Query:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF
        E E  +K+ ARWQ +L + +F+  +RPG +N  AD LSR           SE  ++  +  +    +T   +  +     +D  +  ++   +       
Subjt:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF

Query:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
         ++D LL    +++ +P    L + ++ + H+     H G      ++ + + W  +R  + +Y + C  CQ +K +  K    L+P P   RP
Subjt:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

Q9UR07 Transposon Tf2-11 polyprotein2.9e-2024.83Show/hide
Query:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV
        ++  P+L   D +K   +E DASD  +G VL Q       +PV Y + K++ A+  Y+ S+KEMLA++  L+  R Y                L+G    
Subjt:  MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDG-----HPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQY----------------LLGSKFV

Query:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF
        E E  +K+ ARWQ +L + +F+  +RPG +N  AD LSR           SE  ++  +  +    +T   +  +     +D  +  ++   +       
Subjt:  EIELSSKQ-ARWQEYLAELDFQFEHRPGRSNQAADVLSR----------KSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQF

Query:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP
         ++D LL    +++ +P    L + ++ + H+     H G      ++ + + W  +R  + +Y + C  CQ +K +  K    L+P P   RP
Subjt:  WVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGYYWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAAGGACCAATGCTTGGTATCGCAGATGTCACCAAGCCGTTCGAGGTAGAAATTGATGCCTCAGACTTCACCCTAGGCGGTGTTCTCTTGCAAGATGGACATCC
AGTGGCGTACAAAAACCGAAAGTTGAACGACGCGAAAAAGAGGTATGCGGCCTCTGAAAAAGAGATGTTGGCAGTAGTCCACTGCCTAAGAGCCTTGAGACAATATCTCT
TAGGCTCTAAGTTCGTTGAAATCGAGTTATCCTCGAAACAAGCCCGATGGCAAGAGTACCTCGCCGAATTGGACTTCCAATTCGAACATCGGCCTGGTCGATCAAACCAA
GCAGCAGACGTCCTTAGTCGAAAGAGCGAACATGCGGCCCTGTGCATGTTAGCTCATCTGAAAGCAAGCAAGTTGACAAGGTCTGTTCGTGAAGCCATCAAGGCACACCT
AAAAGATGATCCAACAGTGAAAACAATCATTCAGCTAGTAGAAGATGAGAAGACCCGCCAGTTTTGGGTCGAAGACGACCTCCTTTTTACCAAGGGCAATCGTCTATATG
TTCCGCGAGCAGGAAACCTGAGGAAACTCCTGATGGGGGAATGCCATGACACCATGTGGGCTGGCCACGCTGGGTGGCTGAGAACATATGCCCTACTGAAGAAAGGGTAC
TACTGGCTGAACCTGCGAGATGACGTTATGCAATACACCAAGACGTGTCTCATCTGCCAACAAGACAAAGTCAAGAGAATTAAAATTGCGAGCTTACTGGAGCCATTCCC
AGTGCCATCAAGGCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGAAGGACCAATGCTTGGTATCGCAGATGTCACCAAGCCGTTCGAGGTAGAAATTGATGCCTCAGACTTCACCCTAGGCGGTGTTCTCTTGCAAGATGGACATCC
AGTGGCGTACAAAAACCGAAAGTTGAACGACGCGAAAAAGAGGTATGCGGCCTCTGAAAAAGAGATGTTGGCAGTAGTCCACTGCCTAAGAGCCTTGAGACAATATCTCT
TAGGCTCTAAGTTCGTTGAAATCGAGTTATCCTCGAAACAAGCCCGATGGCAAGAGTACCTCGCCGAATTGGACTTCCAATTCGAACATCGGCCTGGTCGATCAAACCAA
GCAGCAGACGTCCTTAGTCGAAAGAGCGAACATGCGGCCCTGTGCATGTTAGCTCATCTGAAAGCAAGCAAGTTGACAAGGTCTGTTCGTGAAGCCATCAAGGCACACCT
AAAAGATGATCCAACAGTGAAAACAATCATTCAGCTAGTAGAAGATGAGAAGACCCGCCAGTTTTGGGTCGAAGACGACCTCCTTTTTACCAAGGGCAATCGTCTATATG
TTCCGCGAGCAGGAAACCTGAGGAAACTCCTGATGGGGGAATGCCATGACACCATGTGGGCTGGCCACGCTGGGTGGCTGAGAACATATGCCCTACTGAAGAAAGGGTAC
TACTGGCTGAACCTGCGAGATGACGTTATGCAATACACCAAGACGTGTCTCATCTGCCAACAAGACAAAGTCAAGAGAATTAAAATTGCGAGCTTACTGGAGCCATTCCC
AGTGCCATCAAGGCCTTGA
Protein sequenceShow/hide protein sequence
MMEGPMLGIADVTKPFEVEIDASDFTLGGVLLQDGHPVAYKNRKLNDAKKRYAASEKEMLAVVHCLRALRQYLLGSKFVEIELSSKQARWQEYLAELDFQFEHRPGRSNQ
AADVLSRKSEHAALCMLAHLKASKLTRSVREAIKAHLKDDPTVKTIIQLVEDEKTRQFWVEDDLLFTKGNRLYVPRAGNLRKLLMGECHDTMWAGHAGWLRTYALLKKGY
YWLNLRDDVMQYTKTCLICQQDKVKRIKIASLLEPFPVPSRP