; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g24230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g24230
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr4:17484993..17485832
RNA-Seq ExpressionMoc04g24230
SyntenyMoc04g24230
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036830.1 reverse transcriptase [Cucumis melo var. makuwa]5.9e-11773.38Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  WNW PECQ AF+ LK+A+MEG +LGIADVT+PFEVETDASD+ LGGVLL +GHPIAYESRKLN  ERRY  SEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        + FVVKTDNS+  HFF QP L+SKQARWQE+ AEFDF+FEHK G +NQAADALSRK EHAA+C+LAHL+ S++ GSIR+ ++E LQ D  AQ ++ LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE DLL TKGNRLYVP +G LRK LL ECHDT+WA    WQRTYALLKKGY+WPN+RDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

KAA0059106.1 reverse transcriptase [Cucumis melo var. makuwa]1.5e-11773.38Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  WNW PECQAAF+ LK+A+MEG +LGIADVT+PFEVETDASD+ LGGVLL +GHPIAYESRKLN  ERRY  SEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        + FVVKTDNS+  HFF QP L+SKQARWQE+ AEFDF+FEHK G +NQAADALSRK EHAA+C+LAHL+ S++ GS+R+ ++E LQ D  AQ ++ LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE+DL  TKGNRLYVP +GNLRK LL ECHDT+WA    WQRTYALLKKGY+WPN+RDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

TYK07954.1 reverse transcriptase [Cucumis melo var. makuwa]2.6e-11773.38Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  WNW PECQAAF+ LK+A+MEG +LGIADVT+PFEVETDASD+ LGGVLL +GHPIAYESRKLN  ERRY  SEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        + FVVKTDNS+  HFF QP L+SKQARWQE+ AEFDF+FEHK G +NQAADALSRK EHAA+C+LAHL+ S++ GS+R+ ++E LQ D  AQ ++ LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE DLL TKGNRLYVP +G LRK LL ECHDT+WA    WQRTYALLKKGY+WPN+RDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]8.4e-14089.57Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        +LL KNQKWNWTPEC AAFE+LKKAMMEG VLGIADVTRPFEVETDASDF LGGVLL DGHPIAYES+KLND ERRYAASEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        AKFVVKTDNSSV HFF+QP LSSKQARWQEY AEFDFQFEHKPGRANQAADALSRKSE AALCMLAHLKASKLTGSIREAI+ENLQNDP AQAIIQLANE
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        G TRQF VENDL FTKGN LYVP SGNLRKLLLGECHDTMWA    WQRTYALLKKGYYWP+LRDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]1.2e-11771.94Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  W+W+ +CQ AFENLK  M  G VLG+ DVT+PFEVETDASDF LGGVL+ +GHPIAYESRKLND ERRY  SEKEMLAVV+ LR WRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        ++FVVKTDNS+  HFFDQP L++KQARWQE  AEFDF+FEHK G++NQAADALSRK EHAALCMLAH+ +SK+ GS+R+ IKE+L  DP+A+A+++LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE DLL TKGNRLYVP +G LRK L+ ECHDT+WA    WQRTYAL+KKGY+WPN+RDD+MQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

TrEMBL top hitse value%identityAlignment
A0A5A7T0E2 Reverse transcriptase2.9e-11773.38Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  WNW PECQ AF+ LK+A+MEG +LGIADVT+PFEVETDASD+ LGGVLL +GHPIAYESRKLN  ERRY  SEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        + FVVKTDNS+  HFF QP L+SKQARWQE+ AEFDF+FEHK G +NQAADALSRK EHAA+C+LAHL+ S++ GSIR+ ++E LQ D  AQ ++ LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE DLL TKGNRLYVP +G LRK LL ECHDT+WA    WQRTYALLKKGY+WPN+RDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

A0A5A7UY33 Reverse transcriptase7.5e-11873.38Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  WNW PECQAAF+ LK+A+MEG +LGIADVT+PFEVETDASD+ LGGVLL +GHPIAYESRKLN  ERRY  SEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        + FVVKTDNS+  HFF QP L+SKQARWQE+ AEFDF+FEHK G +NQAADALSRK EHAA+C+LAHL+ S++ GS+R+ ++E LQ D  AQ ++ LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE+DL  TKGNRLYVP +GNLRK LL ECHDT+WA    WQRTYALLKKGY+WPN+RDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

A0A5D3C4R1 Reverse transcriptase3.7e-11773.02Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  WNW PECQ AF+ LK+A+MEG +LGIADVT+PFEVETDASD+ LGGVLL +GHPIAYESRKLN  ERRY  SEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        + FVVKTDNS+  HFF QP L+SKQARWQE+ AEFDF+FEHK G +NQAADALSRK EHAA+C+LAHL+ S++ GS+R+ ++E LQ D  AQ ++ LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE DLL TKGNRLYVP +G LRK LL ECHDT+WA    WQRTYALLKKGY+WPN+RDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

A0A5D3C9P8 Reverse transcriptase1.3e-11773.38Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        ELL K+  WNW PECQAAF+ LK+A+MEG +LGIADVT+PFEVETDASD+ LGGVLL +GHPIAYESRKLN  ERRY  SEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        + FVVKTDNS+  HFF QP L+SKQARWQE+ AEFDF+FEHK G +NQAADALSRK EHAA+C+LAHL+ S++ GS+R+ ++E LQ D  AQ ++ LA  
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        GKTRQFWVE DLL TKGNRLYVP +G LRK LL ECHDT+WA    WQRTYALLKKGY+WPN+RDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

A0A6J1DLQ6 uncharacterized protein LOC1110223204.1e-14089.57Show/hide
Query:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE
        +LL KNQKWNWTPEC AAFE+LKKAMMEG VLGIADVTRPFEVETDASDF LGGVLL DGHPIAYES+KLND ERRYAASEKEMLAVV+ LRAWRQYLL 
Subjt:  ELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLE

Query:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE
        AKFVVKTDNSSV HFF+QP LSSKQARWQEY AEFDFQFEHKPGRANQAADALSRKSE AALCMLAHLKASKLTGSIREAI+ENLQNDP AQAIIQLANE
Subjt:  AKFVVKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANE

Query:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
        G TRQF VENDL FTKGN LYVP SGNLRKLLLGECHDTMWA    WQRTYALLKKGYYWP+LRDDVMQYTKTCLICQ
Subjt:  GKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.8e-2625.68Show/hide
Query:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ
        LL K+ +W WTP    A EN+K+ ++   VL   D ++   +ETDASD  +G VL   HD    +P+ Y S K++  +  Y+ S+KEMLA++  L+ WR 
Subjt:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ

Query:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK
        YL      F + TD+ ++      +    + + ARWQ +  +F+F+  ++PG AN  ADALSR           SE  ++  +  +    +T   +  + 
Subjt:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK

Query:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
            ND     ++   ++       +++ LL    +++ +P    L + ++ + H+         +    ++ + + W  +R  + +Y + C  CQ
Subjt:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

P0CT35 Transposon Tf2-2 polyprotein3.8e-2625.68Show/hide
Query:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ
        LL K+ +W WTP    A EN+K+ ++   VL   D ++   +ETDASD  +G VL   HD    +P+ Y S K++  +  Y+ S+KEMLA++  L+ WR 
Subjt:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ

Query:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK
        YL      F + TD+ ++      +    + + ARWQ +  +F+F+  ++PG AN  ADALSR           SE  ++  +  +    +T   +  + 
Subjt:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK

Query:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
            ND     ++   ++       +++ LL    +++ +P    L + ++ + H+         +    ++ + + W  +R  + +Y + C  CQ
Subjt:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

P0CT36 Transposon Tf2-3 polyprotein3.8e-2625.68Show/hide
Query:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ
        LL K+ +W WTP    A EN+K+ ++   VL   D ++   +ETDASD  +G VL   HD    +P+ Y S K++  +  Y+ S+KEMLA++  L+ WR 
Subjt:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ

Query:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK
        YL      F + TD+ ++      +    + + ARWQ +  +F+F+  ++PG AN  ADALSR           SE  ++  +  +    +T   +  + 
Subjt:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK

Query:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
            ND     ++   ++       +++ LL    +++ +P    L + ++ + H+         +    ++ + + W  +R  + +Y + C  CQ
Subjt:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

P0CT41 Transposon Tf2-12 polyprotein3.8e-2625.68Show/hide
Query:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ
        LL K+ +W WTP    A EN+K+ ++   VL   D ++   +ETDASD  +G VL   HD    +P+ Y S K++  +  Y+ S+KEMLA++  L+ WR 
Subjt:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ

Query:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK
        YL      F + TD+ ++      +    + + ARWQ +  +F+F+  ++PG AN  ADALSR           SE  ++  +  +    +T   +  + 
Subjt:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK

Query:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
            ND     ++   ++       +++ LL    +++ +P    L + ++ + H+         +    ++ + + W  +R  + +Y + C  CQ
Subjt:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

Q9UR07 Transposon Tf2-11 polyprotein3.8e-2625.68Show/hide
Query:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ
        LL K+ +W WTP    A EN+K+ ++   VL   D ++   +ETDASD  +G VL   HD    +P+ Y S K++  +  Y+ S+KEMLA++  L+ WR 
Subjt:  LLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVL--LHDG---HPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQ

Query:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK
        YL      F + TD+ ++      +    + + ARWQ +  +F+F+  ++PG AN  ADALSR           SE  ++  +  +    +T   +  + 
Subjt:  YLLEA--KFVVKTDNSSVYHFF--DQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSR----------KSEHAALCMLAHLKASKLTGSIREAIK

Query:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ
            ND     ++   ++       +++ LL    +++ +P    L + ++ + H+         +    ++ + + W  +R  + +Y + C  CQ
Subjt:  ENLQNDPTAQAIIQLANEGKTRQFWVENDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTACTAATGAAGAATCAGAAGTGGAATTGGACCCCAGAATGTCAAGCCGCATTTGAGAATTTGAAAAAGGCGATGATGGAGGGGCTGGTGTTGGGGATT
GCAGACGTCACACGACCGTTCGAAGTCGAGACCGATGCGTCAGACTTCGTCCTGGGAGGAGTGCTTCTCCATGACGGTCACCCCATTGCGTATGAGAGTCGGAAG
TTGAATGATGTTGAAAGGAGGTATGCTGCCTCCGAGAAAGAGATGTTAGCAGTAGTCTACTACTTGAGGGCCTGGAGGCAATATCTCCTAGAGGCCAAATTCGTT
GTCAAGACCGACAACAGCTCAGTCTATCACTTCTTCGACCAACCGATGTTATCGTCAAAGCAAGCTAGGTGGCAAGAATACCGTGCCGAGTTTGATTTTCAGTTC
GAACATAAGCCAGGTCGAGCAAATCAGGCAGCAGATGCCCTTAGCAGAAAGAGCGAACATGCGGCCCTGTGCATGTTAGCTCACCTAAAAGCGAGCAAGCTAACT
GGATCCATTCGTGAAGCCATCAAAGAGAACTTACAGAATGACCCAACTGCCCAAGCGATAATTCAGTTGGCCAACGAAGGGAAGACTAGACAGTTTTGGGTGGAG
AATGACCTCCTTTTCACCAAAGGTAACCGTTTGTACGTTCCCTGTTCTGGGAACCTGAGGAAGCTCCTACTTGGAGAATGTCATGACACAATGTGGGCAAGCGAT
GTTAGATGGCAGAGAACTTACGCCCTGCTAAAGAAAGGCTACTACTGGCCGAATTTGAGAGACGACGTGATGCAGTATACCAAAACGTGTCTCATCTGCCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATTACTAATGAAGAATCAGAAGTGGAATTGGACCCCAGAATGTCAAGCCGCATTTGAGAATTTGAAAAAGGCGATGATGGAGGGGCTGGTGTTGGGGATT
GCAGACGTCACACGACCGTTCGAAGTCGAGACCGATGCGTCAGACTTCGTCCTGGGAGGAGTGCTTCTCCATGACGGTCACCCCATTGCGTATGAGAGTCGGAAG
TTGAATGATGTTGAAAGGAGGTATGCTGCCTCCGAGAAAGAGATGTTAGCAGTAGTCTACTACTTGAGGGCCTGGAGGCAATATCTCCTAGAGGCCAAATTCGTT
GTCAAGACCGACAACAGCTCAGTCTATCACTTCTTCGACCAACCGATGTTATCGTCAAAGCAAGCTAGGTGGCAAGAATACCGTGCCGAGTTTGATTTTCAGTTC
GAACATAAGCCAGGTCGAGCAAATCAGGCAGCAGATGCCCTTAGCAGAAAGAGCGAACATGCGGCCCTGTGCATGTTAGCTCACCTAAAAGCGAGCAAGCTAACT
GGATCCATTCGTGAAGCCATCAAAGAGAACTTACAGAATGACCCAACTGCCCAAGCGATAATTCAGTTGGCCAACGAAGGGAAGACTAGACAGTTTTGGGTGGAG
AATGACCTCCTTTTCACCAAAGGTAACCGTTTGTACGTTCCCTGTTCTGGGAACCTGAGGAAGCTCCTACTTGGAGAATGTCATGACACAATGTGGGCAAGCGAT
GTTAGATGGCAGAGAACTTACGCCCTGCTAAAGAAAGGCTACTACTGGCCGAATTTGAGAGACGACGTGATGCAGTATACCAAAACGTGTCTCATCTGCCAATAG
Protein sequenceShow/hide protein sequence
MELLMKNQKWNWTPECQAAFENLKKAMMEGLVLGIADVTRPFEVETDASDFVLGGVLLHDGHPIAYESRKLNDVERRYAASEKEMLAVVYYLRAWRQYLLEAKFV
VKTDNSSVYHFFDQPMLSSKQARWQEYRAEFDFQFEHKPGRANQAADALSRKSEHAALCMLAHLKASKLTGSIREAIKENLQNDPTAQAIIQLANEGKTRQFWVE
NDLLFTKGNRLYVPCSGNLRKLLLGECHDTMWASDVRWQRTYALLKKGYYWPNLRDDVMQYTKTCLICQ