CuGenDBv2

Cmc07g0196121 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc07g0196121
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr07:19519140..19520192
RNA-Seq Expression: Cmc07g0196121
Synteny: Cmc07g0196121
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
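
As a small worked example, the genome location string can be split into its sequence ID and coordinates. This is a minimal sketch, assuming the `start..end` coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("CMiso1.1chr07:19519140..19520192")
span = end - start + 1  # inclusive length under the stated assumption
print(seqid, start, end, span)
```

Under that assumption the span is 1053 bp, i.e. 351 codons, which is consistent with a 350-residue protein plus a stop codon.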


Homology
GenBank top hits (e value, % identity, alignment)
CAA7014963.1 unnamed protein product [Microthlaspi erraticum] (e value: 1.3e-110, % identity: 57.43)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARW+SFLQ+F F+I+H+ G  NKVADALSR++SLL  L+ EI  F+ L  LYE D +F E W KC+    + DFHI +GFLFK D+LCI  +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        +++ H G L GH G+DKT  ++ +RYYWP LRRD    V+RC  CQ +KG S N GLY PLP      +   +   L  P+ +   DSV VVVDRFSKMT
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI+CKK  DA  IA LFFRE+VRLHGV KTI+SDRD KFLSHFW TLWR   TTLK  STAHPQTD QTEVTN+TLGN+IR + G +PKQWDL L Q E
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FA+N+  + +TGK PF +VYT  P+  +DL  LP     S+ AE M E+I
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

CAA7028195.1 unnamed protein product [Microthlaspi erraticum] (e value: 1.3e-110, % identity: 57.43)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARW+SFLQ+F F+I+H+ G  NKVADALSR++SLL  L+ EI  F+ L  LYE D +F E W KC+    + DFHI +GFLFK D+LCI  +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        +++ H G L GH G+DKT  ++ +RYYWP LRRD    V+RC  CQ +KG S N GLY PLP      +   +   L  P+ +   DSV VVVDRFSKMT
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI+CKK  DA  IA LFFRE+VRLHGV KTI+SDRD KFLSHFW TLWR   TTLK  STAHPQTD QTEVTN+TLGN+IR + G +PKQWDL L Q E
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FA+N+  + +TGK PF +VYT  P+  +DL  LP     S+ AE M E+I
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

KAA0063034.1 serine/threonine-protein kinase TIO-like [Cucumis melo var. makuwa] (e value: 4.7e-204, % identity: 100)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

XP_024641774.2 uncharacterized protein LOC112422671 [Medicago truncatula] (e value: 1.1e-107, % identity: 56.29)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARW+SFLQ+F F+I+H+ G  NKVADALSR++SLL  L+ E+  F+ L  LYE D +F E + KC       DFHI EG+LFK DQLCI  +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        +++ HSG L GH G+DKT  ++ +R+YWP LR+D    V++C TCQ +KG S N GLY PLP      +   +   L  P+ +   DSV VVVDRFSKM+
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI+CK+  DA  IA LFFRE+VRLHGV  +I SDRD KFLSHFW TLW+   T+L   STAHPQTD QTEVTN+TLGN+IRC+ G KPKQWDL L Q E
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FA+N+  + +TGK PF +VYT  PR  +DL  LP A   S+ AEKM E+I
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max] (e value: 1.3e-108, % identity: 56)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARW+SFLQ+F F+I+H+ G  NKVADALSR+ SLL  L+ E+  F+ L  LYE D +F E W KC      +DFH+ EGFLFK ++LCI  +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        +++ H G L GH G+DKT  ++ +R+YWP LR+D    V++C TCQ +KG S N GLY PLP      +   +   L  P+ +   DSV VVVDRFSKM+
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI+CKK  DA  IA LFFRE+V LHGV K+I SDRD KFLSHFW TLW+  DT+L   STAHPQTD QTEVTN+TLGN+IRC+ G KPKQWDL L Q E
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FA+N+  + +TGK PF +VYT  PR  +DL  LP A   S+ AE M E+I
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V732 Serine/threonine-protein kinase TIO-like (e value: 2.3e-204, % identity: 100)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

A0A5B7BER3 Uncharacterized protein (e value: 2.0e-120, % identity: 59.89)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MH RWI+FLQRF FV+KH+ G++NKVADALSR+++LL ++S EI +F+ L  LY++D DF + W KC     + +FHI +G+LFK +QLCI  TSLRE +
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        L++ HSG L GH G+DKT   + +RYYWPQL+RD   FVQ+CP CQ AKG + N GLY+PLP      +   +   L  P+ +   DSV VVVDRFSKM 
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI CKK +DA ++ANLFFREIVRLHGV K+I SDRDVKFLSHFW+TLWRK DT+L+Y STAHPQTD QTEVTN+TLGNLIRC SG +PKQWD+ L Q E
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEK
        FA+N M NRST K PFE+VYTK P+  LDLA LP    +S+ AE   ++
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEK

A0A6D2HLB5 Reverse transcriptase (e value: 6.5e-111, % identity: 57.43)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARW+SFLQ+F F+I+H+ G  NKVADALSR++SLL  L+ EI  F+ L  LYE D +F E W KC+    + DFHI +GFLFK D+LCI  +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        +++ H G L GH G+DKT  ++ +RYYWP LRRD    V+RC  CQ +KG S N GLY PLP      +   +   L  P+ +   DSV VVVDRFSKMT
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI+CKK  DA  IA LFFRE+VRLHGV KTI+SDRD KFLSHFW TLWR   TTLK  STAHPQTD QTEVTN+TLGN+IR + G +PKQWDL L Q E
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FA+N+  + +TGK PF +VYT  P+  +DL  LP     S+ AE M E+I
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

A0A6D2IKM3 Reverse transcriptase (e value: 6.5e-111, % identity: 57.43)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARW+SFLQ+F F+I+H+ G  NKVADALSR++SLL  L+ EI  F+ L  LYE D +F E W KC+    + DFHI +GFLFK D+LCI  +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        +++ H G L GH G+DKT  ++ +RYYWP LRRD    V+RC  CQ +KG S N GLY PLP      +   +   L  P+ +   DSV VVVDRFSKMT
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI+CKK  DA  IA LFFRE+VRLHGV KTI+SDRD KFLSHFW TLWR   TTLK  STAHPQTD QTEVTN+TLGN+IR + G +PKQWDL L Q E
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FA+N+  + +TGK PF +VYT  P+  +DL  LP     S+ AE M E+I
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

A0A6N2LVR1 Uncharacterized protein (e value: 6.7e-116, % identity: 57.43)
Query:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL
        MHARW++F+QRF+F +KH+ G+ NKVADALSRK SLLT L  E+  F+ +  LY  D DF  TW KC   +  E  H  +G+LF+ +QLCI  +SLRE +
Subjt:  MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREAL

Query:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT
        + E H G L GH G+DKT     +RYYWPQL+RD  N V+RCPTCQ +KG + N GLY PLP      +   +   L  P+ +   DSV VVVDRFSKM 
Subjt:  LKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMT

Query:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE
        HFI+CKK +DA+++ANLFF+E+VRLHGV K+I SDRD KFLSHFW+TLWR+ DTTL + ST+HPQTD QTEV N+TLGNLIRCLSG +PKQWDL LAQAE
Subjt:  HFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAE

Query:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI
        FA+N+M NRSTGK PF+VVY + P+  LDL  LP     ++ AE M +++
Subjt:  FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 2.8e-34, % identity: 29.61)
Query:  ARWISFLQRFDFVIKHQCGKENKVADALSR---------------KSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKE-
        ARW  FLQ F+F I ++ G  N +ADALSR                 + +  +S+  +    + + Y  D        K  N +  ED  + E    K+ 
Subjt:  ARWISFLQRFDFVIKHQCGKENKVADALSR---------------KSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKE-

Query:  ------DQLCILH-TSLREALLKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPL----PTQLPFGKIYQLIL
              DQ+ + + T L   ++K+ H    + H G +     I +R+ W  +R+    +VQ C TCQ  K  S N   Y PL    P++ P+  +  +  
Subjt:  ------DQLCILH-TSLREALLKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPL----PTQLPFGKIYQLIL

Query:  CLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNK
            P+    ++++ VVVDRFSKM   + C K   A   A +F + ++   G  K I++D D  F S  WK    K +  +K+     PQTD QTE TN+
Subjt:  CLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNK

Query:  TLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEVVYTKQPRLT-LDLAS
        T+  L+RC+  T P  W   ++  + ++NN  + +T   PFE+V+   P L+ L+L S
Subjt:  TLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEVVYTKQPRLT-LDLAS

P0CT41 Transposon Tf2-12 polyprotein (e value: 2.8e-34, % identity: 29.61)
Query:  ARWISFLQRFDFVIKHQCGKENKVADALSR---------------KSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKE-
        ARW  FLQ F+F I ++ G  N +ADALSR                 + +  +S+  +    + + Y  D        K  N +  ED  + E    K+ 
Subjt:  ARWISFLQRFDFVIKHQCGKENKVADALSR---------------KSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKE-

Query:  ------DQLCILH-TSLREALLKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPL----PTQLPFGKIYQLIL
              DQ+ + + T L   ++K+ H    + H G +     I +R+ W  +R+    +VQ C TCQ  K  S N   Y PL    P++ P+  +  +  
Subjt:  ------DQLCILH-TSLREALLKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPL----PTQLPFGKIYQLIL

Query:  CLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNK
            P+    ++++ VVVDRFSKM   + C K   A   A +F + ++   G  K I++D D  F S  WK    K +  +K+     PQTD QTE TN+
Subjt:  CLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNK

Query:  TLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEVVYTKQPRLT-LDLAS
        T+  L+RC+  T P  W   ++  + ++NN  + +T   PFE+V+   P L+ L+L S
Subjt:  TLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEVVYTKQPRLT-LDLAS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 6.4e-39, % identity: 30.37)
Query:  RWISFLQRFDFVIKHQCGKENKVADALSRKSSLL---TLLSMEIEAFK--------------HLPSL---------------YEKDVDFSETWLKCSNFI
        RW+  L  +DF +++  G +N VADA+SR    +   T   ++ E++K              H+  L               Y+K ++ SET+ K     
Subjt:  RWISFLQRFDFVIKHQCGKENKVADALSRKSSLL---TLLSMEIEAFK--------------HLPSL---------------YEKDVDFSETWLKCSNFI

Query:  KAEDFHIMEGFLFKEDQLCILHTSLREALLKEAHSGRLV-GHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNA-GLYSPLPTQLPFGK
           ++ + +  ++ +D+L ++    + A+++  H   L  GHFG   T   IS  YYWP+L+     +++ C  CQ  K       GL  PLP       
Subjt:  KAEDFHIMEGFLFKEDQLCILHTSLREALLKEAHSGRLV-GHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNA-GLYSPLPTQLPFGK

Query:  IYQLILCLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQ
           +      P   +  + ++VVVDRFSK  HFI+ +K  DA  + +L FR I   HG  +TI SDRDV+  +  ++ L ++L       S  HPQTD Q
Subjt:  IYQLILCLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQ

Query:  TEVTNKTLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEV
        +E T +TL  L+R    T  + W + L Q EF +N+   R+ GK PFE+
Subjt:  TEVTNKTLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEV

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 2.9e-39, % identity: 30.37)
Query:  RWISFLQRFDFVIKHQCGKENKVADALSRKSSLL---TLLSMEIEAFK--------------HLPSL---------------YEKDVDFSETWLKCSNFI
        RW+  L  +DF +++  G +N VADA+SR    +   T   ++ E++K              H+  L               Y+K ++ SET+ K     
Subjt:  RWISFLQRFDFVIKHQCGKENKVADALSRKSSLL---TLLSMEIEAFK--------------HLPSL---------------YEKDVDFSETWLKCSNFI

Query:  KAEDFHIMEGFLFKEDQLCILHTSLREALLKEAHSGRLV-GHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNA-GLYSPLPTQLPFGK
           ++ + +  ++ +D+L ++    + A+++  H   L  GHFG   T   IS  YYWP+L+     +++ C  CQ  K       GL  PLP       
Subjt:  KAEDFHIMEGFLFKEDQLCILHTSLREALLKEAHSGRLV-GHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNA-GLYSPLPTQLPFGK

Query:  IYQLILCLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQ
           +      P   +  + ++VVVDRFSK  HFI+ +K  DA  + +L FR I   HG  +TI SDRDV+  +  ++ L ++L       S  HPQTD Q
Subjt:  IYQLILCLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQ

Query:  TEVTNKTLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEV
        +E T +TL  L+R  + T  + W + L Q EF +N+   R+ GK PFE+
Subjt:  TEVTNKTLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEV

Q9UR07 Transposon Tf2-11 polyprotein (e value: 2.8e-34, % identity: 29.61)
Query:  ARWISFLQRFDFVIKHQCGKENKVADALSR---------------KSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKE-
        ARW  FLQ F+F I ++ G  N +ADALSR                 + +  +S+  +    + + Y  D        K  N +  ED  + E    K+ 
Subjt:  ARWISFLQRFDFVIKHQCGKENKVADALSR---------------KSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKE-

Query:  ------DQLCILH-TSLREALLKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPL----PTQLPFGKIYQLIL
              DQ+ + + T L   ++K+ H    + H G +     I +R+ W  +R+    +VQ C TCQ  K  S N   Y PL    P++ P+  +  +  
Subjt:  ------DQLCILH-TSLREALLKEAHSGRLVGHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPL----PTQLPFGKIYQLIL

Query:  CLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNK
            P+    ++++ VVVDRFSKM   + C K   A   A +F + ++   G  K I++D D  F S  WK    K +  +K+     PQTD QTE TN+
Subjt:  CLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFREIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNK

Query:  TLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEVVYTKQPRLT-LDLAS
        T+  L+RC+  T P  W   ++  + ++NN  + +T   PFE+V+   P L+ L+L S
Subjt:  TLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEVVYTKQPRLT-LDLAS

Arabidopsis top hits (e value, % identity, alignment)
No hits found
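
The middle line of each alignment block above can be tallied to see how a per-block identity figure arises. The sketch below assumes the usual BLAST-style convention, which this page does not spell out: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. `block_stats` is a hypothetical helper, and the reported % identity on the page covers the whole alignment, not a single block.

```python
def block_stats(query: str, midline: str):
    """Count identities and positives in one alignment block.

    Assumed convention: letter = identical residue, '+' = conservative
    substitution, space = mismatch or gap.
    Returns (identities, positives, block_length).
    """
    # Trailing spaces are often trimmed from the midline, so pad it back
    # to the query width before counting.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last block of the first GenBank hit (CAA7014963.1), copied from the page.
query = "FAFNNMKNRSTGKCPFEVVYTKQPRLTLDLASLPTAMNTSLEAEKMIEKI"
midline = "FA+N+  + +TGK PF +VYT  P+  +DL  LP     S+ AE M E+I"

ident, pos, length = block_stats(query, midline)
print(f"identities {ident}/{length}, positives {pos}/{length}")
```

Summing these counts over every block of a hit and dividing by the total alignment length gives the overall fraction of identical positions, provided the midline spacing survived copying intact.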

Sequences
CDS sequence
ATGCACGCTCGATGGATCTCTTTTTTACAGAGATTTGACTTCGTCATTAAACATCAATGTGGTAAAGAGAATAAAGTGGCAGATGCCCTATCCAGAAAAAGCTCCTTACT
CACTCTCCTATCCATGGAGATCGAAGCATTCAAACACCTTCCTAGCCTATATGAAAAAGATGTGGATTTTTCTGAGACATGGCTTAAATGTAGCAACTTTATCAAAGCTG
AAGACTTTCACATAATGGAAGGTTTTTTATTCAAAGAAGATCAGTTGTGTATTCTACACACATCACTTCGAGAAGCTCTACTAAAAGAAGCTCATTCAGGACGGTTGGTT
GGACACTTCGGGCAAGATAAGACCTTTGAAACAATCTCTAAGAGATACTATTGGCCTCAGTTGAGAAGAGACTGCAACAACTTTGTCCAAAGATGCCCTACTTGTCAAAG
AGCCAAGGGTACAAGCACAAACGCAGGCCTATACTCACCACTACCTACCCAACTTCCATTTGGGAAGATTTATCAATTGATTTTGTGTTTGGACTACCCAAAACTCAAAG
ACAAACATGACTCAGTCATGGTTGTGGTCGATAGATTTAGCAAAATGACACACTTTATATCTTGCAAAAAGATAAACGACGCCATCTACATAGCAAATCTCTTCTTTAGA
GAAATAGTTCGATTACATGGAGTACTAAAGACTATTGTCTCTGATCGGGATGTAAAATTCCTAAGCCATTTCTGGAAAACACTATGGAGAAAGCTTGACACGACACTTAA
ATACAGGTCAACAGCACATCCTCAAACAGATGACCAGACAGAAGTGACAAACAAAACCTTAGGCAACTTGATACGCTGCCTTAGTGGAACAAAACCGAAGCAATGGGACT
TGGTTCTTGCTCAGGCAGAATTTGCCTTCAACAATATGAAGAATAGATCGACCGGCAAATGCCCATTTGAGGTTGTATATACAAAACAACCAAGGCTAACCCTTGACCTA
GCATCACTCCCCACAGCTATGAACACCAGTTTAGAAGCAGAAAAGATGATAGAAAAGATCTAA
mRNA sequence
ATGCACGCTCGATGGATCTCTTTTTTACAGAGATTTGACTTCGTCATTAAACATCAATGTGGTAAAGAGAATAAAGTGGCAGATGCCCTATCCAGAAAAAGCTCCTTACT
CACTCTCCTATCCATGGAGATCGAAGCATTCAAACACCTTCCTAGCCTATATGAAAAAGATGTGGATTTTTCTGAGACATGGCTTAAATGTAGCAACTTTATCAAAGCTG
AAGACTTTCACATAATGGAAGGTTTTTTATTCAAAGAAGATCAGTTGTGTATTCTACACACATCACTTCGAGAAGCTCTACTAAAAGAAGCTCATTCAGGACGGTTGGTT
GGACACTTCGGGCAAGATAAGACCTTTGAAACAATCTCTAAGAGATACTATTGGCCTCAGTTGAGAAGAGACTGCAACAACTTTGTCCAAAGATGCCCTACTTGTCAAAG
AGCCAAGGGTACAAGCACAAACGCAGGCCTATACTCACCACTACCTACCCAACTTCCATTTGGGAAGATTTATCAATTGATTTTGTGTTTGGACTACCCAAAACTCAAAG
ACAAACATGACTCAGTCATGGTTGTGGTCGATAGATTTAGCAAAATGACACACTTTATATCTTGCAAAAAGATAAACGACGCCATCTACATAGCAAATCTCTTCTTTAGA
GAAATAGTTCGATTACATGGAGTACTAAAGACTATTGTCTCTGATCGGGATGTAAAATTCCTAAGCCATTTCTGGAAAACACTATGGAGAAAGCTTGACACGACACTTAA
ATACAGGTCAACAGCACATCCTCAAACAGATGACCAGACAGAAGTGACAAACAAAACCTTAGGCAACTTGATACGCTGCCTTAGTGGAACAAAACCGAAGCAATGGGACT
TGGTTCTTGCTCAGGCAGAATTTGCCTTCAACAATATGAAGAATAGATCGACCGGCAAATGCCCATTTGAGGTTGTATATACAAAACAACCAAGGCTAACCCTTGACCTA
GCATCACTCCCCACAGCTATGAACACCAGTTTAGAAGCAGAAAAGATGATAGAAAAGATCTAA
Protein sequence
MHARWISFLQRFDFVIKHQCGKENKVADALSRKSSLLTLLSMEIEAFKHLPSLYEKDVDFSETWLKCSNFIKAEDFHIMEGFLFKEDQLCILHTSLREALLKEAHSGRLV
GHFGQDKTFETISKRYYWPQLRRDCNNFVQRCPTCQRAKGTSTNAGLYSPLPTQLPFGKIYQLILCLDYPKLKDKHDSVMVVVDRFSKMTHFISCKKINDAIYIANLFFR
EIVRLHGVLKTIVSDRDVKFLSHFWKTLWRKLDTTLKYRSTAHPQTDDQTEVTNKTLGNLIRCLSGTKPKQWDLVLAQAEFAFNNMKNRSTGKCPFEVVYTKQPRLTLDL
ASLPTAMNTSLEAEKMIEKI
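
The relationship between the CDS and the protein sequence listed above can be checked by translating codon by codon. This is a minimal sketch, assuming the standard genetic code; `translate` and `CODON_TABLE` are illustrative names written for this example rather than functions from a particular library, and only the opening bases and the final line of the CDS are used to keep it short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 42 bases and the final line of the CDS shown on this page.
cds_start = "ATGCACGCTCGATGGATCTCTTTTTTACAGAGATTTGACTTC"
cds_end = "GCATCACTCCCCACAGCTATGAACACCAGTTTAGAAGCAGAAAAGATGATAGAAAAGATCTAA"

print(translate(cds_start))  # MHARWISFLQRFDF
print(translate(cds_end))    # ASLPTAMNTSLEAEKMIEKI
```

Both fragments reproduce the corresponding ends of the protein sequence, and the final codon `TAA` is a stop codon, so it does not appear in the translated output.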