CuGenDBv2

Moc07g20220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g20220
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr7:14574546..14575532
RNA-Seq Expression: Moc07g20220
Synteny: Moc07g20220
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
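
The genome location field can be split into chromosome, start and end. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:14574546..14575532")
print(chrom, span_length(start, end))  # chr7 987
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention is worth checking before relying on the result.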


Homology
GenBank top hits | e value | % identity | Alignment
KAA0032849.1 reverse transcriptase [Cucumis melo var. makuwa] | 8.7e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa] | 8.7e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

KAA0063412.1 reverse transcriptase [Cucumis melo var. makuwa] | 8.7e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

KAA0067557.1 reverse transcriptase [Cucumis melo var. makuwa] | 8.7e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | 3.3e-159 | 88.89
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLSALQKKAEEVKEPLERGLMY EAW+NQ+AAKSTMVDSGAT+NFMTETEARRLNL WDKDPGKMKAVNS ALPIMGV KRVSVKL TWSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        VIVRMDDFDVVLG+KFLLEHKV+PMPLAKCLVVT SDP VVQTSIKQPSGVKMIS LQLKKG+A+D+P  MAIP+ EG  SEE VPREIQRVL  YADVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PDNLPK LPPR GIDHEIELL GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQKKKD SLRLCIDYRALNKL VRNK PLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQV
        DLFDQLHGAKYFSKLDLRSGYYQV
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQV

TrEMBL top hits | e value | % identity | Alignment
A0A5A7T0E2 Reverse transcriptase | 4.2e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

A0A5D3B7E7 Reverse transcriptase | 4.2e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

A0A5D3BRZ6 Reverse transcriptase | 4.2e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

A0A5D3C4R1 Reverse transcriptase | 4.2e-144 | 77.13
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLS+LQKK  E   P+ERGLMY + WINQ+  KSTMVDSGAT+NF+TE EA+RLNLRW+KD G+MKAVNS ALPI+G+ KR  ++L  WSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        V+V+MDDFDVVLGM+FLLEH+V+PMPLAKCLV+TG  P+VVQT ++QP G+KMIS +QLKKGL+RD+P  MAIP+     S E VP+EI RVL  Y DVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PD+LPK+LPPRR IDHEIEL+ GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQ+KKDGSLRLCIDYRALNKLTVRNKYPLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE
        DLFD+LHGAKYFSKLDLRSGYYQVRIAE
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQVRIAE

A0A6J1DLQ6 uncharacterized protein LOC111022320 | 1.6e-159 | 88.89
Query:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF
        MGALKFLSALQKKAEEVKEPLERGLMY EAW+NQ+AAKSTMVDSGAT+NFMTETEARRLNL WDKDPGKMKAVNS ALPIMGV KRVSVKL TWSG VDF
Subjt:  MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDF

Query:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM
        VIVRMDDFDVVLG+KFLLEHKV+PMPLAKCLVVT SDP VVQTSIKQPSGVKMIS LQLKKG+A+D+P  MAIP+ EG  SEE VPREIQRVL  YADVM
Subjt:  VIVRMDDFDVVLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVM

Query:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT
        PDNLPK LPPR GIDHEIELL GAK PAKNAYRM P ELAELRKQL ELLNAGFIRPAKAPYGAPVLFQKKKD SLRLCIDYRALNKL VRNK PLPIIT
Subjt:  PDNLPKTLPPRRGIDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIIT

Query:  DLFDQLHGAKYFSKLDLRSGYYQV
        DLFDQLHGAKYFSKLDLRSGYYQV
Subjt:  DLFDQLHGAKYFSKLDLRSGYYQV

SwissProt top hits | e value | % identity | Alignment
P0CT34 Transposon Tf2-1 polyprotein | 1.1e-19 | 29.26
Query:  VVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVMPDNLPKTLPPRRGIDHEIELL-SGAKSPAKNAYRMTPSE
        +V      P+ +   +       ++  K  L  +  V     E  +P   +   ++ A+   + LPK   P +G++ E+EL     + P +N Y + P +
Subjt:  VVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVMPDNLPKTLPPRRGIDHEIELL-SGAKSPAKNAYRMTPSE

Query:  LAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDQLHGAKYFSKLDLRSGYYQVRI
        +  +  ++ + L +G IR +KA    PV+F  KK+G+LR+ +DY+ LNK    N YPLP+I  L  ++ G+  F+KLDL+S Y+ +R+
Subjt:  LAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDQLHGAKYFSKLDLRSGYYQVRI

P0CT41 Transposon Tf2-12 polyprotein | 1.1e-19 | 29.26
Query:  VVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVMPDNLPKTLPPRRGIDHEIELL-SGAKSPAKNAYRMTPSE
        +V      P+ +   +       ++  K  L  +  V     E  +P   +   ++ A+   + LPK   P +G++ E+EL     + P +N Y + P +
Subjt:  VVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVMPDNLPKTLPPRRGIDHEIELL-SGAKSPAKNAYRMTPSE

Query:  LAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDQLHGAKYFSKLDLRSGYYQVRI
        +  +  ++ + L +G IR +KA    PV+F  KK+G+LR+ +DY+ LNK    N YPLP+I  L  ++ G+  F+KLDL+S Y+ +R+
Subjt:  LAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDQLHGAKYFSKLDLRSGYYQVRI

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 5.1e-22 | 41.46
Query:  LPPRRG------IDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITD
        LPPR        + H+IE+  GA+ P    Y +T     E+ K + +LL+  FI P+K+P  +PV+   KKDG+ RLC+DYR LNK T+ + +PLP I +
Subjt:  LPPRRG------IDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITD

Query:  LFDQLHGAKYFSKLDLRSGYYQV
        L  ++  A+ F+ LDL SGY+Q+
Subjt:  LFDQLHGAKYFSKLDLRSGYYQV

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 5.1e-22 | 41.46
Query:  LPPRRG------IDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITD
        LPPR        + H+IE+  GA+ P    Y +T     E+ K + +LL+  FI P+K+P  +PV+   KKDG+ RLC+DYR LNK T+ + +PLP I +
Subjt:  LPPRRG------IDHEIELLSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITD

Query:  LFDQLHGAKYFSKLDLRSGYYQV
        L  ++  A+ F+ LDL SGY+Q+
Subjt:  LFDQLHGAKYFSKLDLRSGYYQV

Q9UR07 Transposon Tf2-11 polyprotein | 1.1e-19 | 29.26
Query:  VVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVMPDNLPKTLPPRRGIDHEIELL-SGAKSPAKNAYRMTPSE
        +V      P+ +   +       ++  K  L  +  V     E  +P   +   ++ A+   + LPK   P +G++ E+EL     + P +N Y + P +
Subjt:  VVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVMPDNLPKTLPPRRGIDHEIELL-SGAKSPAKNAYRMTPSE

Query:  LAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDQLHGAKYFSKLDLRSGYYQVRI
        +  +  ++ + L +G IR +KA    PV+F  KK+G+LR+ +DY+ LNK    N YPLP+I  L  ++ G+  F+KLDL+S Y+ +R+
Subjt:  LAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDQLHGAKYFSKLDLRSGYYQVRI

Arabidopsis top hits | e value | % identity | Alignment
No hits found
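
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts these for a single block; `midline_stats` is a hypothetical helper, and the example string is the final block of the KAA0032849.1 alignment. The % identity values reported per hit are computed by the search tool over the whole alignment, so a single block will not generally reproduce them.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment midline.

    Assumes the BLAST-style convention: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Final block of the KAA0032849.1 alignment (one '+' at position 5).
midline = "DLFD+LHGAKYFSKLDLRSGYYQVRIAE"
identities, positives, columns = midline_stats(midline)
print(identities, positives, columns)              # 27 28 28
print(round(100 * identities / columns, 2))        # 96.43
```

To approximate the per-hit figure, the counts from every block of one hit would need to be summed before dividing, taking care that leading padding is stripped consistently from each midline.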

Sequences
CDS sequence
ATGGGAGCACTCAAATTCTTATCGGCGCTTCAGAAGAAGGCCGAGGAAGTGAAGGAACCTCTGGAACGTGGTCTGATGTATGCGGAGGCGTGGATCAATCAGAGAGCTGC
AAAAAGCACTATGGTAGATTCTGGTGCCACCTACAATTTCATGACAGAAACTGAAGCACGTCGGCTGAACTTGCGATGGGACAAAGATCCAGGAAAGATGAAAGCGGTCA
ACTCGACCGCCCTCCCCATCATGGGAGTTACCAAGAGAGTCTCAGTAAAGCTAAGGACATGGAGTGGACAGGTCGACTTCGTGATAGTGCGGATGGATGACTTCGACGTA
GTATTGGGGATGAAATTTCTGCTGGAACACAAAGTCCTCCCCATGCCCCTCGCTAAGTGCCTGGTTGTAACGGGTTCCGATCCCACAGTTGTTCAGACAAGCATCAAACA
ACCAAGCGGAGTGAAGATGATATCAACACTCCAACTAAAGAAAGGTCTCGCTCGTGACAAGCCAATGCTCATGGCCATCCCTATTGTGGAAGGCGGAAAATCTGAAGAGC
CAGTTCCCAGAGAAATCCAACGAGTCTTGAATGTGTATGCCGATGTAATGCCAGACAACCTGCCCAAAACTCTACCTCCAAGACGTGGGATAGACCACGAGATCGAATTA
TTATCGGGGGCAAAATCGCCCGCCAAGAACGCCTATCGGATGACTCCTTCCGAGTTAGCCGAACTGAGGAAACAACTTGGTGAGTTACTGAATGCTGGATTCATCCGCCC
GGCTAAGGCTCCTTATGGGGCCCCAGTCCTTTTTCAGAAGAAGAAAGACGGAAGCCTCCGCCTATGCATTGACTATCGAGCATTGAACAAACTCACCGTCCGCAACAAGT
ACCCTTTGCCCATCATCACAGACCTTTTCGATCAACTTCATGGGGCAAAGTACTTCTCTAAGCTGGATCTGCGATCTGGGTACTATCAGGTGCGCATCGCAGAATGA
mRNA sequence
ATGGGAGCACTCAAATTCTTATCGGCGCTTCAGAAGAAGGCCGAGGAAGTGAAGGAACCTCTGGAACGTGGTCTGATGTATGCGGAGGCGTGGATCAATCAGAGAGCTGC
AAAAAGCACTATGGTAGATTCTGGTGCCACCTACAATTTCATGACAGAAACTGAAGCACGTCGGCTGAACTTGCGATGGGACAAAGATCCAGGAAAGATGAAAGCGGTCA
ACTCGACCGCCCTCCCCATCATGGGAGTTACCAAGAGAGTCTCAGTAAAGCTAAGGACATGGAGTGGACAGGTCGACTTCGTGATAGTGCGGATGGATGACTTCGACGTA
GTATTGGGGATGAAATTTCTGCTGGAACACAAAGTCCTCCCCATGCCCCTCGCTAAGTGCCTGGTTGTAACGGGTTCCGATCCCACAGTTGTTCAGACAAGCATCAAACA
ACCAAGCGGAGTGAAGATGATATCAACACTCCAACTAAAGAAAGGTCTCGCTCGTGACAAGCCAATGCTCATGGCCATCCCTATTGTGGAAGGCGGAAAATCTGAAGAGC
CAGTTCCCAGAGAAATCCAACGAGTCTTGAATGTGTATGCCGATGTAATGCCAGACAACCTGCCCAAAACTCTACCTCCAAGACGTGGGATAGACCACGAGATCGAATTA
TTATCGGGGGCAAAATCGCCCGCCAAGAACGCCTATCGGATGACTCCTTCCGAGTTAGCCGAACTGAGGAAACAACTTGGTGAGTTACTGAATGCTGGATTCATCCGCCC
GGCTAAGGCTCCTTATGGGGCCCCAGTCCTTTTTCAGAAGAAGAAAGACGGAAGCCTCCGCCTATGCATTGACTATCGAGCATTGAACAAACTCACCGTCCGCAACAAGT
ACCCTTTGCCCATCATCACAGACCTTTTCGATCAACTTCATGGGGCAAAGTACTTCTCTAAGCTGGATCTGCGATCTGGGTACTATCAGGTGCGCATCGCAGAATGA
Protein sequence
MGALKFLSALQKKAEEVKEPLERGLMYAEAWINQRAAKSTMVDSGATYNFMTETEARRLNLRWDKDPGKMKAVNSTALPIMGVTKRVSVKLRTWSGQVDFVIVRMDDFDV
VLGMKFLLEHKVLPMPLAKCLVVTGSDPTVVQTSIKQPSGVKMISTLQLKKGLARDKPMLMAIPIVEGGKSEEPVPREIQRVLNVYADVMPDNLPKTLPPRRGIDHEIEL
LSGAKSPAKNAYRMTPSELAELRKQLGELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTVRNKYPLPIITDLFDQLHGAKYFSKLDLRSGYYQVRIAE
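
The relationship between the CDS and protein sequences above can be checked by translating the CDS. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is a hypothetical helper written for illustration, and only the first and last codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, dropping a single trailing stop codon.

    Whitespace and line breaks are ignored; ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


print(translate("ATGGGAGCACTCAAATTC"))  # MGALKF (first six codons of the CDS)
print(translate("GCAGAATGA"))           # AE (last two codons plus stop)
```

Passing the full CDS text, line breaks included, should give a string that can be compared directly with the protein sequence once its line breaks are removed.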