; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g25890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g25890
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr8:18616034..18617730
RNA-Seq ExpressionMoc08g25890
SyntenyMoc08g25890
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK03044.1 uncharacterized protein E5676_scaffold46G001390 [Cucumis melo var. makuwa]3.6e-16354.47Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RV+E SEK   IDAV  RV+GLPI+++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL+N+   ++++ N M+EDFKVT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCT++ WD LK ELR QFFP+NVE +ARRKLR+LRHTG IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQN-----------AQRPISCFLCKG
        + KLYEQRVQDL +A A+AERL D +S+    ++N ++    N+  +P +PK+ G DKR     PG  R PY  +              +RP+SCF+C+G
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQN-----------AQRPISCFLCKG

Query:  PHRVAECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQE
        PH   ECP++     F+       + K  ++      ++G + T                     S+ GL++VD  IN   AKSTMVDSGATHNFI E E
Subjt:  PHRVAECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQE

Query:  ARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        ARRLKL  EKD+G+MKAVNS ALPIVG+ KR T+KLG W G VDFVVV+MDDFDVVLGMEFL+EH+VIPMP
Subjt:  ARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa]3.6e-16354.06Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++  L K+  DRLVE+EEQ+LYL EVPDS+R LE+R+ E SEK   IDAV  RV+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI+ WD LK ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVA
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVA

Query:  ECPHRAALTPFK-------------------------HPFSRAMNLKLERIRVNGPKGTS-EKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLK
        ECP++ A   F+                         +P   A+       +  G   T  E+GLM+VD  IN  P KSTMVDSGATHNFI E EA+RL 
Subjt:  ECPHRAALTPFK-------------------------HPFSRAMNLKLERIRVNGPKGTS-EKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        L  EKD G+MKAVNS ALPI+G+ KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMP
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

TYK29292.1 uncharacterized protein E5676_scaffold1212G00600 [Cucumis melo var. makuwa]3.6e-16355.03Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RV+E SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
        GEL+N+   ++++ N M+EDF+VT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCT++ WD LK ELR QFFP+NVE +ARRKLR+LRHTG IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRP-------QGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRV
        R KLYEQRVQDL +A A+AERL D +S+    ++N ++    N+  +P +PK+ G DKR        Q       R P  +S N +RP+SCF+C+GPH  
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRP-------QGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRV

Query:  AECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRL
         ECP++     F+       + K  ++      + G + T                     S+ GL++VD  IN   AKSTM+DSGATHNFI E EARRL
Subjt:  AECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRL

Query:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        +L  EKD+GKMKAVNS ALPIVG+ KR T+KLG W G VDFVVV+MDDFDVVLGMEFL+EH+VIPMP
Subjt:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]5.8e-27889.91Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKS+VDRLVEIEEQLLYLREVPD LRLLEARV+EFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTIN+WDDLK ELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFI+GLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQR +S FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL

Query:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK
        T                        + P  RA+  L   + RVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFI EQEARRLKLTIEKDTGKMK
Subjt:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
         VN EALPIVGVSKRV LKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]1.6e-27288.83Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSATKQLSKS+VDRLVEIEEQLLYLREVPDSLRLLEARV+EFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTIN+WDDLK ELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQRP SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL

Query:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK
        T                        + P   A+  L   + RVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFI EQEA RLKLTIEKDTGKMK
Subjt:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        AVNSEALPIVGVSKRVTLKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMP
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

TrEMBL top hitse value%identityAlignment
A0A5D3BV48 Retrotrans_gag domain-containing protein1.8e-16354.47Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RV+E SEK   IDAV  RV+GLPI+++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL+N+   ++++ N M+EDFKVT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCT++ WD LK ELR QFFP+NVE +ARRKLR+LRHTG IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQN-----------AQRPISCFLCKG
        + KLYEQRVQDL +A A+AERL D +S+    ++N ++    N+  +P +PK+ G DKR     PG  R PY  +              +RP+SCF+C+G
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQN-----------AQRPISCFLCKG

Query:  PHRVAECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQE
        PH   ECP++     F+       + K  ++      ++G + T                     S+ GL++VD  IN   AKSTMVDSGATHNFI E E
Subjt:  PHRVAECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQE

Query:  ARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        ARRLKL  EKD+G+MKAVNS ALPIVG+ KR T+KLG W G VDFVVV+MDDFDVVLGMEFL+EH+VIPMP
Subjt:  ARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

A0A5D3BYE6 Reverse transcriptase1.8e-16354.06Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++  L K+  DRLVE+EEQ+LYL EVPDS+R LE+R+ E SEK   IDAV  RV+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI+ WD LK ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVA
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVA

Query:  ECPHRAALTPFK-------------------------HPFSRAMNLKLERIRVNGPKGTS-EKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLK
        ECP++ A   F+                         +P   A+       +  G   T  E+GLM+VD  IN  P KSTMVDSGATHNFI E EA+RL 
Subjt:  ECPHRAALTPFK-------------------------HPFSRAMNLKLERIRVNGPKGTS-EKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        L  EKD G+MKAVNS ALPI+G+ KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMP
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

A0A5D3E078 Retrotrans_gag domain-containing protein1.8e-16355.03Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RV+E SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
        GEL+N+   ++++ N M+EDF+VT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCT++ WD LK ELR QFFP+NVE +ARRKLR+LRHTG IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRP-------QGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRV
        R KLYEQRVQDL +A A+AERL D +S+    ++N ++    N+  +P +PK+ G DKR        Q       R P  +S N +RP+SCF+C+GPH  
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRP-------QGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRV

Query:  AECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRL
         ECP++     F+       + K  ++      + G + T                     S+ GL++VD  IN   AKSTM+DSGATHNFI E EARRL
Subjt:  AECPHRAALTPFKHPFSRAMNLKLERIR-----VNGPKGT---------------------SEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRL

Query:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        +L  EKD+GKMKAVNS ALPIVG+ KR T+KLG W G VDFVVV+MDDFDVVLGMEFL+EH+VIPMP
Subjt:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

A0A6J1D906 Reverse transcriptase2.1e-27890.09Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKS+VDRLVEIEEQLLYLREVPD LRLLEARV+EFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTIN+WDDLK ELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQR +S FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL

Query:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK
        T                        + P  RA+  L   + RVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFI EQEARRLKLTIEKDTGKMK
Subjt:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
         VN EALPIVGVSKRV LKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

A0A6J1DK29 uncharacterized protein LOC1110218297.9e-27388.83Show/hide
Query:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSATKQLSKS+VDRLVEIEEQLLYLREVPDSLRLLEARV+EFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTIN+WDDLK ELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQRP SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAAL

Query:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK
        T                        + P   A+  L   + RVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFI EQEA RLKLTIEKDTGKMK
Subjt:  TPF----------------------KHPFSRAMN-LKLERIRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP
        AVNSEALPIVGVSKRVTLKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMP
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCGACAAAACAACTGAGCAAGTCGAACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCTGATTCCCTCCGTCTGCTG
GAGGCGCGAGTTAATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACC
CTAGAAAGCAAAGCTACACGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGAGAGTTAAACAACTCGCAT
TCGGCAATGATGCAATTGTTTAACGAAATGACAGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACGGAAATAAGCACTAGAGTGAACCTAACC
ATGCGAGCCGTGGGAAATCAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTCAATGGCAATAGAGACGCCAAAGAT
CTCGAGAACTTCCTGTTCGACGTAGAACAGTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCA
AAGCTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAACTGGGATGATCTGAAGACAGAATTGAGGGGTCAGTTCTTCCCCGAC
AATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTTCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCGTGATGCTGGATATTCGCGAC
ATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAAACCGTGGGCCCGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCC
TCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAAAGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAG
AGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGACCCTATCCACAAAGTCAAAATGCTCAAAGGCCGATATCATGCTTCTTGTGC
AAAGGTCCCCATCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACGCCCTTCAAGCATCCATTCAGTCGTGCAATGAACCTGAAGTTGGAACGGATTAGGGTC
AATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAAC
TTCATATTAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTGGGAGTTTCT
AAAAGAGTGACGTTGAAATTAGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAA
CATAAAGTCATACCGATGCCTGACTGGGATGTTTTTGCTGAACGCACGCGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGCGACAAAACAACTGAGCAAGTCGAACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCTGATTCCCTCCGTCTGCTG
GAGGCGCGAGTTAATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACC
CTAGAAAGCAAAGCTACACGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGAGAGTTAAACAACTCGCAT
TCGGCAATGATGCAATTGTTTAACGAAATGACAGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACGGAAATAAGCACTAGAGTGAACCTAACC
ATGCGAGCCGTGGGAAATCAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTCAATGGCAATAGAGACGCCAAAGAT
CTCGAGAACTTCCTGTTCGACGTAGAACAGTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCA
AAGCTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAACTGGGATGATCTGAAGACAGAATTGAGGGGTCAGTTCTTCCCCGAC
AATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTTCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCGTGATGCTGGATATTCGCGAC
ATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAAACCGTGGGCCCGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCC
TCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAAAGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAG
AGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGACCCTATCCACAAAGTCAAAATGCTCAAAGGCCGATATCATGCTTCTTGTGC
AAAGGTCCCCATCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACGCCCTTCAAGCATCCATTCAGTCGTGCAATGAACCTGAAGTTGGAACGGATTAGGGTC
AATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAAC
TTCATATTAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTGGGAGTTTCT
AAAAGAGTGACGTTGAAATTAGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAA
CATAAAGTCATACCGATGCCTGACTGGGATGTTTTTGCTGAACGCACGCGCTAA
Protein sequenceShow/hide protein sequence
MSATKQLSKSNVDRLVEIEEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSH
SAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDA
KLWWRSKVNDIQNGRCTINNWDDLKTELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMA
SAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTPFKHPFSRAMNLKLERIRV
NGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFILEQEARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIE
HKVIPMPDWDVFAERTR