; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g03410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g03410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr8:2479100..2480949
RNA-Seq ExpressionMoc08g03410
SyntenyMoc08g03410
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa]2.0e-16153.25Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------
        MS++  L K+  DRLVE+EEQ+LYL EVPDS+R LE+R++E SEK   ID V  RV+G PIQ++  RV+ LE+                           
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+ T+D +R E+ +++ R+++TMRA+ NQAP    +  +++K+PEPKPF G RD K LEN++FD+EQYF+AT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+ KGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E +  + E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VI MPL K
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]2.2e-27789.78Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEID VNAR+DGLPIQDIAMRVETLESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVN+TMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRD KDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFI+GLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL
        KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQR MS FL KGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEVETDCEKEEDEETPRM ALKFLSAIQKRVNGPK TSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
         VN EALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVI MPL K
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]1.5e-27087.99Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEID VNARVDGLPIQDIAMRVET ESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVN+TMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQRP SCFL +GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRMGALKFLSAIQKRVNGPK TSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        AVNSEALPIVGVSKRV LKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVI MPL K
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]2.6e-16154.16Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------
        MS TK   K+  DRL  IEE++L+L+EVPD+LR LE RV E SEK  ++D +  R+DGLPI ++  RV +LE +     + +P D               
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------

Query:  ------------------FKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT
                          F+ T+D L+  M+ +STR+ +TM+AV      Q + G NKL+ P+P+ F GNRD K+LENF+FDVEQYFKAT   +++ KVT
Subjt:  ------------------FKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        +A M+L DDAKLWWR+KV DI++G CTI+SW+DLK+ELR QF P+N   +A  K+  L+HTG IRDYV+QFS +MLDIR  +EKDKVF FI GL+PWA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAE
        K++E RVQ LA AMA AERL+D  +E    ++    P  G K ++P   ++G         D+ PQ    G SRGPY Q  +   P+ C L KGPH+V+ 
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAE

Query:  CPHRAALTALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE
        CPHRA+LTALQ S+Q  NE  VET  +K+ED + PRMGALKFLSA+Q++V  PK   EKGLMFVDATIN   +KST++DSGATHNFI++QEARRL LTI 
Subjt:  CPHRAALTALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE

Query:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        KD GKMKAVNSEALPIVGVSK V  K+G WTG +D VVVRMDDFDVVLGMEFL+EHKVI MPL K
Subjt:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]2.6e-16154.16Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------
        MS TK   K+  DRL  IEE++L+L+EVPD+LR LE RV E SEK  ++D +  R+DGLPI ++  RV +LE +     + +P D               
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------

Query:  ------------------FKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT
                          F+ T+D L+  M+ +STR+ +TM+AV      Q + G NKL+ P+P+ F GNRD K+LENF+FDVEQYFKAT   +++ KVT
Subjt:  ------------------FKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        +A M+L DDAKLWWR+KV DI++G CTI+SW+DLK+ELR QF P+N   +A  K+  L+HTG IRDYV+QFS +MLDIR  +EKDKVF FI GL+PWA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAE
        K++E RVQ LA AMA AERL+D  +E    ++    P  G K ++P   ++G         D+ PQ    G SRGPY Q  +   P+ C L KGPH+V+ 
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAE

Query:  CPHRAALTALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE
        CPHRA+LTALQ S+Q  NE  VET  +K+ED + PRMGALKFLSA+Q++V  PK   EKGLMFVDATIN   +KST++DSGATHNFI++QEARRL LTI 
Subjt:  CPHRAALTALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE

Query:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        KD GKMKAVNSEALPIVGVSK V  K+G WTG +D VVVRMDDFDVVLGMEFL+EHKVI MPL K
Subjt:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

TrEMBL top hitse value%identityAlignment
A0A5A7UIP7 Reverse transcriptase2.2e-16153.08Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  DRLVE+EEQ+LYL EVPDS+R LE+R++E SEK   ID V  RV+G PIQ++  RV+ LE+                           
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+ T+D +R E+ +++ R+++TMRA+ NQAP    +  +++K+PEPKPF G RD K LEN++FD+EQYF+AT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+ KGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E +  + E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VI MPL K
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

A0A5D3BYE6 Reverse transcriptase9.8e-16253.25Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------
        MS++  L K+  DRLVE+EEQ+LYL EVPDS+R LE+R++E SEK   ID V  RV+G PIQ++  RV+ LE+                           
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+ T+D +R E+ +++ R+++TMRA+ NQAP    +  +++K+PEPKPF G RD K LEN++FD+EQYF+AT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+ KGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E +  + E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VI MPL K
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

A0A5D3DQ20 Retrotrans_gag domain-containing protein2.2e-16153.08Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  DRLVE+EEQ+LYL EVPDS+R LE+R++E SEK   ID V  RV+G PIQ++  RV+ LE+                           
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+ T+D +R E+ +++ R+++TMRA+ NQAP    +  +++K+PEPKPF G RD K LEN++FD+EQYF+AT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+ KGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E + ++ E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VI MPL K
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

A0A6J1D906 Reverse transcriptase8.3e-27889.96Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEID VNAR+DGLPIQDIAMRVETLESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVN+TMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRD KDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL
        KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQR MS FL KGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEVETDCEKEEDEETPRM ALKFLSAIQKRVNGPK TSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
         VN EALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVI MPL K
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

A0A6J1DK29 uncharacterized protein LOC1110218297.5e-27187.99Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEID VNARVDGLPIQDIAMRVET ESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVN+TMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNITMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQRP SCFL +GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPMSCFLYKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRMGALKFLSAIQKRVNGPK TSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK
        AVNSEALPIVGVSKRV LKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVI MPL K
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVK

SwissProt top hitse value%identityAlignment
Q54JB0 Protein DDI1 homolog1.6e-0433.01Show/hide
Query:  LMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDT---GKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHK
        +++++ +IN +P K+  VD+GA  + +SE+ A R +++   DT   G  K V +    I+G      LKLG    SV   +++  D D +LG++ L  H+
Subjt:  LMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDT---GKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHK

Query:  VIL
        VIL
Subjt:  VIL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCGACAAAACAACTAAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACACAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACATAACCATGCGAGCCGTGGGAAATCAGGCT
CCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACACAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACA
GTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTC
AGAATGGTCGATGCACGATCAATAGCTGGGATGACCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTC
CGACACACTGGAACAATTCGGGACTACGTGAAACAATTCTCTGCCGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAA
ACCGTGGGCCAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCGCTGCCGAAAGATTGCTAGACTATAACAGTGAACCATCCCACCCGAAAA
AGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGA
CCCTATCCACAAAGTCAAAACGCTCAAAGGCCGATGTCATGCTTCTTGTACAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGC
ATCCGTTCAGTCGTGCAATGAACCTGAAGTCGAAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATCTGCCATTCAAAAGA
GGGTCAATGGACCCAAAAGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCCGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAAC
TTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTAGGAGTTTCTAAAAG
AGTGATGTTGAAATTGGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAAGTCA
TACTGATGCCTCTGGTCAAGTTTGAAGAAAGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGCGACAAAACAACTAAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACACAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACATAACCATGCGAGCCGTGGGAAATCAGGCT
CCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACACAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACA
GTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTC
AGAATGGTCGATGCACGATCAATAGCTGGGATGACCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTC
CGACACACTGGAACAATTCGGGACTACGTGAAACAATTCTCTGCCGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAA
ACCGTGGGCCAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCGCTGCCGAAAGATTGCTAGACTATAACAGTGAACCATCCCACCCGAAAA
AGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGA
CCCTATCCACAAAGTCAAAACGCTCAAAGGCCGATGTCATGCTTCTTGTACAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGC
ATCCGTTCAGTCGTGCAATGAACCTGAAGTCGAAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATCTGCCATTCAAAAGA
GGGTCAATGGACCCAAAAGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCCGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAAC
TTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTAGGAGTTTCTAAAAG
AGTGATGTTGAAATTGGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAAGTCA
TACTGATGCCTCTGGTCAAGTTTGAAGAAAGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTGA
Protein sequenceShow/hide protein sequence
MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDTVNARVDGLPIQDIAMRVETLESKATRPEDFKVTIDTLRAEMTEISTRVNITMRAVGNQA
PNQANMGFNKLKVPEPKPFNGNRDTKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLREL
RHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRG
PYPQSQNAQRPMSCFLYKGPHRVAECPHRAALTALQASVQSCNEPEVETDCEKEEDEETPRMGALKFLSAIQKRVNGPKRTSEKGLMFVDATINCNPAKSTMVDSGATHN
FISEQEARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVILMPLVKFEERPQPGGTYLYGHSDGRTAGGDKRCPT