CuGenDBv2

Moc07g24110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g24110
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr7:17994869..17996715
RNA-Seq Expression: Moc07g24110
Synteny: Moc07g24110
Gene Ontology terms:
  GO:0006259 - DNA metabolic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004518 - nuclease activity (molecular function)
  GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
  IPR005162 - Retrotransposon gag domain
  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0053339.1 reverse transcriptase [Cucumis melo var. makuwa] | e-value: 2.5e-173 | % identity: 55.89
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+ EQ+LYL EVPDS+R LE+R+ E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +E+ K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK

Query:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL+ DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA

Query:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  AFQAS+ S +   + + E +  + E+ + PRMGALKFL ++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+  VDFVVV+MDDFDVVLGMEFL+EH+VIPM LAK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa] | e-value: 1.1e-173 | % identity: 56.06
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++  L K+  DRLVE+ EQ+LYL EVPDS+R LE+R+ E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +E+ K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK

Query:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL+ DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA

Query:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  AFQAS+ S +   + + E +  + E+ + PRMGALKFL ++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+  VDFVVV+MDDFDVVLGMEFL+EH+VIPM LAK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

TYK25585.1 uncharacterized protein E5676_scaffold352G007440 [Cucumis melo var. makuwa] | e-value: 2.5e-173 | % identity: 55.89
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+ EQ+LYL EVPDS+R LE+R+ E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +E+ K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK

Query:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL+ DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA

Query:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  AFQAS+ S +   + + E + ++ E+ + PRMGALKFL ++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+  VDFVVV+MDDFDVVLGMEFL+EH+VIPM LAK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | e-value: 1.2e-305 | % identity: 96.77
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSTTKQLSKSHVDRLVEI EQLLYLREVPD LRLLEARV+EFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT
        LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSE+MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT

Query:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLT+DAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSA+M+DIRDMSEKDKVFVFI+GLK WART
Subjt:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMS FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL

Query:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TA QASVQSCNEPEVETDCEKEEDEETPRM ALKFL AIQKRVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
         VN EALPIVGVSKRVMLKLGTWT SVDFVVVRMDDFDVVLGMEFLIEHKVIPM LAK
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia] | e-value: 4.3e-295 | % identity: 93.73
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKSHVDRLVEI EQLLYLREVPDSLRLLEARV+EFSEKFGEIDAVNAR+DGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT
        LNNSHS MMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT

Query:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLT+DAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSA+MLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQR  SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL

Query:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TA QASVQSCNEPEV TDCEKEEDEETPRMGALKFL AIQKRVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        AVNSEALPIVGVSKRV LKLGTWT S DFVVVRMDDFDVVLGMEFLIEHKVIPM LAK
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7UIP7 Reverse transcriptase | e-value: 1.2e-173 | % identity: 55.89
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+ EQ+LYL EVPDS+R LE+R+ E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +E+ K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK

Query:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL+ DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA

Query:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  AFQAS+ S +   + + E +  + E+ + PRMGALKFL ++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+  VDFVVV+MDDFDVVLGMEFL+EH+VIPM LAK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

A0A5D3BYE6 Reverse transcriptase | e-value: 5.3e-174 | % identity: 56.06
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++  L K+  DRLVE+ EQ+LYL EVPDS+R LE+R+ E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +E+ K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK

Query:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL+ DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA

Query:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  AFQAS+ S +   + + E +  + E+ + PRMGALKFL ++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+  VDFVVV+MDDFDVVLGMEFL+EH+VIPM LAK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

A0A5D3DQ20 Retrotrans_gag domain-containing protein | e-value: 1.2e-173 | % identity: 55.89
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+ EQ+LYL EVPDS+R LE+R+ E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +E+ K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMK

Query:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL+ DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVA

Query:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  AFQAS+ S +   + + E + ++ E+ + PRMGALKFL ++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTAFQASVQSCN---EPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+  VDFVVV+MDDFDVVLGMEFL+EH+VIPM LAK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

A0A6J1D906 Reverse transcriptase | e-value: 4.5e-306 | % identity: 96.95
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSTTKQLSKSHVDRLVEI EQLLYLREVPD LRLLEARV+EFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT
        LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSE+MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT

Query:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLT+DAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSA+M+DIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMS FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL

Query:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TA QASVQSCNEPEVETDCEKEEDEETPRM ALKFL AIQKRVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
         VN EALPIVGVSKRVMLKLGTWT SVDFVVVRMDDFDVVLGMEFLIEHKVIPM LAK
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

A0A6J1DK29 uncharacterized protein LOC111021829 | e-value: 2.1e-295 | % identity: 93.73
Query:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKSHVDRLVEI EQLLYLREVPDSLRLLEARV+EFSEKFGEIDAVNAR+DGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEI-EQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT
        LNNSHS MMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVT

Query:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLT+DAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSA+MLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTNDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQR  SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAAL

Query:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TA QASVQSCNEPEV TDCEKEEDEETPRMGALKFL AIQKRVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK
        AVNSEALPIVGVSKRV LKLGTWT S DFVVVRMDDFDVVLGMEFLIEHKVIPM LAK
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKVIPMTLAK

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGCGCG
AGTTAATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTATAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTCGAGACCCTAGAAAGCAAAG
CTACGCGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGTGAGTTAAATAACTCGCATTCCGCAATGATGCAATTG
TTTAACGAAATGACAGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAGGC
TCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAAC
AGTACTTCAAGGCTACGGGGACCACGTCAGAAAAGATGAAAGTGACTTTGGCCACCATGCATCTTACTAATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATT
CAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCTGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACT
CCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCATGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGA
AACCGTGGGCGAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCGCTGCCGAAAGATTGCTAGACTATAACAGTGAACCATCCCACCCGAAA
AAGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGGCCTCAAGGCCCGAATCCCGGTCCTTCCCGAGG
ACCCTATCCACAAAGTCAAAACGCTCAAAGGCTGATGTCATGCTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCTTTCAAG
CATCCGTTCAGTCGTGCAATGAACCTGAAGTCGAAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATATGCCATTCAAAAG
AGGGTCAATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCAGGTGCAACCCACAA
CTTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTAGGAGTTTCTAAAA
GAGTGATGTTGAAATTGGGGACGTGGACAGACAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAAGTC
ATACCGATGACTCTGGCCAAGTTTGAAGAAAGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTG
A
mRNA sequence
ATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGCGCG
AGTTAATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTATAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTCGAGACCCTAGAAAGCAAAG
CTACGCGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGTGAGTTAAATAACTCGCATTCCGCAATGATGCAATTG
TTTAACGAAATGACAGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAGGC
TCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAAC
AGTACTTCAAGGCTACGGGGACCACGTCAGAAAAGATGAAAGTGACTTTGGCCACCATGCATCTTACTAATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATT
CAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCTGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACT
CCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCATGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGA
AACCGTGGGCGAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCGCTGCCGAAAGATTGCTAGACTATAACAGTGAACCATCCCACCCGAAA
AAGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGGCCTCAAGGCCCGAATCCCGGTCCTTCCCGAGG
ACCCTATCCACAAAGTCAAAACGCTCAAAGGCTGATGTCATGCTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCTTTCAAG
CATCCGTTCAGTCGTGCAATGAACCTGAAGTCGAAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATATGCCATTCAAAAG
AGGGTCAATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCAGGTGCAACCCACAA
CTTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTAGGAGTTTCTAAAA
GAGTGATGTTGAAATTGGGGACGTGGACAGACAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAAGTC
ATACCGATGACTCTGGCCAAGTTTGAAGAAAGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTG
A
Protein sequence
MSTTKQLSKSHVDRLVEIEQLLYLREVPDSLRLLEARVNEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQL
FNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEKMKVTLATMHLTNDAKLWWRSKVNDI
QNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAMMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMAAAERLLDYNSEPSHPK
KNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSCFLCKGPHRVAECPHRAALTAFQASVQSCNEPEVETDCEKEEDEETPRMGALKFLYAIQK
RVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTDSVDFVVVRMDDFDVVLGMEFLIEHKV
IPMTLAKFEERPQPGGTYLYGHSDGRTAGGDKRCPT