; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr8:12498985..12500834
RNA-Seq ExpressionMoc08g16280
SyntenyMoc08g16280
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035456.1 uncharacterized protein E6C27_scaffold285G00960 [Cucumis melo var. makuwa]6.7e-17155.36Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLES--KATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+EEQ+LYL EVPD +R LE+R+DE  EK   IDAV  R++G PIQ++  RV+ LE+  K  R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLES--KATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ N AP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IRDYVKQF+ +M+DIRDMSEKDKVF F+EGLKLWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +S F+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK
        ECP++ A +A QAS+ S +   + ++E + ++ E+ + PRM ALKFLS++QK+V       E+GLM+VD  IN    KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        L  EKD G+MK VN  ALPI+G+ KR M++LG W+G VDFVVV+MDDF VVLGMEFL+EH+VIPMPLAK
Subjt:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

KAA0053339.1 reverse transcriptase [Cucumis melo var. makuwa]8.7e-17155.18Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+EEQ+LYL EVPD +R LE+R++E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +M+DIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +S F+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E +  + E+ + PRM ALKFLS++QK+V       E+GLM+VD  IN    KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        L  EKD G+MK VN  ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAK
Subjt:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa]3.9e-17155.36Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++  L K+  DRLVE+EEQ+LYL EVPD +R LE+R++E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +M+DIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +S F+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E +  + E+ + PRM ALKFLS++QK+V       E+GLM+VD  IN    KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        L  EKD G+MK VN  ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAK
Subjt:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]0.0e+0099.64Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFI+GLKLWART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
Subjt:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]2.3e-29694.09Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFIEGLKLWART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQR  S FLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRM ALKFLSAIQKRVNGPKGTSEKGLMFVDA INCN AKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
         VN EALPIVGVSKRV LKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
Subjt:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

TrEMBL top hitse value%identityAlignment
A0A5A7SZS9 Retrotrans_gag domain-containing protein3.2e-17155.36Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLES--KATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+EEQ+LYL EVPD +R LE+R+DE  EK   IDAV  R++G PIQ++  RV+ LE+  K  R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLES--KATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ N AP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IRDYVKQF+ +M+DIRDMSEKDKVF F+EGLKLWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +S F+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK
        ECP++ A +A QAS+ S +   + ++E + ++ E+ + PRM ALKFLS++QK+V       E+GLM+VD  IN    KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        L  EKD G+MK VN  ALPI+G+ KR M++LG W+G VDFVVV+MDDF VVLGMEFL+EH+VIPMPLAK
Subjt:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

A0A5D3BYE6 Reverse transcriptase1.9e-17155.36Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++  L K+  DRLVE+EEQ+LYL EVPD +R LE+R++E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +M+DIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +S F+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E +  + E+ + PRM ALKFLS++QK+V       E+GLM+VD  IN    KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        L  EKD G+MK VN  ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAK
Subjt:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

A0A5D3DQ20 Retrotrans_gag domain-containing protein4.2e-17155.18Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVE+EEQ+LYL EVPD +R LE+R++E SEK   IDAV  R++G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +M+DIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWA

Query:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA
        +TKLYEQRVQDL +A AAAERL D +++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +S F+CKGPH   
Subjt:  RTKLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + + E + ++ E+ + PRM ALKFLS++QK+V       E+GLM+VD  IN    KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        L  EKD G+MK VN  ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAK
Subjt:  LTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

A0A6J1D906 Reverse transcriptase0.0e+0099.82Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
        VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
Subjt:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

A0A6J1DK29 uncharacterized protein LOC1110218291.1e-29694.09Show/hide
Query:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFIEGLKLWART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWART

Query:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQR  S FLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMAAAERLLDYNSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRM ALKFLSAIQKRVNGPKGTSEKGLMFVDA INCN AKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
         VN EALPIVGVSKRV LKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK
Subjt:  VVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTTCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTATAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGTGAGTTAAATAACTCGCATTCCGCCATGATGCAA
TTGTTTAACGAAATGACAGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCTGTGGGAAATCA
GGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAG
AACAGTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGAC
ATTCAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGA
ACTCCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCGTGATGATGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGAT
TGAAACTGTGGGCCAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCGCTGCCGAAAGATTGCTAGACTATAACAGTGAACCATCCCACCCG
AAAAAGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCG
AGGACCCTATCCACAAAGTCAAAACGCTCAAAGGCTGATGTCATACTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTC
AAGCATCCGTTCAGTCGTGCAATGAACCTGAAGTCGAAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGAGGGCGTTAAAATTCCTATCTGCCATTCAA
AAGAGGGTCAATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACTCTGCAAAGAGCACCATGGTGGATTCAGGTGCAACCCA
CAACTTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAAATGAAGGTCGTCAACTTCGAAGCTCTACCCATTGTAGGAGTTTCTA
AAAGAGTGATGTTGAAATTGGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAA
GTCATACCGATGCCTCTGGCCAAGTTTGAAGAAAGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCAC
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTTCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTATAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGTGAGTTAAATAACTCGCATTCCGCCATGATGCAA
TTGTTTAACGAAATGACAGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCTGTGGGAAATCA
GGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAG
AACAGTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGAC
ATTCAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGA
ACTCCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCGTGATGATGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGAT
TGAAACTGTGGGCCAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCGCTGCCGAAAGATTGCTAGACTATAACAGTGAACCATCCCACCCG
AAAAAGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCG
AGGACCCTATCCACAAAGTCAAAACGCTCAAAGGCTGATGTCATACTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTC
AAGCATCCGTTCAGTCGTGCAATGAACCTGAAGTCGAAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGAGGGCGTTAAAATTCCTATCTGCCATTCAA
AAGAGGGTCAATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACTCTGCAAAGAGCACCATGGTGGATTCAGGTGCAACCCA
CAACTTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAAATGAAGGTCGTCAACTTCGAAGCTCTACCCATTGTAGGAGTTTCTA
AAAGAGTGATGTTGAAATTGGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAA
GTCATACCGATGCCTCTGGCCAAGTTTGAAGAAAGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCAC
CTGA
Protein sequenceShow/hide protein sequence
MSTTKQLSKSHVDRLVEIEEQLLYLREVPDFLRLLEARVDEFSEKFGEIDAVNARIDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQ
LFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND
IQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMMDIRDMSEKDKVFVFIEGLKLWARTKLYEQRVQDLATAMAAAERLLDYNSEPSHP
KKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLMSYFLCKGPHRVAECPHRAALTALQASVQSCNEPEVETDCEKEEDEETPRMRALKFLSAIQ
KRVNGPKGTSEKGLMFVDATINCNSAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMKVVNFEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHK
VIPMPLAKFEERPQPGGTYLYGHSDGRTAGGDKRCPT