CuGenDBv2

Moc07g09860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g09860
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr7:7552827..7556066
RNA-Seq Expression: Moc07g09860
Synteny: Moc07g09860
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
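The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a string such as 'chr7:7552827..7556066' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr7:7552827..7556066")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 3240 bp on chr7, which is longer than the 2100 nt CDS listed under Sequences.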


Homology
GenBank top hits (e value, %identity, alignment)
KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.2e-187; %identity: 55.33)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK
        LYL EVPDS+R LE+R++E SEK   IDAV   V+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+ EL++S   ++++ N M+EDF+
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK

Query:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND
         T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE KVTLATMHL++DAKLWWRS+  D
Subjt:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND

Query:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL
        IQ GRCTI+ WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA+TKLYEQRVQDL +A A+AERL
Subjt:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL

Query:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------
         D S++    +++ ++  GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   ECP++ A  A QAS+       
Subjt:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------

Query:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG
        QS  E EV  T  V+ P+  +                  E+GLM+VD  IN  P KS MVDSGATHNFI+E EA RL L  EKD G+MKAVNS ALPI+G
Subjt:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG

Query:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET
        + KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T  +P+VV   ++QP G++MISA+QLKKGLSR+EPTFMAIP+       
Subjt:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET

Query:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR
          VP EI  V+ +Y D+MPDSLPK+LPPR
Subjt:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR

KAA0053339.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 5.5e-188; %identity: 55.33)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK
        LYL EVPDS+R LE+R++E SEK   IDAV   V+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+ EL++S   ++++ N M+EDF+
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK

Query:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND
         T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE KVTLATMHL++DAKLWWRS+  D
Subjt:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND

Query:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL
        IQ GRCTI+ WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA+TKLYEQRVQDL +A A+AERL
Subjt:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL

Query:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------
         D S++    +++ ++  GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   ECP++ A  A QAS+       
Subjt:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------

Query:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG
        QS  E EV  T  V+ P+  +                  E+GLM+VD  IN  P KS MVDSGATHNFI+E EA RL L  EKD G+MKAVNS ALPI+G
Subjt:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG

Query:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET
        + KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T  +P+VV   ++QP G++MISA+QLKKGLSR+EPTFMAIP+       
Subjt:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET

Query:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR
          VP EI  V+ +Y D+MPDSLPK+LPPR
Subjt:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR

KAA0065409.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 9.4e-188; %identity: 55.33)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK
        LYL EVPDS+R LE+R++E SEK   IDAV   V+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+ EL++S   ++++ N M+EDF+
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK

Query:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND
         T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE KVTLATMHL++DAKLWWRS+  D
Subjt:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND

Query:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL
        IQ GRCTI+ WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA+TKLYEQRVQDL +A A+AERL
Subjt:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL

Query:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------
         D S++    +++ ++  GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   ECP++ A  A QAS+       
Subjt:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------

Query:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG
        QS  E EV  T  V+ P+  +                  E+GLM+VD  IN  P KS MVDSGATHNFI+E EA RL L  EKD G+MKAVNS ALPI+G
Subjt:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG

Query:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET
        + KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T  +P+VV   ++QP G++MISA+QLKKGLSR+EPTFMAIP+       
Subjt:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET

Query:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR
          VP EI  V+ +Y D+MPDSLPK+LPPR
Subjt:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] (e value: 0.0e+00; %identity: 89.66)
Query:  TTEQVARRPTGR---DRRTTLYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELN
        TT+Q+++    R        LYLREVPD LRLLEARVDEFSEKFGEIDAVNA +DGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELN
Subjt:  TTEQVARRPTGR---DRRTTLYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELN

Query:  NSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLA
        NSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLA
Subjt:  NSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLA

Query:  TMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKL
        TMHLTDDAKLWWRSKVNDIQNGRCTIN+WDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFI+GLK WARTKL
Subjt:  TMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKL

Query:  YEQRVQDLATAMASAERLLDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTA
        YEQRVQDLATAMA+AERLLDY+SEPSHPKKNATN  GGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQR +S FLCKGPHRVAECPHRAALTA
Subjt:  YEQRVQDLATAMASAERLLDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTA

Query:  LQASVQSCNETEVGT-----------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAV
        LQASVQSCNE EV T                        RVNGPKGTSEKGLMFVDA INCN AKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK V
Subjt:  LQASVQSCNETEVGT-----------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAV

Query:  NSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAI
        N EALPIVGVSKRV LKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVT SIKQPGGIRMISALQLKKGL+REEPTFMAI
Subjt:  NSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAI

Query:  PMVEQPVETRDVPPEIQVVMREYVDIMPDSLPKTLPPR
        PMVEQPVETRDVPPEIQVVM+EYVDIMPDSLPKTLPPR
Subjt:  PMVEQPVETRDVPPEIQVVMREYVDIMPDSLPKTLPPR

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia] (e value: 0.0e+00; %identity: 90.94)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFKVT
        LYLREVPDSLRLLEARVDEFSEKFGEIDAVNA VDGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHS MMQLFNEMTEDFKVT
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFKVT

Query:  IDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQ
        IDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVTLATMHLTDDAKLWWRSKVNDIQ
Subjt:  IDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQ

Query:  NGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERLLD
        NGRCTIN+WDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WARTKLYEQRVQDLATAMASAERLLD
Subjt:  NGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERLLD

Query:  YSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASVQSCNETEVGT-----
        YSSEPSHPKKNATN  GGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQRP SCFLC+GPH+VAECPHRAALTALQASVQSCNE EVGT     
Subjt:  YSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASVQSCNETEVGT-----

Query:  ------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGT
                           RVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGT
Subjt:  ------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGT

Query:  WTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVETRDVPPEIQVVM
        WTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGL+REEPTFM      QPVETRDVPPEIQVVM
Subjt:  WTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVETRDVPPEIQVVM

Query:  REYVDIMPDSLPKTLPPR
        REYVDIMPDSLPKTLPPR
Subjt:  REYVDIMPDSLPKTLPPR
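The middle line of each alignment block follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one alignment segment. The function name `midline_stats` is hypothetical, and the example strings are the final segment of the KAA0037220.1 alignment above. It does not reproduce the reported %identity values, which are computed over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    Assumes `query` and `midline` cover the same alignment columns,
    so they must have the same length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must be the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        # BLAST reports "positives" as identities plus conservative matches.
        "positives": identities + conservative,
    }


query = "RDVPPEIQVVMREYVDIMPDSLPKTLPPR"
midline = "  VP EI  V+ +Y D+MPDSLPK+LPPR"
stats = midline_stats(query, midline)
print(stats)
print(round(100 * stats["identities"] / stats["length"], 2))
```

Leading spaces in the midline are significant, so the prefix used to align it under `Query:` has to be stripped by a fixed width rather than with a general whitespace trim.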

TrEMBL top hits (e value, %identity, alignment)
A0A5A7UIP7 Reverse transcriptase (e value: 2.7e-188; %identity: 55.33)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK
        LYL EVPDS+R LE+R++E SEK   IDAV   V+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+ EL++S   ++++ N M+EDF+
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK

Query:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND
         T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE KVTLATMHL++DAKLWWRS+  D
Subjt:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND

Query:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL
        IQ GRCTI+ WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA+TKLYEQRVQDL +A A+AERL
Subjt:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL

Query:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------
         D S++    +++ ++  GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   ECP++ A  A QAS+       
Subjt:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------

Query:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG
        QS  E EV  T  V+ P+  +                  E+GLM+VD  IN  P KS MVDSGATHNFI+E EA RL L  EKD G+MKAVNS ALPI+G
Subjt:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG

Query:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET
        + KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T  +P+VV   ++QP G++MISA+QLKKGLSR+EPTFMAIP+       
Subjt:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET

Query:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR
          VP EI  V+ +Y D+MPDSLPK+LPPR
Subjt:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR

A0A5A7VIW9 Reverse transcriptase (e value: 4.5e-188; %identity: 55.33)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK
        LYL EVPDS+R LE+R++E SEK   IDAV   V+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+ EL++S   ++++ N M+EDF+
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK

Query:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND
         T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE KVTLATMHL++DAKLWWRS+  D
Subjt:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND

Query:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL
        IQ GRCTI+ WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA+TKLYEQRVQDL +A A+AERL
Subjt:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL

Query:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------
         D S++    +++ ++  GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   ECP++ A  A QAS+       
Subjt:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------

Query:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG
        QS  E EV  T  V+ P+  +                  E+GLM+VD  IN  P KS MVDSGATHNFI+E EA RL L  EKD G+MKAVNS ALPI+G
Subjt:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG

Query:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET
        + KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T  +P+VV   ++QP G++MISA+QLKKGLSR+EPTFMAIP+       
Subjt:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET

Query:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR
          VP EI  V+ +Y D+MPDSLPK+LPPR
Subjt:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR

A0A5D3C4R1 Reverse transcriptase (e value: 5.9e-188; %identity: 55.33)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK
        LYL EVPDS+R LE+R++E SEK   IDAV   V+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+ EL++S   ++++ N M+EDF+
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKAT--RPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFK

Query:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND
         T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE KVTLATMHL++DAKLWWRS+  D
Subjt:  VTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVND

Query:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL
        IQ GRCTI+ WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA+TKLYEQRVQDL +A A+AERL
Subjt:  IQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERL

Query:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------
         D S++    +++ ++  GG++  +P +PK+ G D+R  G       N G S         + RP+SCF+CKGPH   ECP++ A  A QAS+       
Subjt:  LDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASV-------

Query:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG
        QS  E EV  T  V+ P+  +                  E+GLM+VD  IN  P KS MVDSGATHNFI+E EA RL L  EKD G+MKAVNS ALPI+G
Subjt:  QSCNETEVG-TIRVNGPKGTS------------------EKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVG

Query:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET
        + KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T  +P+VV   ++QP G++MISA+QLKKGLSR+EPTFMAIP+       
Subjt:  VSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVET

Query:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR
          VP EI  V+ +Y D+MPDSLPK+LPPR
Subjt:  RDVPPEIQVVMREYVDIMPDSLPKTLPPR

A0A6J1D906 Reverse transcriptase (e value: 0.0e+00; %identity: 89.81)
Query:  TTEQVARRPTGR---DRRTTLYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELN
        TT+Q+++    R        LYLREVPD LRLLEARVDEFSEKFGEIDAVNA +DGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELN
Subjt:  TTEQVARRPTGR---DRRTTLYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELN

Query:  NSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLA
        NSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLA
Subjt:  NSHSAMMQLFNEMTEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLA

Query:  TMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKL
        TMHLTDDAKLWWRSKVNDIQNGRCTIN+WDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFIEGLK WARTKL
Subjt:  TMHLTDDAKLWWRSKVNDIQNGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKL

Query:  YEQRVQDLATAMASAERLLDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTA
        YEQRVQDLATAMA+AERLLDY+SEPSHPKKNATN  GGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQR +S FLCKGPHRVAECPHRAALTA
Subjt:  YEQRVQDLATAMASAERLLDYSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTA

Query:  LQASVQSCNETEVGT-----------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAV
        LQASVQSCNE EV T                        RVNGPKGTSEKGLMFVDA INCN AKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK V
Subjt:  LQASVQSCNETEVGT-----------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAV

Query:  NSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAI
        N EALPIVGVSKRV LKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVT SIKQPGGIRMISALQLKKGL+REEPTFMAI
Subjt:  NSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAI

Query:  PMVEQPVETRDVPPEIQVVMREYVDIMPDSLPKTLPPR
        PMVEQPVETRDVPPEIQVVM+EYVDIMPDSLPKTLPPR
Subjt:  PMVEQPVETRDVPPEIQVVMREYVDIMPDSLPKTLPPR

A0A6J1DK29 uncharacterized protein LOC111021829 (e value: 0.0e+00; %identity: 90.94)
Query:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFKVT
        LYLREVPDSLRLLEARVDEFSEKFGEIDAVNA VDGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHS MMQLFNEMTEDFKVT
Subjt:  LYLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFKVT

Query:  IDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQ
        IDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVTLATMHLTDDAKLWWRSKVNDIQ
Subjt:  IDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQ

Query:  NGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERLLD
        NGRCTIN+WDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WARTKLYEQRVQDLATAMASAERLLD
Subjt:  NGRCTINNWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERLLD

Query:  YSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASVQSCNETEVGT-----
        YSSEPSHPKKNATN  GGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQRP SCFLC+GPH+VAECPHRAALTALQASVQSCNE EVGT     
Subjt:  YSSEPSHPKKNATNLIGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASVQSCNETEVGT-----

Query:  ------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGT
                           RVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGT
Subjt:  ------------------IRVNGPKGTSEKGLMFVDAAINCNPAKSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGT

Query:  WTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVETRDVPPEIQVVM
        WTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGL+REEPTFM      QPVETRDVPPEIQVVM
Subjt:  WTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVTASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVETRDVPPEIQVVM

Query:  REYVDIMPDSLPKTLPPR
        REYVDIMPDSLPKTLPPR
Subjt:  REYVDIMPDSLPKTLPPR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGATACGGCTCACTCTGTAAAGGTTACAGACGACTCGATCCAGGTTGTTCTCCTACGGTTGTTCCAAAACCTAGGATCGACGCATCCCGACTCGACCATTTGG
GAGTGCAAATTAATCAAAAGAAGCAAAAAGACGAAAAAAGCATCATGGGAGGCGCCAGGTGTTGTGGGCATGCTTGTCCAGGGTGTGGATTTTGGCCGAGCGGCT
TTGGAACTAGCTCCTCGACCCGAAAAAGTCCGGAACCATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTCTC
TACCTAAGAGAAGTCCCTGATTCCCTCCGTCTGCTGGAGGCGCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCTGTGTAGACGGG
TTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCAAAGCTACACGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAA
ATAGAAGTACGTATGGGAGAGTTAAACAACTCGCATTCGGCAATGATGCAATTGTTTAACGAAATGACAGAAGACTTCAAAGTGACCATCGACACCCTCCGAGCT
GAGATGACGGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAG
CCCAAACCATTCAATGGCAATAGAGACGCCAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACAGTACTTCAAGGCTACAGGGACAACGTCAGAAGAGATGAAA
GTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAACTGGGATGAT
CTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTCCGACACACTGGAACAATCCGGGACTACGTG
AAACAATTCTCTGCCGTAATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAAACCGTGGGCCCGAACCAAGCTGTAT
GAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCTCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAAAGAATGCTACAAACCTCATT
GGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGACCCTATCCACAAAGT
CAAAACGCTCAAAGGCCGATATCATGCTTCTTGTGCAAAGGTCCCCATCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGCATCCGTTCAG
TCGTGCAATGAAACTGAAGTTGGAACGATTAGGGTCAATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTGCGATCAATTGTAACCCTGCA
AAGAGCATCATGGTGGATTCGGGTGCAACCCACAACTTCATATCAGAACAGGAAGCCCACCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCC
GTCAACTCCGAAGCTTTACCCATTGTGGGAGTTTCTAAAAGAGTGACGTTGAAATTAGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGAC
TTCGACGTTGTACTGGGGATGGAATTCCTCATCGAACATAAAGTCATACCGATGCCTCTGGCCAAGTGTATGATTGTCACTAGTAATAGTCCCACTGTGGTCACA
GCTAGCATCAAACAACCTGGTGGCATAAGAATGATATCCGCGTTACAGTTGAAGAAAGGCCTCAGCCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAA
CAGCCGGTAGAGACAAGAGATGTCCCACCTGAAATTCAAGTTGTCATGAGAGAGTATGTTGACATAATGCCAGATAGTTTACCGAAGACCTTACCTCCTCGATGA
mRNA sequence
ATGGATACGGCTCACTCTGTAAAGGTTACAGACGACTCGATCCAGGTTGTTCTCCTACGGTTGTTCCAAAACCTAGGATCGACGCATCCCGACTCGACCATTTGG
GAGTGCAAATTAATCAAAAGAAGCAAAAAGACGAAAAAAGCATCATGGGAGGCGCCAGGTGTTGTGGGCATGCTTGTCCAGGGTGTGGATTTTGGCCGAGCGGCT
TTGGAACTAGCTCCTCGACCCGAAAAAGTCCGGAACCATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTCTC
TACCTAAGAGAAGTCCCTGATTCCCTCCGTCTGCTGGAGGCGCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCTGTGTAGACGGG
TTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCAAAGCTACACGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAA
ATAGAAGTACGTATGGGAGAGTTAAACAACTCGCATTCGGCAATGATGCAATTGTTTAACGAAATGACAGAAGACTTCAAAGTGACCATCGACACCCTCCGAGCT
GAGATGACGGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAG
CCCAAACCATTCAATGGCAATAGAGACGCCAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACAGTACTTCAAGGCTACAGGGACAACGTCAGAAGAGATGAAA
GTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAACTGGGATGAT
CTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTCCGACACACTGGAACAATCCGGGACTACGTG
AAACAATTCTCTGCCGTAATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAAACCGTGGGCCCGAACCAAGCTGTAT
GAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCTCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAAAGAATGCTACAAACCTCATT
GGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGACCCTATCCACAAAGT
CAAAACGCTCAAAGGCCGATATCATGCTTCTTGTGCAAAGGTCCCCATCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGCATCCGTTCAG
TCGTGCAATGAAACTGAAGTTGGAACGATTAGGGTCAATGGACCCAAAGGAACGTCCGAAAAGGGGCTTATGTTCGTAGATGCTGCGATCAATTGTAACCCTGCA
AAGAGCATCATGGTGGATTCGGGTGCAACCCACAACTTCATATCAGAACAGGAAGCCCACCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCC
GTCAACTCCGAAGCTTTACCCATTGTGGGAGTTTCTAAAAGAGTGACGTTGAAATTAGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGAC
TTCGACGTTGTACTGGGGATGGAATTCCTCATCGAACATAAAGTCATACCGATGCCTCTGGCCAAGTGTATGATTGTCACTAGTAATAGTCCCACTGTGGTCACA
GCTAGCATCAAACAACCTGGTGGCATAAGAATGATATCCGCGTTACAGTTGAAGAAAGGCCTCAGCCGGGAGGAACCTACCTTTATGGCCATTCCGATGGTCGAA
CAGCCGGTAGAGACAAGAGATGTCCCACCTGAAATTCAAGTTGTCATGAGAGAGTATGTTGACATAATGCCAGATAGTTTACCGAAGACCTTACCTCCTCGATGA
Protein sequence
MDTAHSVKVTDDSIQVVLLRLFQNLGSTHPDSTIWECKLIKRSKKTKKASWEAPGVVGMLVQGVDFGRAALELAPRPEKVRNHVDDKTTEQVARRPTGRDRRTTL
YLREVPDSLRLLEARVDEFSEKFGEIDAVNACVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQLFNEMTEDFKVTIDTLRA
EMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQNGRCTINNWDD
LKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNLI
GGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRPISCFLCKGPHRVAECPHRAALTALQASVQSCNETEVGTIRVNGPKGTSEKGLMFVDAAINCNPA
KSIMVDSGATHNFISEQEAHRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNSPTVVT
ASIKQPGGIRMISALQLKKGLSREEPTFMAIPMVEQPVETRDVPPEIQVVMREYVDIMPDSLPKTLPPR
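
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch and not part of the database record: the `translate` helper and the variable names are hypothetical, and it assumes the standard nuclear code (NCBI translation table 1) with an unambiguous A/C/G/T alphabet. Only the first 24 and last 9 nucleotides of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N; trailing bases that do
    not fill a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGATACGGCTCACTCTGTAAAG"  # first 24 nt of the CDS above
cds_end = "CCTCGATGA"                   # last 9 nt of the CDS above
print(translate(cds_start))  # MDTAHSVK
print(translate(cds_end))    # PR
```

Both results agree with the start (`MDTAHSVK...`) and end (`...PPR`) of the listed protein sequence. The full CDS is 2100 nt, that is 699 codons plus a TGA stop codon, which matches the 699-residue protein.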