; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008417 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008417
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:20996756..20998353
RNA-Seq ExpressionLag0008417
SyntenyLag0008417
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU37351.1 hypothetical protein TSUD_395330 [Trifolium subterraneum]2.8e-9742.99Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFP-----------------------FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLSPLSNSQP
        HKG+KC D++GR YIS+ V+F+E+ +P                       FP  +  ++ + P+ VSQQ  + PS         +  ++P+  P S+S P
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFP-----------------------FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLSPLSNSQP

Query:  ISRTSDTTSPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKM
        +       S   ++ + S  +SS          P   P  + + +NH MITR K+G  KPK+FL    + EP   + AL    W KAMQ EY A + NK 
Subjt:  ISRTSDTTSPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKM

Query:  WELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPT
        W LVP+P + K +GCKW+F++K N D +++ YKARLV K F Q+A  D+ ETFSPV+KP+TIR++L+L +++ W I+QID+NNAFL+GVL E +YM QP+
Subjt:  WELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPT

Query:  GFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSY
        GF+      LVCKLHK+LYGLKQAPRAW+D+L   L  +GFV SR D SLL       C  +L+YM+DI+I+G++  ++  LI KLN +F+LK LG + Y
Subjt:  GFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSY

Query:  FLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI
        FL +EV + PSG L L+Q KYI DLL + HM ++K + +PM+
Subjt:  FLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI

PNX92906.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]6.3e-9746.14Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFPFPN---SSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL-----LSPLSNSQPISRTSDTTSPSHLSP
        HKG+KC D+ GR Y+S+ V+F+ET FP+ +   +SS S +N+ +  +        + +D       T+ PL      SPL+ +Q  S  S   +PSH   
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFPFPN---SSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL-----LSPLSNSQPISRTSDTTSPSHLSP

Query:  VPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGC
            T S H+T    + +P  QP H  S +NHPM+TR K+G  KPKV   +    EP + + AL    W KAMQ EY A + N  W LV +P + K +GC
Subjt:  VPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGC

Query:  KWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLH
        KW+F++K N D +++ YKARLVAK F Q+   D+ ETFSPV+KP TIR++L+L ++Y W ++QID+NNAFL+G+L E +YM QP+GF+V     LVCKLH
Subjt:  KWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLH

Query:  KALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLF
        K+LYGLKQAPRAW++RL   L  +GFV S+ D SLL       C  +L+Y++DI+I+G++  ++  LI KLN +FSLK LG + YFL IEV + PSG L 
Subjt:  KALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLF

Query:  LSQAKYIMDLLQRTHMAEAKAISTPMI
        L+Q+KYI DLL RT M   KAI +PM+
Subjt:  LSQAKYIMDLLQRTHMAEAKAISTPMI

PNX94499.1 retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]8.2e-9746.12Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLS---PLSNSQPISRTSDTTSPSHLS-PVPSA
        HKG+KC D  GR Y+S+ V+F+E+ FP+ +    S TN             ++      ++L TNSP  S   P + SQ  S  + T   S +S P+ S+
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLS---PLSNSQPISRTSDTTSPSHLS-PVPSA

Query:  TISSH-ETCPTQEAEPDTQPTHVDSISN-HPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKW
          +SH    P+ E +P   PT   S  N HPM+TR K+G  KPK F+      EP + K AL   +W +AMQ EY A + N  W LVP+P + K +GCKW
Subjt:  TISSH-ETCPTQEAEPDTQPTHVDSISN-HPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKW

Query:  VFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKA
        +F+IK N D +++ YKARLVAK F Q+   D+ ETFSPV+KP+TIR++L+L ++Y W ++QID+NNAFL+GVL E +YM QP GF+      LVCKLHK+
Subjt:  VFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKA

Query:  LYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLS
        LYGLKQAPRAW++RL   L  LGFV S+ D SLL       C  +L+Y++DI+I+G++  ++  +I KLN +F+LK LG + YFL IEV + PSG L L+
Subjt:  LYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLS

Query:  QAKYIMDLLQRTHMAEAKAISTPMI
        Q+KYI DLL RT+M   K I +PM+
Subjt:  QAKYIMDLLQRTHMAEAKAISTPMI

PNY02741.1 retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]1.3e-9746.85Show/hide
Query:  MHKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPS--YVNDHAFVSLPTN-----SPLLSPLSNSQPISRTS--DTTSPSHL
        +HKG+KC  S GR YIS+ V+F ET FPF + +S S T + S  S   S++P+  + N    VS+  +     SP+ S  +  QP+  TS   T +P  +
Subjt:  MHKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPS--YVNDHAFVSLPTN-----SPLLSPLSNSQPISRTS--DTTSPSHL

Query:  SPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIV
         P     I S     T    P ++P    ++++HPM+TR K+G  KPKVFL      EP N K+AL    W  AM+ EY A + N+ W LVP+P++ + +
Subjt:  SPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIV

Query:  GCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCK
        GCKWVF+IK N D S++ YKARLVAK + Q    DY ETFSPVVKP+TIR++L+L IS+ W I+QIDINNAFL+G+L E +YM QP GF +     LVC+
Subjt:  GCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCK

Query:  LHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGD
        LH+ALYGLKQAPRAW++RL   L   GF  S+ D SL       I   +LVY++DI+++G+S +++  LI+KLN +F+LK +G   YFL IEV + P+G+
Subjt:  LHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGD

Query:  LFLSQAKYIMDLLQRTHMAEAKAISTPMI
        + L+Q+KYI DLL R +MAEAK I+TPM+
Subjt:  LFLSQAKYIMDLLQRTHMAEAKAISTPMI

PNY10806.1 histone deacetylase [Trifolium pratense]9.4e-10148.39Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFP----FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL------LSPLSNS-----QPISRTSDTT
        HKGYKC    GR YIS+ V+F E  FP    FPN+S  + TN     S  +  +PS V+          SPL      +SP+ N+      PIS  S  +
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFP----FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL------LSPLSNS-----QPISRTSDTT

Query:  SPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPS
        +PS  SPV S +     + P ++ +P   PT     + HPM+TR K+G  KPKVFL      EP + K AL   +W KAM+ EY A + NK W LVP+P+
Subjt:  SPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPS

Query:  NHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSY
        + + +GCKWVF+IK N D SI+ YKARLVAK F Q    D+ ETFSPVVKP+TIR++L+L +S+ W I+QIDINNAFL+G+L E +YM QP GF  +G  
Subjt:  NHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSY

Query:  PLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSY
         LVCKLHKALYGLKQAPRAW+DRL   L S GF  S+ D SL       +   +LVY++DI+++G+S  ++  LIAKL+ +F+LK +G   YFL IEV Y
Subjt:  PLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSY

Query:  PPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI
         PSG++ L+Q+KYI DLL R +MA+AK I+TPM+
Subjt:  PPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI

TrEMBL top hitse value%identityAlignment
A0A2K3MQ67 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-9746.14Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFPFPN---SSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL-----LSPLSNSQPISRTSDTTSPSHLSP
        HKG+KC D+ GR Y+S+ V+F+ET FP+ +   +SS S +N+ +  +        + +D       T+ PL      SPL+ +Q  S  S   +PSH   
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFPFPN---SSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL-----LSPLSNSQPISRTSDTTSPSHLSP

Query:  VPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGC
            T S H+T    + +P  QP H  S +NHPM+TR K+G  KPKV   +    EP + + AL    W KAMQ EY A + N  W LV +P + K +GC
Subjt:  VPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGC

Query:  KWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLH
        KW+F++K N D +++ YKARLVAK F Q+   D+ ETFSPV+KP TIR++L+L ++Y W ++QID+NNAFL+G+L E +YM QP+GF+V     LVCKLH
Subjt:  KWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLH

Query:  KALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLF
        K+LYGLKQAPRAW++RL   L  +GFV S+ D SLL       C  +L+Y++DI+I+G++  ++  LI KLN +FSLK LG + YFL IEV + PSG L 
Subjt:  KALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLF

Query:  LSQAKYIMDLLQRTHMAEAKAISTPMI
        L+Q+KYI DLL RT M   KAI +PM+
Subjt:  LSQAKYIMDLLQRTHMAEAKAISTPMI

A0A2K3MUX8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)4.0e-9746.12Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLS---PLSNSQPISRTSDTTSPSHLS-PVPSA
        HKG+KC D  GR Y+S+ V+F+E+ FP+ +    S TN             ++      ++L TNSP  S   P + SQ  S  + T   S +S P+ S+
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLS---PLSNSQPISRTSDTTSPSHLS-PVPSA

Query:  TISSH-ETCPTQEAEPDTQPTHVDSISN-HPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKW
          +SH    P+ E +P   PT   S  N HPM+TR K+G  KPK F+      EP + K AL   +W +AMQ EY A + N  W LVP+P + K +GCKW
Subjt:  TISSH-ETCPTQEAEPDTQPTHVDSISN-HPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKW

Query:  VFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKA
        +F+IK N D +++ YKARLVAK F Q+   D+ ETFSPV+KP+TIR++L+L ++Y W ++QID+NNAFL+GVL E +YM QP GF+      LVCKLHK+
Subjt:  VFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKA

Query:  LYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLS
        LYGLKQAPRAW++RL   L  LGFV S+ D SLL       C  +L+Y++DI+I+G++  ++  +I KLN +F+LK LG + YFL IEV + PSG L L+
Subjt:  LYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLS

Query:  QAKYIMDLLQRTHMAEAKAISTPMI
        Q+KYI DLL RT+M   K I +PM+
Subjt:  QAKYIMDLLQRTHMAEAKAISTPMI

A0A2K3NI85 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)6.1e-9846.85Show/hide
Query:  MHKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPS--YVNDHAFVSLPTN-----SPLLSPLSNSQPISRTS--DTTSPSHL
        +HKG+KC  S GR YIS+ V+F ET FPF + +S S T + S  S   S++P+  + N    VS+  +     SP+ S  +  QP+  TS   T +P  +
Subjt:  MHKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPS--YVNDHAFVSLPTN-----SPLLSPLSNSQPISRTS--DTTSPSHL

Query:  SPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIV
         P     I S     T    P ++P    ++++HPM+TR K+G  KPKVFL      EP N K+AL    W  AM+ EY A + N+ W LVP+P++ + +
Subjt:  SPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIV

Query:  GCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCK
        GCKWVF+IK N D S++ YKARLVAK + Q    DY ETFSPVVKP+TIR++L+L IS+ W I+QIDINNAFL+G+L E +YM QP GF +     LVC+
Subjt:  GCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCK

Query:  LHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGD
        LH+ALYGLKQAPRAW++RL   L   GF  S+ D SL       I   +LVY++DI+++G+S +++  LI+KLN +F+LK +G   YFL IEV + P+G+
Subjt:  LHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGD

Query:  LFLSQAKYIMDLLQRTHMAEAKAISTPMI
        + L+Q+KYI DLL R +MAEAK I+TPM+
Subjt:  LFLSQAKYIMDLLQRTHMAEAKAISTPMI

A0A2K3P695 Histone deacetylase4.5e-10148.39Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFP----FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL------LSPLSNS-----QPISRTSDTT
        HKGYKC    GR YIS+ V+F E  FP    FPN+S  + TN     S  +  +PS V+          SPL      +SP+ N+      PIS  S  +
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFP----FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPL------LSPLSNS-----QPISRTSDTT

Query:  SPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPS
        +PS  SPV S +     + P ++ +P   PT     + HPM+TR K+G  KPKVFL      EP + K AL   +W KAM+ EY A + NK W LVP+P+
Subjt:  SPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPS

Query:  NHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSY
        + + +GCKWVF+IK N D SI+ YKARLVAK F Q    D+ ETFSPVVKP+TIR++L+L +S+ W I+QIDINNAFL+G+L E +YM QP GF  +G  
Subjt:  NHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSY

Query:  PLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSY
         LVCKLHKALYGLKQAPRAW+DRL   L S GF  S+ D SL       +   +LVY++DI+++G+S  ++  LIAKL+ +F+LK +G   YFL IEV Y
Subjt:  PLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSY

Query:  PPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI
         PSG++ L+Q+KYI DLL R +MA+AK I+TPM+
Subjt:  PPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI

A0A2Z6N0R7 Integrase catalytic domain-containing protein1.4e-9742.99Show/hide
Query:  HKGYKCPDSTGRTYISRHVLFYETCFP-----------------------FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLSPLSNSQP
        HKG+KC D++GR YIS+ V+F+E+ +P                       FP  +  ++ + P+ VSQQ  + PS         +  ++P+  P S+S P
Subjt:  HKGYKCPDSTGRTYISRHVLFYETCFP-----------------------FPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLSPLSNSQP

Query:  ISRTSDTTSPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKM
        +       S   ++ + S  +SS          P   P  + + +NH MITR K+G  KPK+FL    + EP   + AL    W KAMQ EY A + NK 
Subjt:  ISRTSDTTSPSHLSPVPSATISSHETCPTQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKM

Query:  WELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPT
        W LVP+P + K +GCKW+F++K N D +++ YKARLV K F Q+A  D+ ETFSPV+KP+TIR++L+L +++ W I+QID+NNAFL+GVL E +YM QP+
Subjt:  WELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPT

Query:  GFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSY
        GF+      LVCKLHK+LYGLKQAPRAW+D+L   L  +GFV SR D SLL       C  +L+YM+DI+I+G++  ++  LI KLN +F+LK LG + Y
Subjt:  GFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSY

Query:  FLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI
        FL +EV + PSG L L+Q KYI DLL + HM ++K + +PM+
Subjt:  FLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMI

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.2e-4634.85Show/hide
Query:  DSISNHPMITRSKSGIFKPKVFLTTYVDF-EPPNAKEALKC----SHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARL
        + +   P I+ ++      KV L  +  F + PN+ + ++     S W++A+  E +A   N  W +   P N  IV  +WVF +K N   +   YKARL
Subjt:  DSISNHPMITRSKSGIFKPKVFLTTYVDF-EPPNAKEALKC----SHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARL

Query:  VAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFL
        VA+ F Q   IDY ETF+PV +  + R +LSL I Y+  + Q+D+  AFL+G L E IYM  P G   +     VCKL+KA+YGLKQA R WF+     L
Subjt:  VAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFL

Query:  QSLGFVNSRADTSLLFQKSGNICCNI--LVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEA
        +   FVNS  D  +     GNI  NI  L+Y++D++I+   M  + +    L  +F + DL  + +F+ I +       ++LSQ+ Y+  +L + +M   
Subjt:  QSLGFVNSRADTSLLFQKSGNICCNI--LVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEA

Query:  KAISTPM
         A+STP+
Subjt:  KAISTPM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-5735.83Show/hide
Query:  GYKCPDSTGRTYI-SRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLSPLSNSQPISRTSDTTSPSHLSPVPSATISSH
        GY+  D   +  I SR V+F E+        S+ V N          I+P+      FV++P+ S        + P S  S T   S     P   I   
Subjt:  GYKCPDSTGRTYI-SRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLSPLSNSQPISRTSDTTSPSHLSPVPSATISSH

Query:  ETCP--TQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEAL---KCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVF
        E      +E E  TQ              R +S  +    ++    D EP + KE L   + +   KAMQ E ++  +N  ++LV +P   + + CKWVF
Subjt:  ETCP--TQEAEPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEAL---KCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVF

Query:  KIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALY
        K+K++ D  +  YKARLV K F Q   ID+ E FSPVVK  +IR +LSL  S D  + Q+D+  AFLHG L E IYMEQP GF+V G   +VCKL+K+LY
Subjt:  KIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALY

Query:  GLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQK-SGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIE-VSYPPSGDLFLS
        GLKQAPR W+ + + F++S  ++ + +D  + F++ S N    +L+Y++D++I G    ++  L   L+  F +KDLGP    L ++ V    S  L+LS
Subjt:  GLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQK-SGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIE-VSYPPSGDLFLS

Query:  QAKYIMDLLQRTHMAEAKAISTPMIGH
        Q KYI  +L+R +M  AK +STP+ GH
Subjt:  QAKYIMDLLQRTHMAEAKAISTPMIGH

P92520 Uncharacterized mitochondrial protein AtMg008208.7e-2551.61Show/hide
Query:  MITRSKSGIFK--PK--VFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQS
        M+TRSK+GI K  PK  + +TT +  EP +   ALK   W +AMQ E DA  +NK W LVP P N  I+GCKWVFK K +SD ++   KARLVAK FHQ 
Subjt:  MITRSKSGIFK--PK--VFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQS

Query:  ADIDYFETFSPVVKPITIRVLLSL
          I + ET+SPVV+  TIR +L++
Subjt:  ADIDYFETFSPVVKPITIRVLLSL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-7738.59Show/hide
Query:  TGRTYISRHVLFYETCFPFPN------------SSSKSVTNAPSIVSQQLSIVPS-YVNDHAFVSLPTNSPLLSPLSNSQPISRTSDTT-----------
        T R YISRHV F E CFPF N              S  V +  + +  +  ++P+   +D    + P +SP  +P  NSQ  S   D++           
Subjt:  TGRTYISRHVLFYETCFPFPN------------SSSKSVTNAPSIVSQQLSIVPS-YVNDHAFVSLPTNSPLLSPLSNSQPISRTSDTT-----------

Query:  -----------------------------------SPSHL-------------SPVPSATISSHETCPTQEAEPDTQPTHVDSISN---------HPMIT
                                           SPS L             SP P+ + SS  T PT  +     P  +  I N         H M T
Subjt:  -----------------------------------SPSHL-------------SPVPSATISSHETCPTQEAEPDTQPTHVDSISN---------HPMIT

Query:  RSKSGIFKP----KVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNH-KIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSAD
        R+K+GI KP     + ++   + EP  A +ALK   W+ AM  E +AQI N  W+LVP P +H  IVGC+W+F  K NSD S++ YKARLVAK ++Q   
Subjt:  RSKSGIFKP----KVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNH-KIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSAD

Query:  IDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRA
        +DY ETFSPV+K  +IR++L + +   WPIRQ+D+NNAFL G L++ +YM QP GF        VCKL KALYGLKQAPRAW+  L  +L ++GFVNS +
Subjt:  IDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRA

Query:  DTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM
        DTSL   + G     +LVY++DI+I+GN   ++ + +  L+ RFS+KD   L YFL IE    P+G L LSQ +YI+DLL RT+M  AK ++TPM
Subjt:  DTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-7337.4Show/hide
Query:  TGRTYISRHVLFYETCFPFPNSS----------SKSVTNAPSIVSQQLSIV----PSYVNDHAFVS-LPTNSPLLSPLSNSQPIS-----------RTSD
        TGR Y SRHV F E CFPF  ++          S S  N PS  +   + +    P  +  H   S  P +SP  SPL  +Q  S            +S+
Subjt:  TGRTYISRHVLFYETCFPFPNSS----------SKSVTNAPSIVSQQLSIV----PSYVNDHAFVS-LPTNSPLLSPLSNSQPIS-----------RTSD

Query:  TTSPSHLSPVPSA----TISSHETCP-----------------------------------TQEAEPDTQPTHVDS---------------------ISN
         T+PSH  P P+A    T +S+   P                                   T  +EP++  +   S                     ++ 
Subjt:  TTSPSHLSPVPSA----TISSHETCP-----------------------------------TQEAEPDTQPTHVDS---------------------ISN

Query:  HPMITRSKSGIFKP----KVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELV-PVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVF
        H M TR+K GI KP        +   + EP  A +A+K   W++AM  E +AQI N  W+LV P P +  IVGC+W+F  K NSD S++ YKARLVAK +
Subjt:  HPMITRSKSGIFKP----KVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELV-PVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVF

Query:  HQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGF
        +Q   +DY ETFSPV+K  +IR++L + +   WPIRQ+D+NNAFL G L++ +YM QP GF        VC+L KA+YGLKQAPRAW+  L  +L ++GF
Subjt:  HQSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGF

Query:  VNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM
        VNS +DTSL   + G     +LVY++DI+I+GN   ++   +  L+ RFS+K+   L YFL IE    P G L LSQ +Y +DLL RT+M  AK ++TPM
Subjt:  VNSRADTSLLFQKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.7e-5642.18Show/hide
Query:  EPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDI
        EP    EA +   W  AM  E  A      WE+  +P N K +GCKWV+KIK NSD +I  YKARLVAK + Q   ID+ ETFSPV K  +++++L++  
Subjt:  EPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQSADIDYFETFSPVVKPITIRVLLSLDI

Query:  SYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPL----VCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYM
         Y++ + Q+DI+NAFL+G L E IYM+ P G+       L    VC L K++YGLKQA R WF + ++ L   GFV S +D +   + +  +   +LVY+
Subjt:  SYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPL----VCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLFQKSGNICCNILVYM

Query:  NDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM
        +DIII  N+ A V  L ++L   F L+DLGPL YFL +E++   +G + + Q KY +DLL  T +   K  S PM
Subjt:  NDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM

ATMG00810.1 DNA/RNA polymerases superfamily protein4.6e-1345Show/hide
Query:  ILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM
        +L+Y++DI+++G+S  ++  LI +L+  FS+KDLGP+ YFL I++   PSG LFLSQ KY   +L    M + K +STP+
Subjt:  ILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPM

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.2e-2651.61Show/hide
Query:  MITRSKSGIFK--PK--VFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQS
        M+TRSK+GI K  PK  + +TT +  EP +   ALK   W +AMQ E DA  +NK W LVP P N  I+GCKWVFK K +SD ++   KARLVAK FHQ 
Subjt:  MITRSKSGIFK--PK--VFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFHQS

Query:  ADIDYFETFSPVVKPITIRVLLSL
          I + ET+SPVV+  TIR +L++
Subjt:  ADIDYFETFSPVVKPITIRVLLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAAGGGCTATAAATGTCCTGATTCTACTGGTCGTACATATATTTCTAGACATGTGCTTTTTTATGAGACTTGTTTTCCCTTTCCGAATTCTAGCTCTAAATCTGT
CACTAATGCTCCTTCAATTGTGTCTCAACAATTGTCTATTGTTCCCTCGTATGTTAATGATCATGCCTTTGTCTCTTTACCCACAAATAGTCCTTTACTTTCACCATTGT
CTAATTCTCAACCTATTTCTCGTACTTCAGATACTACTAGTCCTTCACATTTGTCTCCCGTTCCTTCGGCTACTATTTCCTCTCATGAAACTTGTCCAACTCAAGAAGCT
GAACCTGATACTCAACCTACTCATGTGGATTCTATTTCTAATCATCCAATGATTACTAGGAGTAAAAGTGGCATATTCAAACCAAAGGTATTTCTTACCACTTATGTTGA
TTTTGAACCACCCAATGCCAAGGAGGCTCTTAAATGTTCTCACTGGAAAAAGGCAATGCAGGTTGAATATGATGCACAAATACAGAATAAAATGTGGGAACTTGTTCCTG
TCCCTTCTAATCATAAAATAGTAGGTTGTAAGTGGGTGTTTAAAATTAAACGTAATTCAGATAGGTCTATTTCTCATTACAAGGCCAGATTAGTGGCAAAGGTGTTTCAT
CAATCAGCTGACATTGACTATTTTGAAACGTTTAGTCCTGTTGTTAAGCCCATAACGATTCGTGTGCTTCTTTCACTTGATATTTCTTATGATTGGCCAATTAGACAAAT
TGACATTAATAATGCCTTCTTGCATGGAGTGTTATCTGAAACTATTTATATGGAACAACCCACTGGTTTTCAAGTTCATGGTTCGTACCCACTTGTTTGCAAACTTCACA
AGGCTCTATACGGTCTAAAACAAGCTCCTCGTGCTTGGTTTGATAGACTTAACATGTTTCTTCAATCTCTTGGTTTTGTGAATTCTAGAGCTGATACTTCATTACTGTTT
CAAAAATCTGGAAATATATGTTGTAACATCCTTGTGTATATGAATGATATTATAATTTCGGGTAATTCAATGGCTGTTGTTACATCTCTAATTGCTAAGTTGAATGGTCG
GTTTTCTCTTAAAGATTTGGGTCCTCTTAGTTATTTTCTAGAAATTGAGGTGTCCTACCCACCATCAGGTGACCTATTTCTCTCCCAAGCAAAATATATCATGGATCTTT
TGCAGCGTACTCATATGGCAGAGGCTAAAGCTATATCAACTCCTATGATTGGGCATCAGACCCGAATGATCGAAAGTCAACGTCAGGTTTTTGTGTTTTCTTTGGATGCA
ATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAAGGGCTATAAATGTCCTGATTCTACTGGTCGTACATATATTTCTAGACATGTGCTTTTTTATGAGACTTGTTTTCCCTTTCCGAATTCTAGCTCTAAATCTGT
CACTAATGCTCCTTCAATTGTGTCTCAACAATTGTCTATTGTTCCCTCGTATGTTAATGATCATGCCTTTGTCTCTTTACCCACAAATAGTCCTTTACTTTCACCATTGT
CTAATTCTCAACCTATTTCTCGTACTTCAGATACTACTAGTCCTTCACATTTGTCTCCCGTTCCTTCGGCTACTATTTCCTCTCATGAAACTTGTCCAACTCAAGAAGCT
GAACCTGATACTCAACCTACTCATGTGGATTCTATTTCTAATCATCCAATGATTACTAGGAGTAAAAGTGGCATATTCAAACCAAAGGTATTTCTTACCACTTATGTTGA
TTTTGAACCACCCAATGCCAAGGAGGCTCTTAAATGTTCTCACTGGAAAAAGGCAATGCAGGTTGAATATGATGCACAAATACAGAATAAAATGTGGGAACTTGTTCCTG
TCCCTTCTAATCATAAAATAGTAGGTTGTAAGTGGGTGTTTAAAATTAAACGTAATTCAGATAGGTCTATTTCTCATTACAAGGCCAGATTAGTGGCAAAGGTGTTTCAT
CAATCAGCTGACATTGACTATTTTGAAACGTTTAGTCCTGTTGTTAAGCCCATAACGATTCGTGTGCTTCTTTCACTTGATATTTCTTATGATTGGCCAATTAGACAAAT
TGACATTAATAATGCCTTCTTGCATGGAGTGTTATCTGAAACTATTTATATGGAACAACCCACTGGTTTTCAAGTTCATGGTTCGTACCCACTTGTTTGCAAACTTCACA
AGGCTCTATACGGTCTAAAACAAGCTCCTCGTGCTTGGTTTGATAGACTTAACATGTTTCTTCAATCTCTTGGTTTTGTGAATTCTAGAGCTGATACTTCATTACTGTTT
CAAAAATCTGGAAATATATGTTGTAACATCCTTGTGTATATGAATGATATTATAATTTCGGGTAATTCAATGGCTGTTGTTACATCTCTAATTGCTAAGTTGAATGGTCG
GTTTTCTCTTAAAGATTTGGGTCCTCTTAGTTATTTTCTAGAAATTGAGGTGTCCTACCCACCATCAGGTGACCTATTTCTCTCCCAAGCAAAATATATCATGGATCTTT
TGCAGCGTACTCATATGGCAGAGGCTAAAGCTATATCAACTCCTATGATTGGGCATCAGACCCGAATGATCGAAAGTCAACGTCAGGTTTTTGTGTTTTCTTTGGATGCA
ATTTGA
Protein sequenceShow/hide protein sequence
MHKGYKCPDSTGRTYISRHVLFYETCFPFPNSSSKSVTNAPSIVSQQLSIVPSYVNDHAFVSLPTNSPLLSPLSNSQPISRTSDTTSPSHLSPVPSATISSHETCPTQEA
EPDTQPTHVDSISNHPMITRSKSGIFKPKVFLTTYVDFEPPNAKEALKCSHWKKAMQVEYDAQIQNKMWELVPVPSNHKIVGCKWVFKIKRNSDRSISHYKARLVAKVFH
QSADIDYFETFSPVVKPITIRVLLSLDISYDWPIRQIDINNAFLHGVLSETIYMEQPTGFQVHGSYPLVCKLHKALYGLKQAPRAWFDRLNMFLQSLGFVNSRADTSLLF
QKSGNICCNILVYMNDIIISGNSMAVVTSLIAKLNGRFSLKDLGPLSYFLEIEVSYPPSGDLFLSQAKYIMDLLQRTHMAEAKAISTPMIGHQTRMIESQRQVFVFSLDA
I