; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G21250 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G21250
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr4:19712159..19713369
RNA-Seq ExpressionCSPI04G21250
SyntenyCSPI04G21250
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PSR86206.1 Endonuclease [Actinidia chinensis var. chinensis]4.4e-14765.33Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELV+LP GSKPLG KWIFKRK+K DGSIDKYKAR V KGY+Q+EG DYFD YSPV+RITSIRM+IAI+AL   EIHQMDVKT FL G+LDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   NV++ +GF INEC+KCVYVK     YVI+CLY+DDM+I+GSN  +IK TK +L +KF+MKDM
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        G+ADVILG+KI+RT  GL LSQ+HYIDKIL K+ K +  +A+TPID +LHLSKN G+  +QLEYSRIIGSLMY+MSCTRPDIA+ VS+LSRYT NPG DH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP
        WKAI+R+L YL++T+++G+HYTR+PAVLEGY DANWIS+ KDSKSTSGY+F LGG AVSWKSSKQTCIARSTMESEFIALDKA EEA     F +  P
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP

PSS35063.1 Endonuclease [Actinidia chinensis var. chinensis]4.4e-14765.33Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELV+LP GSKPLG KWIFKRK+K DGSIDKYKAR V KGY+Q+EG DYFD YSPV+RITSIRM+IAI+AL   EIHQMDVKT FL G+LDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   NV++ +GF INEC+KCVYVK     YVI+CLY+DDM+I+GSN  +IK TK +L +KF+MKDM
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        G+ADVILG+KI+RT  GL LSQ+HYIDKIL K+ K +  +A+TPID +LHLSKN G+  +QLEYSRIIGSLMY+MSCTRPDIA+ VS+LSRYT NPG DH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP
        WKAI+R+L YL++T+++G+HYTR+PAVLEGY DANWIS+ KDSKSTSGY+F LGG AVSWKSSKQTCIARSTMESEFIALDKA EEA     F +  P
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP

RVW67960.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.2e-15066.83Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP G+K LGCKWIFK+K+K +GSIDKYKAR VAKGYKQ+EG DYFD YSPV++ITSIRMLIAI+A++  EIHQMDVKT FL GELDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M +                                   N M+ NGF INEC+KCVYVK+ +  YVIVCLYVDDMLIIGS+ N+IK TKQML+++F+MKD+
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        GVAD+ILGIK+S+T  GL+LSQSHYI+KIL+K+ K++I   KT +D +LHL KN G   +QLEYSRIIGSLMY+M+CT PDIA+ VSKLSR+T NPG +H
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP
        WK+I+RVLGYLK+T+NYG+H+TRYPAVLEGYSDANWIS++KDSKSTSGY+FTLGG AVSWKSSKQTCIARSTMESEFIA+DKAGEEA     F +  P
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP

RVW97088.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]9.7e-14766.08Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP G KPL  KWIFKRK+K DGSIDKYKAR V KGY+Q EG DYFD YSPVTRI SIRM++AI+AL   EIHQMDVKT FL G+LDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   NVM+ +GF INEC+KCVYVK  EH YVIVCLYVDDMLI+GS+  +I  TK ML ++F+MKDM
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        G+ADVILGIKI RT   L+LSQSHY+DKIL K+ K    +A+TP+D +LHLSKN G+S++Q+EYSRIIGSLMY+MSCTRPDIA+ VSKLSRYT NPG  H
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP
        W+ I+RVL YL+ T++YG+HYTRYPAVLEGYSDANWISN KDSKS SGY+FTLGG AVSWKSSKQT IARSTMESEFIALDK GEEA     F +  P
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP

WP_140189331.1 DDE-type integrase/transposase/recombinase [Xylella fastidiosa]6.7e-14865.58Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP GSKPLG KWIFKRK+K DGSIDKYKAR V KGY+Q+EG DYFD YSPV+RI SIRM+IAI+AL   EIHQMDVKT FL G+L+EEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                     M+ +GF INEC+KCVYVK+ +  YVI+CLYVDDMLI+GSN  +I+ TK ML +KF+MKDM
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        G+ADVILG+KI+RT  GL LSQ+HY+DKIL+K+ + +  IA+TP+D SLHLSKN G+ ++QLEYSRIIGSLMY+MSCTRPDIA+ VS+LSRYT NPGHDH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP
        WKAI+RVL YL++T++ G+HY RYPAVLEGY DANWIS+ KDSKSTSGY+FTLGG AV+WKSSKQTCIARSTMESEFIALDKA EEA     F +  P
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFP

TrEMBL top hitse value%identityAlignment
A0A2N9F5X3 Integrase catalytic domain-containing protein6.8e-15469.67Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP G KPLG KWIFKRK+K DGSIDKYKAR V KGYKQ+EG DYFD YSPVTRITSIRMLIAI+AL+  EIHQMDVKT FL GELDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   N MM NGF INEC+KCVYVK+    YVIVCLYVDDMLI+GSN +IIK TK+ML +KF+MKD+
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        GVADVILGIKI+RT  GLVLSQSHYI K+L+K+ +++    KTPID +LHL+KN G+ I+QLEYS+IIGSLMYIM+CTRPDIA+ VSKLSRYT NPG DH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI
        WKAI+RVL YLK+T NYG+HYTRYPAVLEGYSDANWIS+T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTMESEFIALDKAGEEA     F +  P+
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI

A0A2N9F8T0 Integrase catalytic domain-containing protein1.5e-15369.42Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP G KPLG KWIFKRK+K DGSIDKYKAR V KGYKQ+EG DYFD YSPVTRITSIRMLIAI+AL+  EIHQMDVKT FL GEL+EEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   N MM NGF INEC+KCVY+K+    YVIVCLYVDDMLI+GSN +IIK TK+ML +KF+MKD+
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        GVADVILGIKI+RT  GLVLSQSHYI K+L+K+ +++    KTPID +LHLSKN G+ I+QLEYS+IIGSLMYIM+CTRPDIA+ VSKLSRYT NPG DH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI
        WKAI+RVL YLK+T NYG+HYTRYPAVLEGYSDANWIS+T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTMESEFIALDKAGEEA     F +  P+
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI

A0A2N9GRE4 Integrase catalytic domain-containing protein8.9e-15469.42Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP G KPLG KWIFKRK+K DGSIDKYKAR V KGYKQ+EG DYFD YSPVTRITSIRMLIAI+AL+  EIHQMDVKT FL GELDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   N MM NGF INEC+KCVYVK+    YVIVCLYVDDMLI+GSN +IIK TK+ML +KF+MKD+
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        GVADVILGIKI+RT  GLVLSQSHYI K+L+K+ +++    KTP+D +LHL+KN G+ I+QLEYS+IIGSLMYIM+CTRPDIA+ VSKLSRYT NPG DH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI
        WKAI+RVL YLK+T NYG+HYTRYPAVLEGYSDANWIS+T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTMESEFIALDKAGEEA     F +  P+
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI

A0A2N9HDE2 Integrase catalytic domain-containing protein8.9e-15469.42Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP G KPLG KWIFKRK+K DGSIDKYKAR V KGYKQ+EG DYFD YSPVTRITSIRMLIAI+AL+  EIHQMDVKT FL GELDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   N MM NGF INEC+KCVYVK+    YVIVCLYVDDMLI+GSN +IIK TK+ML +KF+MKD+
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        GVADVILGIKI+RT  GLVLSQSHYI K+L+K+ +++    KTP+D +LHL+KN G+ I+QLEYS+IIGSLMYIM+CTRPDIA+ VSKLSRYT NPG DH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI
        WKAI+RVL YLK+T NYG+HYTRYPAVLEGYSDANWIS+T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTMESEFIALDKAGEEA     F +  P+
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI

A0A2N9HX08 Integrase catalytic domain-containing protein8.9e-15469.42Show/hide
Query:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY
        + NHTWELVDLP G KPLG KWIFKRK+K DGSIDKYKAR V KGYKQ+EG DYFD YSPVTRITSIRMLIAI+AL+  EIHQMDVKT FL GELDEEIY
Subjt:  MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIY

Query:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM
        M++                                   N MM NGF INEC+KCVYVK+    YVIVCLYVDDMLI+GSN +IIK TK+ML +KF+MKD+
Subjt:  MKR-----------------------------------NVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDM

Query:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
        GVADVILGIKI+RT  GLVLSQSHYI K+L+K+ +++    KTP+D +LHL+KN G+ I+QLEYS+IIGSLMYIM+CTRPDIA+ VSKLSRYT NPG DH
Subjt:  GVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI
        WKAI+RVL YLK+T NYG+HYTRYPAVLEGYSDANWIS+T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTMESEFIALDKAGEEA     F +  P+
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIFPI

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-5532.56Show/hide
Query:  NHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYMK
        N+TW +   P     +  +W+F  K    G+  +YKAR VA+G+ Q+   DY + ++PV RI+S R ++++   +  ++HQMDVKT FL G L EEIYM+
Subjt:  NHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYMK

Query:  ---------RNVMMVNGFI-----------------INECE-------KCVYV--KSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMGV
                  NV  +N  I                 + ECE       +C+Y+  K N ++ + V LYVDD++I   ++  +   K+ L  KF M D+  
Subjt:  ---------RNVMMVNGFI-----------------INECE-------KCVYV--KSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMGV

Query:  ADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDHWK
            +GI+I      + LSQS Y+ KIL K+         TP+   ++    N D         +IG LMYIM CTRPD+   V+ LSRY+     + W+
Subjt:  ADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDHWK

Query:  AILRVLGYLKHTKNYGIHYTR---YPAVLEGYSDANWISNTKDSKSTSGYIFTL-GGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAV
         + RVL YLK T +  + + +   +   + GY D++W  +  D KST+GY+F +     + W + +Q  +A S+ E+E++AL +A  EA+
Subjt:  AILRVLGYLKHTKNYGIHYTR---YPAVLEGYSDANWISNTKDSKSTSGYIFTL-GGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAV

P0CV72 Secreted RxLR effector protein 1617.2e-2847.62Show/hide
Query:  YSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDHWKAILRVLGYLKHTKNYGIHYTRY-PAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKS
        Y   +G++MY+M  TRPD+A  V  LS++  +P   HW+A+ RVL YL+ T+ YG+ +TR   A L GYSDA+W  + +  +STSGY+F L GG VSW+S
Subjt:  YSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDHWKAILRVLGYLKHTKNYGIHYTRY-PAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKS

Query:  SKQTCIARSTMESEFIALDKAGEEAV
         KQ  +A S+ E E++AL +A +EAV
Subjt:  SKQTCIARSTMESEFIALDKAGEEAV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-7838.19Show/hide
Query:  NHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYMK
        N T++LV+LP G +PL CKW+FK K   D  + +YKAR V KG++Q++G D+ +I+SPV ++TSIR +++++A    E+ Q+DVKT FL G+L+EEIYM+
Subjt:  NHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYMK

Query:  R-----------------------------------NVMMVNGFIINECEKCVYVKS-NEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMG
        +                                   + M    ++    + CVY K  +E++++I+ LYVDDMLI+G +  +I K K  L+  F+MKD+G
Subjt:  R-----------------------------------NVMMVNGFIINECEKCVYVKS-NEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMG

Query:  VADVILGIKI--SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSK-------NNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRY
         A  ILG+KI   RT + L LSQ  YI+++L+++         TP+ G L LSK           ++A++ YS  +GSLMY M CTRPDIA  V  +SR+
Subjt:  VADVILGIKI--SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSK-------NNGDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRY

Query:  TGNPGHDHWKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEI
          NPG +HW+A+  +L YL+ T    + +     +L+GY+DA+   +  + KS++GY+FT  GGA+SW+S  Q C+A ST E+E+IA  + G+E +  + 
Subjt:  TGNPGHDHWKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEI

Query:  FWKIFPIGLNQCVQYVYTE
        F  +  +GL+Q    VY +
Subjt:  FWKIFPIGLNQCVQYVYTE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.6e-5934.63Show/hide
Query:  NHTWELV-DLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYM
        NHTW+LV   PS    +GC+WIF +K  +DGS+++YKAR VAKGY Q+ G DY + +SPV + TSIR+++ ++    + I Q+DV   FL+G L +++YM
Subjt:  NHTWELV-DLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYM

Query:  K-----------------------------------RNVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMG
                                            RN ++  GF+ +  +  ++V       V + +YVDD+LI G++  ++  T   L+ +F +KD  
Subjt:  K-----------------------------------RNVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMG

Query:  VADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQ-LEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
             LGI+  R P GL LSQ  YI  +L +          TP+  S  LS  +G  +    EY  I+GSL Y ++ TRPDI++ V++LS++   P  +H
Subjt:  VADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQ-LEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEE
         +A+ R+L YL  T N+GI   +   + L  YSDA+W  +  D  ST+GYI  LG   +SW S KQ  + RS+ E+E+ ++     E
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-6134.88Show/hide
Query:  NHTWELVDLPSGSKPL-GCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYM
        NHTW+LV  P  S  + GC+WIF +K  +DGS+++YKAR VAKGY Q+ G DY + +SPV + TSIR+++ ++    + I Q+DV   FL+G L +E+YM
Subjt:  NHTWELVDLPSGSKPL-GCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYM

Query:  K-----------------------------------RNVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMG
                                            R  ++  GF+ +  +  ++V       + + +YVDD+LI G++  ++K T   L+ +F +K+  
Subjt:  K-----------------------------------RNVMMVNGFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMG

Query:  VADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQ-LEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH
             LGI+  R PQGL LSQ  Y   +L +          TP+  S  L+ ++G  +    EY  I+GSL Y ++ TRPD+++ V++LS+Y   P  DH
Subjt:  VADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQ-LEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDH

Query:  WKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEE
        W A+ RVL YL  T ++GI   +   + L  YSDA+W  +T D  ST+GYI  LG   +SW S KQ  + RS+ E+E+ ++     E
Subjt:  WKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.0e-6633.5Show/hide
Query:  HTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYMKR
        HTWE+  LP   KP+GCKW++K K  +DG+I++YKAR VAKGY QQEG D+ + +SPV ++TS+++++AISA++ F +HQ+D+   FL G+LDEEIYMK 
Subjt:  HTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYMKR

Query:  --------------------------------------NVMMVN-GFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKD
                                              +V ++  GF+ +  +   ++K     ++ V +YVDD++I  +N   + + K  L + F+++D
Subjt:  --------------------------------------NVMMVN-GFIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKD

Query:  MGVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNN-GDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGH
        +G     LG++I+R+  G+ + Q  Y   +L +        +  P+D S+  S ++ GD +    Y R+IG LMY +  TR DI+F V+KLS+++  P  
Subjt:  MGVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNN-GDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGH

Query:  DHWKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIF
         H +A++++L Y+K T   G+ Y+    + L+ +SDA++ S     +ST+GY   LG   +SWKS KQ  +++S+ E+E+ AL  A +E +    F++  
Subjt:  DHWKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMESEFIALDKAGEEAVHFEIFWKIF

Query:  PIGLNQ
         + L++
Subjt:  PIGLNQ

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.3e-0528.95Show/hide
Query:  MSCTRPDIAFVVSKLSRYTGNPGHDHWKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGY
        ++ TRPD+ F V++LS+++        +A+ +VL Y+K T   G+ Y+    + L+ ++D++W S     +S +G+
Subjt:  MSCTRPDIAFVVSKLSRYTGNPGHDHWKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGY

ATMG00810.1 DNA/RNA polymerases superfamily protein2.6e-2530.99Show/hide
Query:  LYVDDMLIIGSNINIIKKTKQMLANKFEMKDMGVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRII
        LYVDD+L+ GS+  ++      L++ F MKD+G     LGI+I   P GL LSQ+ Y ++IL      +     TP+   L+ S +        ++  I+
Subjt:  LYVDDMLIIGSNINIIKKTKQMLANKFEMKDMGVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRII

Query:  GSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDHWKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTC
        G+L Y ++ TRPDI++ V+ + +    P    +  + RVL Y+K T  +G++  +   + ++ + D++W   T   +ST+G+   LG   +SW + +Q  
Subjt:  GSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDHWKAILRVLGYLKHTKNYGIHYTRYPAV-LEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQTC

Query:  IARSTMESEFIAL
        ++RS+ E+E+ AL
Subjt:  IARSTMESEFIAL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.9e-1548.61Show/hide
Query:  NHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAIS
        N TW LV  P     LGCKW+FK KL +DG++D+ KAR VAKG+ Q+EG  + + YSPV R  +IR ++ ++
Subjt:  NHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAACCATACTTGGGAACTAGTTGATCTTCCATCAGGAAGTAAACCACTTGGTTGCAAGTGGATTTTCAAACGAAAATTGAAGACCGATGGGTCAATAGATAAATA
TAAGGCAAGACGTGTCGCTAAAGGTTACAAGCAACAAGAAGGATTTGACTATTTTGATATATACTCACCAGTTACTCGAATTACTTCCATTCGCATGCTCATAGCAATAT
CAGCTTTGCATGGATTTGAGATACATCAGATGGATGTCAAGACGACATTTTTAAAGGGTGAGTTAGATGAAGAGATCTACATGAAAAGAAATGTAATGATGGTCAATGGG
TTTATAATCAATGAATGTGAGAAATGTGTATATGTCAAAAGCAATGAGCATGACTATGTCATTGTGTGCTTATATGTTGATGATATGCTAATTATAGGTAGCAATATCAA
CATTATTAAGAAAACCAAACAAATGTTGGCCAATAAATTTGAGATGAAAGACATGGGTGTCGCAGATGTTATTCTAGGTATCAAAATTTCTAGAACCCCACAAGGGTTAG
TGCTATCACAATCACATTATATTGATAAAATATTGAAGAAATATACAAAACATGAAATTGTGATTGCAAAGACCCCAATTGATGGAAGTCTCCATTTAAGTAAAAACAAT
GGAGATAGTATAGCACAATTAGAATACTCTCGCATCATTGGTAGTTTGATGTACATCATGAGTTGTACACGTCCTGATATAGCGTTTGTGGTAAGCAAGTTAAGTCGCTA
TACAGGTAATCCAGGTCATGATCATTGGAAAGCTATCTTGAGAGTTTTGGGATACTTAAAGCATACTAAAAATTATGGAATACACTATACTCGATATCCTGCTGTACTTG
AAGGCTATAGTGATGCCAATTGGATATCGAACACTAAAGACTCCAAATCCACCAGTGGTTACATTTTTACCCTTGGAGGCGGTGCTGTTTCTTGGAAATCTTCCAAACAA
ACATGTATAGCACGATCCACAATGGAATCTGAATTTATAGCCTTAGACAAGGCTGGAGAAGAAGCAGTGCACTTCGAAATTTTTTGGAAGATATTCCCAATTGGTCTAAA
TCAGTGCGTTCAATATGTATACACAGAATATTATGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGCATAACCATACTTGGGAACTAGTTGATCTTCCATCAGGAAGTAAACCACTTGGTTGCAAGTGGATTTTCAAACGAAAATTGAAGACCGATGGGTCAATAGATAAATA
TAAGGCAAGACGTGTCGCTAAAGGTTACAAGCAACAAGAAGGATTTGACTATTTTGATATATACTCACCAGTTACTCGAATTACTTCCATTCGCATGCTCATAGCAATAT
CAGCTTTGCATGGATTTGAGATACATCAGATGGATGTCAAGACGACATTTTTAAAGGGTGAGTTAGATGAAGAGATCTACATGAAAAGAAATGTAATGATGGTCAATGGG
TTTATAATCAATGAATGTGAGAAATGTGTATATGTCAAAAGCAATGAGCATGACTATGTCATTGTGTGCTTATATGTTGATGATATGCTAATTATAGGTAGCAATATCAA
CATTATTAAGAAAACCAAACAAATGTTGGCCAATAAATTTGAGATGAAAGACATGGGTGTCGCAGATGTTATTCTAGGTATCAAAATTTCTAGAACCCCACAAGGGTTAG
TGCTATCACAATCACATTATATTGATAAAATATTGAAGAAATATACAAAACATGAAATTGTGATTGCAAAGACCCCAATTGATGGAAGTCTCCATTTAAGTAAAAACAAT
GGAGATAGTATAGCACAATTAGAATACTCTCGCATCATTGGTAGTTTGATGTACATCATGAGTTGTACACGTCCTGATATAGCGTTTGTGGTAAGCAAGTTAAGTCGCTA
TACAGGTAATCCAGGTCATGATCATTGGAAAGCTATCTTGAGAGTTTTGGGATACTTAAAGCATACTAAAAATTATGGAATACACTATACTCGATATCCTGCTGTACTTG
AAGGCTATAGTGATGCCAATTGGATATCGAACACTAAAGACTCCAAATCCACCAGTGGTTACATTTTTACCCTTGGAGGCGGTGCTGTTTCTTGGAAATCTTCCAAACAA
ACATGTATAGCACGATCCACAATGGAATCTGAATTTATAGCCTTAGACAAGGCTGGAGAAGAAGCAGTGCACTTCGAAATTTTTTGGAAGATATTCCCAATTGGTCTAAA
TCAGTGCGTTCAATATGTATACACAGAATATTATGTATAA
Protein sequenceShow/hide protein sequence
MHNHTWELVDLPSGSKPLGCKWIFKRKLKTDGSIDKYKARRVAKGYKQQEGFDYFDIYSPVTRITSIRMLIAISALHGFEIHQMDVKTTFLKGELDEEIYMKRNVMMVNG
FIINECEKCVYVKSNEHDYVIVCLYVDDMLIIGSNINIIKKTKQMLANKFEMKDMGVADVILGIKISRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNN
GDSIAQLEYSRIIGSLMYIMSCTRPDIAFVVSKLSRYTGNPGHDHWKAILRVLGYLKHTKNYGIHYTRYPAVLEGYSDANWISNTKDSKSTSGYIFTLGGGAVSWKSSKQ
TCIARSTMESEFIALDKAGEEAVHFEIFWKIFPIGLNQCVQYVYTEYYV