; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy09g014680 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy09g014680
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionRetrotransposon protein
Genome locationChr09:18468657..18474663
RNA-Seq ExpressionLcy09g014680
SyntenyLcy09g014680
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR009027 - Ribosomal protein L9/RNase H1, N-terminal
IPR011320 - Ribonuclease H1, N-terminal
IPR024752 - Myb/SANT-like domain
IPR027806 - Harbinger transposase-derived nuclease domain
IPR037056 - Ribonuclease H1, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]3.9e-16463.37Show/hide
Query:  LIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITAS
        +IHESDL CR+ST+MDRR FAILC LLR  +GL  TEIVDVEEMVAM LH++AHDVKNRVI+++F RSGETVSRHFN  L AVLRLY  L+K+P P+T++
Subjt:  LIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITAS

Query:  CQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEG
        C D RWK FENCLGALDGTY+KV+V A DRP +RTRKGEI+TNVLGV   KG+F++V+ GWEGSAADSR+LRDAIS+ NGL VPKGYYYLCD GY NAEG
Subjt:  CQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEG

Query:  FLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLLHNLITREMDLDVGLDEGDVGRSK-P
        FLAPY+G+RYHL EWRG  N PT  +E+FNMKHSSARNVIERAFG LKGRW ILRGKSYYP + QCR I AC LLHNLI REM     +++ D G S   
Subjt:  FLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLLHNLITREMDLDVGLDEGDVGRSK-P

Query:  VPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVI
             E+I +I+++ EW Q RDDLA  MF +    G D                   +ELV  GGW+ DNGTFRPGYL +L +M+ +K+  C + +T+VI
Subjt:  VPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVI

Query:  DCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWV
        DC++++LKR + AI+EMLG  CSGFGWNDE KCI AEKE++D WV
Subjt:  DCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWV

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]2.2e-16258.5Show/hide
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI
        HEL S+++    SQRQL  ++    N+  RI +     RH+IRQLA FR+IH                         T +GL  TE+VDVEEMVAM LHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI

Query:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK
        +AHDVK+RVI+R+F RSGET+SRHFN  L AV+RL+  LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+DR RYRTRKGE++TNVLGV   K
Subjt:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK

Query:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW
        G+F++V+ GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GY NAEGFLAPYRG+RYHL EWRG  N P+T +EFFNMKH SARNVIERAFG LKGRW
Subjt:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW

Query:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEE
         ILRGKSYYP   QCR I ACCLLHNLI REM   D++  +DE D   S       ++I +I++S EW Q RD+LA      +M  + +  KH WT++EE
Subjt:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEE

Query:  ARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW
        A LVE LVELV+ GGWR DNGTFRPGYL +L +M+  K+    I + S ID +++ +KR + A++EM G  CSGFGWNDE KCI AEKEV+D W
Subjt:  ARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]3.8e-16760.16Show/hide
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI
        HEL S+++    SQRQL  ++    N+  RI +     RH+IRQLA FR+IH SDL CR+ST+MDRRCFAILC LLRT +GL  TE+VDVEEMVAM LHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI

Query:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK
        +AHDVKNRVI+R+F RSGET+SRHFN  L AV+RL++ LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+DR RYRTRKGE++TNVLGV   K
Subjt:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK

Query:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW
        G+F++V+ GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GY NAEGFLAPYRG+RYHL EWRG  N P+T +EFFNMKHSSARNVIERAFG LKGRW
Subjt:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW

Query:  VILRGKSYYPARTQCRIITACCLLHNLITREM-DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEEAR
         ILRGKSY+P   QC  I ACCLLHNLI REM + D+                 +NI  + SS+                      +  KH WT++EEA 
Subjt:  VILRGKSYYPARTQCRIITACCLLHNLITREM-DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEEAR

Query:  LVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW
            LVELV+ GGWR DNGTFRPGYL +L +M+  K+  C I + S ID +++ +KR + A++EM G  CSGFGWNDE KCI AEKEV+D W
Subjt:  LVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW

KAA0036474.1 retrotransposon protein [Cucumis melo var. makuwa]2.1e-16561.36Show/hide
Query:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRH
        MNN+ R+ +  P  RH+IR+LA FR+IHESDL CR+ST+MDRR FAILC LLR  +GL  TEIVDVEEMVAM LHI AHDVKNRVI+R+F RSGETVSRH
Subjt:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRH

Query:  FNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAI
        FN  L AVLRLY  L+K+P P+T++C D RWK FENCLGALDGTY+KV+V A DRP +RTRKGEI+TNVLGV   KG+F++V+ GW+GSAADSR+LRDAI
Subjt:  FNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAI

Query:  SRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLL
        SR NGL VPKGYYYLCD GY NAEGFLAPYRG+RYHL EWRG  N PT  +E+FNMKHSSARNVIERAFG LKGRW ILRGKS          +T C   
Subjt:  SRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLL

Query:  HNLITREMDLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPG
                  D   DE +   +       E+I +I+++ EW Q RDDLA  MF +  M+ +++  +H+WTR+EE  LVE L+ELV  GGW+ DNGTFR G
Subjt:  HNLITREMDLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPG

Query:  YLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWVNL
        YL +L +M+ +K+  C + +T+VIDC++++LKR + AI+EM G  CSGFGWNDE KCI AEKE++D WV +
Subjt:  YLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWVNL

KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa]1.0e-14366.05Show/hide
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI
        HEL S+++    SQRQL  ++    N+  RI +     RH+IRQLA FR+IH SDL CR+ST+MDRRCFAILC LLRT +GL  TE+VDVEEMVAM LHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI

Query:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK
        +AHDVKNRVI+R+F RSGET+SRHFN  L AV+RL++ LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+DR RYRTRKGE++TNVLGV   K
Subjt:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK

Query:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW
        G+F++V+ GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GY NAEGFLAPYRG+RYHL EWRG  N P+T +EFFNMKHSSARNVIERAFG LKGRW
Subjt:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW

Query:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF
         ILRGKSYYP   QCR I ACCLLHNLI REM   D++  +DE D   S       ++I +I++S EW Q RD+LA  MF
Subjt:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF

TrEMBL top hitse value%identityAlignment
A0A5A7SWD8 Retrotransposon protein1.8e-16760.16Show/hide
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI
        HEL S+++    SQRQL  ++    N+  RI +     RH+IRQLA FR+IH SDL CR+ST+MDRRCFAILC LLRT +GL  TE+VDVEEMVAM LHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI

Query:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK
        +AHDVKNRVI+R+F RSGET+SRHFN  L AV+RL++ LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+DR RYRTRKGE++TNVLGV   K
Subjt:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK

Query:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW
        G+F++V+ GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GY NAEGFLAPYRG+RYHL EWRG  N P+T +EFFNMKHSSARNVIERAFG LKGRW
Subjt:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW

Query:  VILRGKSYYPARTQCRIITACCLLHNLITREM-DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEEAR
         ILRGKSY+P   QC  I ACCLLHNLI REM + D+                 +NI  + SS+                      +  KH WT++EEA 
Subjt:  VILRGKSYYPARTQCRIITACCLLHNLITREM-DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEEAR

Query:  LVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW
            LVELV+ GGWR DNGTFRPGYL +L +M+  K+  C I + S ID +++ +KR + A++EM G  CSGFGWNDE KCI AEKEV+D W
Subjt:  LVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW

A0A5A7SYW1 Retrotransposon protein1.0e-16561.36Show/hide
Query:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRH
        MNN+ R+ +  P  RH+IR+LA FR+IHESDL CR+ST+MDRR FAILC LLR  +GL  TEIVDVEEMVAM LHI AHDVKNRVI+R+F RSGETVSRH
Subjt:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRH

Query:  FNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAI
        FN  L AVLRLY  L+K+P P+T++C D RWK FENCLGALDGTY+KV+V A DRP +RTRKGEI+TNVLGV   KG+F++V+ GW+GSAADSR+LRDAI
Subjt:  FNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAI

Query:  SRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLL
        SR NGL VPKGYYYLCD GY NAEGFLAPYRG+RYHL EWRG  N PT  +E+FNMKHSSARNVIERAFG LKGRW ILRGKS          +T C   
Subjt:  SRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLL

Query:  HNLITREMDLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPG
                  D   DE +   +       E+I +I+++ EW Q RDDLA  MF +  M+ +++  +H+WTR+EE  LVE L+ELV  GGW+ DNGTFR G
Subjt:  HNLITREMDLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPG

Query:  YLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWVNL
        YL +L +M+ +K+  C + +T+VIDC++++LKR + AI+EM G  CSGFGWNDE KCI AEKE++D WV +
Subjt:  YLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWVNL

A0A5A7TXW1 Retrotransposon protein4.9e-14466.05Show/hide
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI
        HEL S+++    SQRQL  ++    N+  RI +     RH+IRQLA FR+IH SDL CR+ST+MDRRCFAILC LLRT +GL  TE+VDVEEMVAM LHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI

Query:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK
        +AHDVKNRVI+R+F RSGET+SRHFN  L AV+RL++ LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+DR RYRTRKGE++TNVLGV   K
Subjt:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK

Query:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW
        G+F++V+ GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GY NAEGFLAPYRG+RYHL EWRG  N P+T +EFFNMKHSSARNVIERAFG LKGRW
Subjt:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW

Query:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF
         ILRGKSYYP   QCR I ACCLLHNLI REM   D++  +DE D   S       ++I +I++S EW Q RD+LA  MF
Subjt:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMF

E5GBB2 Retrotransposon protein1.9e-16463.37Show/hide
Query:  LIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITAS
        +IHESDL CR+ST+MDRR FAILC LLR  +GL  TEIVDVEEMVAM LH++AHDVKNRVI+++F RSGETVSRHFN  L AVLRLY  L+K+P P+T++
Subjt:  LIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITAS

Query:  CQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEG
        C D RWK FENCLGALDGTY+KV+V A DRP +RTRKGEI+TNVLGV   KG+F++V+ GWEGSAADSR+LRDAIS+ NGL VPKGYYYLCD GY NAEG
Subjt:  CQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEG

Query:  FLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLLHNLITREMDLDVGLDEGDVGRSK-P
        FLAPY+G+RYHL EWRG  N PT  +E+FNMKHSSARNVIERAFG LKGRW ILRGKSYYP + QCR I AC LLHNLI REM     +++ D G S   
Subjt:  FLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLLHNLITREMDLDVGLDEGDVGRSK-P

Query:  VPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVI
             E+I +I+++ EW Q RDDLA  MF +    G D                   +ELV  GGW+ DNGTFRPGYL +L +M+ +K+  C + +T+VI
Subjt:  VPLDGENITFIQSSTEWMQKRDDLANRMF-NALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVI

Query:  DCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWV
        DC++++LKR + AI+EMLG  CSGFGWNDE KCI AEKE++D WV
Subjt:  DCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAWV

E5GCB5 Retrotransposon protein1.0e-16258.5Show/hide
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI
        HEL S+++    SQRQL  ++    N+  RI +     RH+IRQLA FR+IH                         T +GL  TE+VDVEEMVAM LHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHI

Query:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK
        +AHDVK+RVI+R+F RSGET+SRHFN  L AV+RL+  LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+DR RYRTRKGE++TNVLGV   K
Subjt:  IAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPK

Query:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW
        G+F++V+ GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GY NAEGFLAPYRG+RYHL EWRG  N P+T +EFFNMKH SARNVIERAFG LKGRW
Subjt:  GEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRW

Query:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEE
         ILRGKSYYP   QCR I ACCLLHNLI REM   D++  +DE D   S       ++I +I++S EW Q RD+LA      +M  + +  KH WT++EE
Subjt:  VILRGKSYYPARTQCRIITACCLLHNLITREM---DLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLANRMFNALMAGADKQTKHIWTRQEE

Query:  ARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW
        A LVE LVELV+ GGWR DNGTFRPGYL +L +M+  K+    I + S ID +++ +KR + A++EM G  CSGFGWNDE KCI AEKEV+D W
Subjt:  ARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFKCIQAEKEVYDAW

SwissProt top hitse value%identityAlignment
Q07762 Ribonuclease H1.5e-0452.5Show/hide
Query:  FYVVFVGRNPGIYTSWVECHRQANQFKGALHKSYPTFQEA
        FYVV VGR  GIY++W +C  Q   F GA++KS+ T  EA
Subjt:  FYVVFVGRNPGIYTSWVECHRQANQFKGALHKSYPTFQEA

Q17QR8 Putative nuclease HARBI12.3e-1030.56Show/hide
Query:  LGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVL-RDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYH
        +G +D  +V +     +   Y  RKG  S N L V   +G  + V   W GS  D  VL + ++S      + K  + L D  +     FL  +     H
Subjt:  LGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVL-RDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYH

Query:  LTEWRGRGNPPTTPREF-FNMKHSSARNVIERAFGALKGRWVIL---RGKSYYPARTQCRIITACCLLHNLITREMDLDV
        +         P TP E+ +NM HS+  +VIE+ F  L  R+  L   +G   Y       II ACC+LHN I+ E  +DV
Subjt:  LTEWRGRGNPPTTPREF-FNMKHSSARNVIERAFGALKGRWVIL---RGKSYYPARTQCRIITACCLLHNLITREMDLDV

Q96MB7 Putative nuclease HARBI11.1e-1030.56Show/hide
Query:  LGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVL-RDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYH
        +G +D  +V +     +   Y  RKG  S N L V   +G  + V   W GS  D  VL + ++S      + K  + L D  +     FL  +     H
Subjt:  LGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVL-RDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYH

Query:  LTEWRGRGNPPTTPREF-FNMKHSSARNVIERAFGALKGRWVIL---RGKSYYPARTQCRIITACCLLHNLITREMDLDV
        +         P TP E+ +NM HS+  +VIE+ F  L  R+  L   +G   Y       II ACC+LHN I+ E  +DV
Subjt:  LTEWRGRGNPPTTPREF-FNMKHSSARNVIERAFGALKGRWVIL---RGKSYYPARTQCRIITACCLLHNLITREMDLDV

Q9KEI9 Ribonuclease H2.2e-0856Show/hide
Query:  MGATKFYVVFVGRNPGIYTSWVECHRQANQFKGALHKSYPTFQEAEYAFR
        M  +K+YVV+ GR PGIYTSW  C  Q   + GA  KSYP+ +EAE AFR
Subjt:  MGATKFYVVFVGRNPGIYTSWVECHRQANQFKGALHKSYPTFQEAEYAFR

Q9M2U3 Protein ALP1-like7.5e-1726.62Show/hide
Query:  KMDRRCFAILCSLLRTTSGLVGTEIVD-------VEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSR---HFNATLNAVLRLYNVLLKKPEPITASCQD
        K+ R+ F  +CSL++           D       + + VA++L  +       VI   F  +  TVS+    F  ++      +     K + I +  + 
Subjt:  KMDRRCFAILCSLLRTTSGLVGTEIVD-------VEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSR---HFNATLNAVLRLYNVLLKKPEPITASCQD

Query:  GRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGE--ISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDA--------ISRPNGLIVPKG------Y
         +     NC GA+D T++ +++ A +        GE   S  +  VV P   F+ V+ GW GS  D  VL+++          R NG  +P         
Subjt:  GRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGE--ISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDA--------ISRPNGLIVPKG------Y

Query:  YYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQC-RIITACCLLHNLI
        Y + D G+      L PY+G+            P + P+  FN +HS A    + A   LK RW I+ G  + P R +  RII  CCLLHN+I
Subjt:  YYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQC-RIITACCLLHNLI

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein1.5e-3132.45Show/hide
Query:  FRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPE---
        +R + +    C +  +M   CF  LC++L+T   L  T  + +EE VAM L I  H+   R +  +F R+ ETV R F   L A   L    ++ P    
Subjt:  FRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPE---

Query:  ----PITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGL-IVPKGYYYL
            P         W +F   +GA+DGT+V V V    +  Y  R    S N++ +   K  F ++  G  GS  D+ VL+ A    +   + P   YYL
Subjt:  ----PITASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGL-IVPKGYYYL

Query:  CDPGYRNAEGFLAPYRGE-----RYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGR
         D GY N +G LAPYR       RYH++++   G  P    E FN  H+S R+VIER F   K +
Subjt:  CDPGYRNAEGFLAPYRGE-----RYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGR

AT5G28730.1 unknown protein4.2e-1530.52Show/hide
Query:  IHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRL--YNVLLKKPEPITA
        I+ +++ C+   +M    F  LC +L    GL  +  + ++E VA+ L I A +   R I  +F  + ET+ R F+  L A+ RL    +  +K E + A
Subjt:  IHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRL--YNVLLKKPEPITA

Query:  ---SCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAIS-RPNGLIVPKGYYYLCDPGY
             QD    W                      P      G  S NVL +      F +   G  GS  D+RVL  AIS  P   + P   YYL D GY
Subjt:  ---SCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAIS-RPNGLIVPKGYYYLCDPGY

Query:  RNAEGFLAPYRGE
         N  G+LAPYR E
Subjt:  RNAEGFLAPYRGE

AT5G28950.1 unknown protein4.2e-1545.68Show/hide
Query:  WKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISR-PNGLIVPK
        + +F++C+GA+D T++   VS    P +R RKG+IS N+L   +   EF++V+ GWEGSA DS+VL DA++R  N L VP+
Subjt:  WKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISR-PNGLIVPK

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)4.4e-2843.92Show/hide
Query:  FIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVI
        FI+V+ GWEGSA DSRVL DA+ +          +YL D G+ N   FLAP+RG RYHL E+ G+   P TP E FN++H S RNVIER FG  K R+ I
Subjt:  FIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVI

Query:  LRGKSYYPARTQCRIITACCLLHNLITREMDLD-------VGLDEGDV
         +    +  + Q  ++  C  LHN + +E   D       VG +EGDV
Subjt:  LRGKSYYPARTQCRIITACCLLHNLITREMDLD-------VGLDEGDV

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.3e-4831.88Show/hide
Query:  FRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPIT
        +++++  +  C E+ +MD+  F  LC LL+T   L  T  + +E  +A+ L II H+++ R ++  F  SGET+SRHFN  LNAV+ +     +      
Subjt:  FRLIHESDLCCRESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPIT

Query:  ASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNA
        +   +    +F++C+G +D  ++ V V   ++  +R   G ++ NVL   S    F +V+ GWEGSA+D +VL  A++R N L VP+G YY+ D  Y N 
Subjt:  ASCQDGRWKWFENCLGALDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNA

Query:  EGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLLHNLITREM--DLDVGLDE----G
         GF+APY G   +  E           +E FN +H      I R FGALK R+ IL     YP +TQ +++ A C LHN +  E   DL   + E     
Subjt:  EGFLAPYRGERYHLTEWRGRGNPPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLLHNLITREM--DLDVGLDE----G

Query:  DVGRSKPVPLDGENITFI--------QSSTEWMQKRDDLANRMFN
        + G  + V L+ E +  +        +   + ++ RD++A+ ++N
Subjt:  DVGRSKPVPLDGENITFI--------QSSTEWMQKRDDLANRMFN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCTTCAACTTGTCGTGGGTGTCGACGATATGAATCGGTATCCGCCTGGTAGATTGGGGCACGGTGAGGGGTGGGACGAAGGTTTGCTCCAAAAAACACCG
TTATATTCGAGGACTATGGGGGCAACAAAATTCTATGTCGTGTTTGTTGGTCGCAACCCAGGAATTTACACCTCCTGGGTTGAATGCCACAGACAAGCTAACCAG
TTTAAGGGAGCACTACACAAGTCATATCCAACATTTCAAGAAGCAGAGTATGCATTTAGGCATTACGTTGCGGACACCAGTGGCGCACCAAACCTCGTTGACCAA
CATGAAGTGCGCACCAAACCTTACTGTAGGGGCTATGGAACTGTAGGAAACTGTTACAGTATTAGTCAGATGTTTGTAGGGTTCCTGTTCGGAATAATTGCTGAA
GTTGTACAGTTCATAATGGATTCCATAACCCCACATGAACTTGTGTCTGTACTTTCAATAATGGCTGACTCTCAGCGCCAACTATTCAACCTGATTAACTCCTTC
ATGAACAACCACCCTAGGATAGAAAACCAAACTCCATACCTCAGACACCAAATAAGGCAGTTAGCCTGCTTCCGGTTGATTCATGAAAGTGACCTATGTTGTCGA
GAAAGCACCAAGATGGATAGGAGATGTTTTGCCATTCTATGTAGTCTGTTGAGAACGACGTCCGGGTTGGTAGGAACAGAAATCGTAGACGTGGAAGAGATGGTC
GCGATGTCCTTGCACATCATTGCTCATGATGTTAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCGGGCGAAACCGTTTCTCGGCATTTCAACGCGACTTTG
AATGCCGTACTACGATTGTACAATGTTCTACTTAAGAAACCTGAACCGATCACGGCTTCTTGCCAAGATGGGAGGTGGAAATGGTTTGAGAATTGTTTAGGTGCA
TTGGACGGTACGTACGTAAAGGTCCATGTTAGTGCAACTGATCGACCAAGGTATAGGACGCGGAAGGGTGAGATTTCAACAAATGTTTTGGGCGTTGTGTCGCCA
AAAGGTGAATTCATTTTTGTTATGCCGGGATGGGAAGGTTCGGCTGCTGATTCTCGTGTACTTAGAGATGCTATATCACGCCCCAATGGACTAATAGTGCCGAAG
GGTTACTACTACCTCTGTGATCCCGGGTACCGAAATGCAGAGGGTTTCTTGGCACCTTACAGAGGAGAGCGGTACCACTTAACCGAGTGGCGTGGGAGAGGCAAT
CCACCTACTACACCAAGAGAGTTCTTCAACATGAAGCATTCTTCTGCACGTAATGTTATCGAGAGAGCATTCGGTGCTCTGAAAGGACGGTGGGTGATACTTCGG
GGGAAGTCCTACTACCCTGCTCGGACCCAGTGTCGAATCATAACAGCGTGCTGTTTACTCCACAACCTTATCACCCGGGAGATGGATCTGGATGTTGGATTGGAT
GAAGGTGATGTTGGTCGATCTAAACCCGTACCTCTAGATGGCGAGAACATTACCTTCATTCAAAGCTCCACTGAATGGATGCAAAAGCGAGATGACCTAGCGAAC
AGGATGTTCAACGCACTAATGGCAGGTGCAGATAAACAAACGAAACACATCTGGACGAGGCAGGAGGAGGCAAGGTTGGTGGAATCCCTCGTGGAGCTCGTCCAC
GAAGGTGGATGGAGAGGGGACAACGGGACCTTCAGGCCCGGATACCTACCCCGACTGAAGCAGATGATAAAAGATAAAATGGTGACCTGCACCATAGAGTCAACG
TCCGTAATAGACTGCAAGGTACAGTCCTTGAAACGGCAATACAGTGCCATCTCGGAGATGCTGGGTCTGGGCTGCAGTGGATTCGGTTGGAATGATGAGTTTAAA
TGCATCCAGGCTGAGAAGGAGGTATATGATGCATGGGTGAATTTACGATCACTATTCATTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCTTCAACTTGTCGTGGGTGTCGACGATATGAATCGGTATCCGCCTGGTAGATTGGGGCACGGTGAGGGGTGGGACGAAGGTTTGCTCCAAAAAACACCG
TTATATTCGAGGACTATGGGGGCAACAAAATTCTATGTCGTGTTTGTTGGTCGCAACCCAGGAATTTACACCTCCTGGGTTGAATGCCACAGACAAGCTAACCAG
TTTAAGGGAGCACTACACAAGTCATATCCAACATTTCAAGAAGCAGAGTATGCATTTAGGCATTACGTTGCGGACACCAGTGGCGCACCAAACCTCGTTGACCAA
CATGAAGTGCGCACCAAACCTTACTGTAGGGGCTATGGAACTGTAGGAAACTGTTACAGTATTAGTCAGATGTTTGTAGGGTTCCTGTTCGGAATAATTGCTGAA
GTTGTACAGTTCATAATGGATTCCATAACCCCACATGAACTTGTGTCTGTACTTTCAATAATGGCTGACTCTCAGCGCCAACTATTCAACCTGATTAACTCCTTC
ATGAACAACCACCCTAGGATAGAAAACCAAACTCCATACCTCAGACACCAAATAAGGCAGTTAGCCTGCTTCCGGTTGATTCATGAAAGTGACCTATGTTGTCGA
GAAAGCACCAAGATGGATAGGAGATGTTTTGCCATTCTATGTAGTCTGTTGAGAACGACGTCCGGGTTGGTAGGAACAGAAATCGTAGACGTGGAAGAGATGGTC
GCGATGTCCTTGCACATCATTGCTCATGATGTTAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCGGGCGAAACCGTTTCTCGGCATTTCAACGCGACTTTG
AATGCCGTACTACGATTGTACAATGTTCTACTTAAGAAACCTGAACCGATCACGGCTTCTTGCCAAGATGGGAGGTGGAAATGGTTTGAGAATTGTTTAGGTGCA
TTGGACGGTACGTACGTAAAGGTCCATGTTAGTGCAACTGATCGACCAAGGTATAGGACGCGGAAGGGTGAGATTTCAACAAATGTTTTGGGCGTTGTGTCGCCA
AAAGGTGAATTCATTTTTGTTATGCCGGGATGGGAAGGTTCGGCTGCTGATTCTCGTGTACTTAGAGATGCTATATCACGCCCCAATGGACTAATAGTGCCGAAG
GGTTACTACTACCTCTGTGATCCCGGGTACCGAAATGCAGAGGGTTTCTTGGCACCTTACAGAGGAGAGCGGTACCACTTAACCGAGTGGCGTGGGAGAGGCAAT
CCACCTACTACACCAAGAGAGTTCTTCAACATGAAGCATTCTTCTGCACGTAATGTTATCGAGAGAGCATTCGGTGCTCTGAAAGGACGGTGGGTGATACTTCGG
GGGAAGTCCTACTACCCTGCTCGGACCCAGTGTCGAATCATAACAGCGTGCTGTTTACTCCACAACCTTATCACCCGGGAGATGGATCTGGATGTTGGATTGGAT
GAAGGTGATGTTGGTCGATCTAAACCCGTACCTCTAGATGGCGAGAACATTACCTTCATTCAAAGCTCCACTGAATGGATGCAAAAGCGAGATGACCTAGCGAAC
AGGATGTTCAACGCACTAATGGCAGGTGCAGATAAACAAACGAAACACATCTGGACGAGGCAGGAGGAGGCAAGGTTGGTGGAATCCCTCGTGGAGCTCGTCCAC
GAAGGTGGATGGAGAGGGGACAACGGGACCTTCAGGCCCGGATACCTACCCCGACTGAAGCAGATGATAAAAGATAAAATGGTGACCTGCACCATAGAGTCAACG
TCCGTAATAGACTGCAAGGTACAGTCCTTGAAACGGCAATACAGTGCCATCTCGGAGATGCTGGGTCTGGGCTGCAGTGGATTCGGTTGGAATGATGAGTTTAAA
TGCATCCAGGCTGAGAAGGAGGTATATGATGCATGGGTGAATTTACGATCACTATTCATTAATTGA
Protein sequenceShow/hide protein sequence
MPLQLVVGVDDMNRYPPGRLGHGEGWDEGLLQKTPLYSRTMGATKFYVVFVGRNPGIYTSWVECHRQANQFKGALHKSYPTFQEAEYAFRHYVADTSGAPNLVDQ
HEVRTKPYCRGYGTVGNCYSISQMFVGFLFGIIAEVVQFIMDSITPHELVSVLSIMADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCR
ESTKMDRRCFAILCSLLRTTSGLVGTEIVDVEEMVAMSLHIIAHDVKNRVIRRQFARSGETVSRHFNATLNAVLRLYNVLLKKPEPITASCQDGRWKWFENCLGA
LDGTYVKVHVSATDRPRYRTRKGEISTNVLGVVSPKGEFIFVMPGWEGSAADSRVLRDAISRPNGLIVPKGYYYLCDPGYRNAEGFLAPYRGERYHLTEWRGRGN
PPTTPREFFNMKHSSARNVIERAFGALKGRWVILRGKSYYPARTQCRIITACCLLHNLITREMDLDVGLDEGDVGRSKPVPLDGENITFIQSSTEWMQKRDDLAN
RMFNALMAGADKQTKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLPRLKQMIKDKMVTCTIESTSVIDCKVQSLKRQYSAISEMLGLGCSGFGWNDEFK
CIQAEKEVYDAWVNLRSLFIN