; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0242561 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0242561
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase
Genome locationCMiso1.1chr09:4853566..4854784
RNA-Seq ExpressionCmc09g0242561
SyntenyCmc09g0242561
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038926.1 integrase [Cucumis melo var. makuwa]3.6e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

KAA0048003.1 integrase [Cucumis melo var. makuwa]3.6e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

KAA0057291.1 integrase [Cucumis melo var. makuwa]3.6e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

KAA0060377.1 integrase [Cucumis melo var. makuwa]3.6e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

TYJ95504.1 integrase [Cucumis melo var. makuwa]3.6e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

TrEMBL top hitse value%identityAlignment
A0A5A7TWN2 Integrase1.7e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

A0A5A7UDP7 Integrase1.7e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

A0A5A7V0P6 Integrase1.7e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

A0A5D3CLV1 Integrase1.7e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

A0A5D3E3T2 Integrase1.7e-18180.65Show/hide
Query:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD
        K  F    L+         G  + G   K+ +++K+L  ++ +  AWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFT NDKFLCD
Subjt:  KICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCD

Query:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA
        DFKNSMK EFEMSDMGLIHYFL IEVN+NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGE V+PSLYRSLVGSLMYLTATRPDILFA
Subjt:  DFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFA

Query:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG
        +SMLSRFMTNPKRSHWEAGKR                    + +  FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVV+LSTTEAEYISLAAAG
Subjt:  ISMLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAG

Query:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
        CQALWLRWMLKELKC +KCETVLFCDNG AIALSKN +FHGRSKHIRIKYHFIRDLV+DGEVIVKYCKTQDQVA IFTKALKFDLFVKFRGKLGVAQV
Subjt:  CQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.0e-5030.83Show/hide
Query:  IVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYV-KEDKYGKFLIVSLYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNK
        + ++ K++  ++ ++  W+   +    +  F     +  +Y+  +    + + V LYVDD++    D    ++FK  + ++F M+D+  I +F+ I +  
Subjt:  IVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYV-KEDKYGKFLIVSLYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNK

Query:  NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMY-LTATRPDILFAISMLSRFMTNPKRSHWEAGKR------
         E +I +SQ  Y   +L KF MEN +  +TP+ + +     +  E  N    RSL+G LMY +  TRPD+  A+++LSR+ +      W+  KR      
Subjt:  NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMY-LTATRPDILFAISMLSRFMTNPKRSHWEAGKR------

Query:  ----------------NSVVCFCDSDWGGNVDDHKSTSGYVFSM-GSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCTKKCETVLF
                        N ++ + DSDW G+  D KST+GY+F M    +  W +K+Q+ V+ S+TEAEY++L  A  +ALWL+++L  +    +    ++
Subjt:  ----------------NSVVCFCDSDWGGNVDDHKSTSGYVFSM-GSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCTKKCETVLF

Query:  CDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQ
         DN   I+++ N   H R+KHI IKYHF R+ V++  + ++Y  T++Q+A IFTK L    FV+ R KLG+ Q
Subjt:  CDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-6136.29Show/hide
Query:  GRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEI
        G++  + ++ KSL  ++ +   WY + DSF     + +   +  +Y K      F+I+ LYVDD+L    DK L    K  + K F+M D+G     L +
Subjt:  GRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEI

Query:  EV--NKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKL----CKDDIGEVVNPSL--YRSLVGSLMY-LTATRPDILFAISMLSRFMTNPKRSH
        ++   +   ++ +SQ+KY   +L++F M+NA P +TP+  +LKL    C   + E  N +   Y S VGSLMY +  TRPDI  A+ ++SRF+ NP + H
Subjt:  EV--NKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKL----CKDDIGEVVNPSL--YRSLVGSLMY-LTATRPDILFAISMLSRFMTNPKRSH

Query:  WEA---------GKRNSVVCF----------CDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCT
        WEA         G     +CF           D+D  G++D+ KS++GY+F+   G  SW SK Q  V+LSTTEAEYI+    G + +WL+  L+EL   
Subjt:  WEA---------GKRNSVVCF----------CDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCT

Query:  KKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKAL---KFDL
        +K E V++CD+  AI LSKNS++H R+KHI ++YH+IR++V+D  + V    T +  A + TK +   KF+L
Subjt:  KKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKAL---KFDL

P92519 Uncharacterized mitochondrial protein AtMg008101.7e-2934.23Show/hide
Query:  LYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSL
        LYVDD+L T +   L +     +   F M D+G +HYFL I++  +   + +SQ KYA  +L    M +  P +TP+   L        +  +PS +RS+
Subjt:  LYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSL

Query:  VGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEAGKR-----------------NS---VVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSV
        VG+L YLT TRPDI +A++++ + M  P  + ++  KR                 NS   V  FCDSDW G     +ST+G+   +G  + SW++K+Q  
Subjt:  VGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEAGKR-----------------NS---VVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSV

Query:  VSLSTTEAEYISLAAAGCQALW
        VS S+TE EY +LA    +  W
Subjt:  VSLSTTEAEYISLAAAGCQALW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-5835.79Show/hide
Query:  GLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLI
        G  +  R   + ++ K+L  ++ +  AWY  + ++ L  GF     + +L+V +   GK ++  L YVDD+L T ND  L  +  +++ + F + D   +
Subjt:  GLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLI

Query:  HYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEA
        HYFL IE  +    + +SQ++Y  DLL +  M  A P  TPM  + KL      ++ +P+ YR +VGSL YL  TRPDI +A++ LS+FM  P   H +A
Subjt:  HYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEA

Query:  GKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCTKK
         KR                     S+  + D+DW G+ DD+ ST+GY+  +G    SW+SKKQ  V  S+TEAEY S+A    +  W+  +L EL     
Subjt:  GKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCTKK

Query:  CETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
           V++CDN  A  L  N +FH R KHI I YHFIR+ V+ G + V +  T DQ+A   TK L    F  F  K+GV +V
Subjt:  CETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-5734.09Show/hide
Query:  FFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTKNDKFLCDDF
        F + TL          G  +  R   + ++ K++  ++ +  AWY  + ++ L  GF     + +L+V +   G+ +I  L YVDD+L T ND  L    
Subjt:  FFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTKNDKFLCDDF

Query:  KNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFAIS
         +++ + F + +   +HYFL IE  +    + +SQ++Y  DLL +  M  A P  TPM  + KL      ++ +P+ YR +VGSL YL  TRPD+ +A++
Subjt:  KNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFAIS

Query:  MLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQ
         LS++M  P   HW A KR                     S+  + D+DW G+ DD+ ST+GY+  +G    SW+SKKQ  V  S+TEAEY S+A    +
Subjt:  MLSRFMTNPKRSHWEAGKR--------------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQ

Query:  ALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV
          W+  +L EL        V++CDN  A  L  N +FH R KHI + YHFIR+ V+ G + V +  T DQ+A   TK L    F  F  K+GV +V
Subjt:  ALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGRSKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.3e-4732.52Show/hide
Query:  IVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNKN
        +  ++KS+  ++ +S  W+ +     +  GF +   +H  ++K      FL V +YVDD++   N+    D+ K+ +K  F++ D+G + YFL +E+ ++
Subjt:  IVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNKN

Query:  EGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEAGKR--------
           I I Q+KYA DLL +  +    P + PMD ++       G+ V+   YR L+G LMYL  TR DI FA++ LS+F   P+ +H +A  +        
Subjt:  EGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEAGKR--------

Query:  ------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCTKKCETVLFCDNGF
                      +  F D+ +    D  +ST+GY   +G+ + SW SKKQ VVS S+ EAEY +L+ A  + +WL    +EL+      T+LFCDN  
Subjt:  ------------NSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCTKKCETVLFCDNGF

Query:  AIALSKNSIFHGRSKHIRIKYHFIRD
        AI ++ N++FH R+KHI    H +R+
Subjt:  AIALSKNSIFHGRSKHIRIKYHFIRD

ATMG00810.1 DNA/RNA polymerases superfamily protein1.2e-3034.23Show/hide
Query:  LYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSL
        LYVDD+L T +   L +     +   F M D+G +HYFL I++  +   + +SQ KYA  +L    M +  P +TP+   L        +  +PS +RS+
Subjt:  LYVDDLLFTKNDKFLCDDFKNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSL

Query:  VGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEAGKR-----------------NS---VVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSV
        VG+L YLT TRPDI +A++++ + M  P  + ++  KR                 NS   V  FCDSDW G     +ST+G+   +G  + SW++K+Q  
Subjt:  VGSLMYLTATRPDILFAISMLSRFMTNPKRSHWEAGKR-----------------NS---VVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSV

Query:  VSLSTTEAEYISLAAAGCQALW
        VS S+TE EY +LA    +  W
Subjt:  VSLSTTEAEYISLAAAGCQALW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGTTTATCAAATGGATGCAAAATCTGCTTTTTTGAATGGACACTTGAAGGAAGAGATATTTGTTGCACAACCTTTGGGCTATGTGAAAATGGGAGAAGAAGAAA
AATTGTACAAGTTGAAAAAAGCCTTGTATGGATTGAAGCAAGCTCCACGGCTTGGTACAGTCGTATCGACAGTTTTTTTCTAAAGACAGGATTTCGAAGGTGTCCATATG
AGCATGCACTCTATGTCAAAGAAGACAAGTATGGAAAATTTCTCATCGTTTCTCTTTACGTTGATGATTTACTTTTTACTAAAAACGATAAATTTTTGTGTGATGATTTT
AAGAATTCCATGAAAAAGGAATTCGAAATGAGTGATATGGGTCTCATCCACTACTTTCTCGAAATTGAAGTTAATAAAAATGAAGGAGAAATTGTCATTTCACAACAAAA
GTATGCTCATGATTTATTAAAAAAATTTCGAATGGAAAATGCTTCACCTTGCAACACTCCAATGGATGCAAATTTGAAATTGTGCAAGGATGATATTGGAGAAGTAGTCA
ATCCAAGTTTATATCGAAGCTTAGTTGGAAGCTTAATGTACTTGACAGCAACAAGACCTGATATTTTATTTGCTATAAGTATGTTAAGCAGATTTATGACAAACCCGAAA
AGAAGTCATTGGGAAGCAGGTAAAAGAAACAGTGTTGTTTGCTTTTGTGATAGTGACTGGGGTGGTAATGTGGATGATCATAAAAGTACATCTGGTTATGTTTTTAGTAT
GGGTTCAGGTGTTTTTTCATGGACTTCAAAGAAACAATCTGTTGTTTCCCTTTCTACAACCGAAGCAGAATATATCTCGTTAGCTGCAGCTGGATGTCAAGCTTTATGGC
TTCGATGGATGTTAAAAGAATTGAAGTGTACTAAAAAATGTGAAACTGTTTTATTTTGTGATAATGGATTTGCCATAGCATTATCAAAGAATTCAATTTTCCATGGAAGA
AGCAAGCATATTAGAATCAAATATCATTTTATCAGAGACTTGGTTGAAGATGGAGAAGTGATAGTAAAATATTGCAAGACTCAAGATCAAGTGGCATATATTTTTACAAA
GGCGCTCAAGTTTGACTTATTTGTAAAATTCAGAGGAAAACTTGGAGTTGCTCAAGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGTTTATCAAATGGATGCAAAATCTGCTTTTTTGAATGGACACTTGAAGGAAGAGATATTTGTTGCACAACCTTTGGGCTATGTGAAAATGGGAGAAGAAGAAA
AATTGTACAAGTTGAAAAAAGCCTTGTATGGATTGAAGCAAGCTCCACGGCTTGGTACAGTCGTATCGACAGTTTTTTTCTAAAGACAGGATTTCGAAGGTGTCCATATG
AGCATGCACTCTATGTCAAAGAAGACAAGTATGGAAAATTTCTCATCGTTTCTCTTTACGTTGATGATTTACTTTTTACTAAAAACGATAAATTTTTGTGTGATGATTTT
AAGAATTCCATGAAAAAGGAATTCGAAATGAGTGATATGGGTCTCATCCACTACTTTCTCGAAATTGAAGTTAATAAAAATGAAGGAGAAATTGTCATTTCACAACAAAA
GTATGCTCATGATTTATTAAAAAAATTTCGAATGGAAAATGCTTCACCTTGCAACACTCCAATGGATGCAAATTTGAAATTGTGCAAGGATGATATTGGAGAAGTAGTCA
ATCCAAGTTTATATCGAAGCTTAGTTGGAAGCTTAATGTACTTGACAGCAACAAGACCTGATATTTTATTTGCTATAAGTATGTTAAGCAGATTTATGACAAACCCGAAA
AGAAGTCATTGGGAAGCAGGTAAAAGAAACAGTGTTGTTTGCTTTTGTGATAGTGACTGGGGTGGTAATGTGGATGATCATAAAAGTACATCTGGTTATGTTTTTAGTAT
GGGTTCAGGTGTTTTTTCATGGACTTCAAAGAAACAATCTGTTGTTTCCCTTTCTACAACCGAAGCAGAATATATCTCGTTAGCTGCAGCTGGATGTCAAGCTTTATGGC
TTCGATGGATGTTAAAAGAATTGAAGTGTACTAAAAAATGTGAAACTGTTTTATTTTGTGATAATGGATTTGCCATAGCATTATCAAAGAATTCAATTTTCCATGGAAGA
AGCAAGCATATTAGAATCAAATATCATTTTATCAGAGACTTGGTTGAAGATGGAGAAGTGATAGTAAAATATTGCAAGACTCAAGATCAAGTGGCATATATTTTTACAAA
GGCGCTCAAGTTTGACTTATTTGTAAAATTCAGAGGAAAACTTGGAGTTGCTCAAGTCTAG
Protein sequenceShow/hide protein sequence
MESLSNGCKICFFEWTLEGRDICCTTFGLCENGRRRKIVQVEKSLVWIEASSTAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTKNDKFLCDDF
KNSMKKEFEMSDMGLIHYFLEIEVNKNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEVVNPSLYRSLVGSLMYLTATRPDILFAISMLSRFMTNPK
RSHWEAGKRNSVVCFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVSLSTTEAEYISLAAAGCQALWLRWMLKELKCTKKCETVLFCDNGFAIALSKNSIFHGR
SKHIRIKYHFIRDLVEDGEVIVKYCKTQDQVAYIFTKALKFDLFVKFRGKLGVAQV