; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0014136 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0014136
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotransposon gag protein
Genome locationchr01:8474409..8475485
RNA-Seq ExpressionPI0014136
SyntenyPI0014136
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025369.1 uncharacterized protein E6C27_scaffold1204G00530 [Cucumis melo var. makuwa]1.5e-13074.61Show/hide
Query:  MSRSSNPHLEFFEDLNREVRRIRRERREE--INQLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI
        M+RSSN    +FEDLNREVRRIRRERREE  I+ L  Q PL   EP+L+   + NL R  +G+VREKTLREL+EPDEDQRPLCIVIP TTQPFELK GLI
Subjt:  MSRSSNPHLEFFEDLNREVRRIRRERREE--INQLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI

Query:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES
        HLL IFKGS GED HKHLKDFHMVC SMRPH +SEEQLNL+AFPF LTD AKRWLYYLEP  ITTW SLKKKFLEKFFPASRANNIRKEIYGIRQAFGES
Subjt:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES

Query:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNKRGGN-VKVAKCGVCGL
        LSEYWE  KEL ASFPHHHISDPSLIQYFYSGLL++DRNTVD AAGGALADKT  E RELISRM ENSQ F  ++      L K     +KV KCGVCGL
Subjt:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNKRGGN-VKVAKCGVCGL

Query:  IGHPNDKCPDLVEEANVIRKYDP
        +GHPNDKCP+++E+ N++++YDP
Subjt:  IGHPNDKCPDLVEEANVIRKYDP

KAA0031967.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.7e-14271.55Show/hide
Query:  MSRSSNPHLEFFEDLNREVRRIRRERREEIN--QLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI
        M+RSSN    +FEDLN+EVRRIRRERREE N   L  Q PL   EP+L+   + NL R  MG+VREKT+R+L E DEDQRPLCIVI  TTQPFELK  LI
Subjt:  MSRSSNPHLEFFEDLNREVRRIRRERREEIN--QLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI

Query:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES
        HLLPIFKG+SGEDPHKHLKDFHMVCDSMRPHG+SEEQLNL+AFPFSLTD AKRWLYYLEPGSITTWGSLKKKFLEKFFPASR NNIRKEIYGIRQAFGES
Subjt:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES

Query:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNK----------------
        L +YWE FKELCA+FPHHHI  PSLIQYFY GLL++DRNTVDAAAGGALA+K PTEARELISRMA+NSQ+F  ++      L K                
Subjt:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNK----------------

Query:  ----RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWGND
            +G  +KV KC VCGL+GHPNDKCP+++EE N+++KYDP+ NTYN GWRDNP LRWGND
Subjt:  ----RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWGND

KAA0032001.1 uncharacterized protein E6C27_scaffold134G00970 [Cucumis melo var. makuwa]9.6e-11478.11Show/hide
Query:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE
        MG+VREKTLREL EPDEDQRPLCIV+PPTTQPFELK GLIHLL IF KGS GEDPHKHLKDFHMVCDSMRPHG+ EEQLNL+AFPFSL D AKRWLYYLE
Subjt:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE

Query:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE
          SITTWGS KKKFLEKFFPASRA+NIR     IRQAFGESLSEYWE FKELCASFPHHHI DPSLIQYFYSGLL+ DR TVDAAAGGAL +KTPTEARE
Subjt:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE

Query:  LISRMAENSQKFWQQSIRARQFLNKRGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG
        LISRMAENSQ F  ++      L K        +CGVCGL+GH NDKCP+L+E+ N++R+YDPHG
Subjt:  LISRMAENSQKFWQQSIRARQFLNKRGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG

KAA0058150.1 uncharacterized protein E6C27_scaffold274G004630 [Cucumis melo var. makuwa]5.6e-12275.7Show/hide
Query:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEP
        MG+VREK LREL+EP+ED RPLCIVIPPTTQPFELK GLIHLLPIFKGSSGEDPHKHLKDFHMVCDSMRP+ +SEEQLNL+AFPF LTD AK WLYYLEP
Subjt:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEP

Query:  GSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEAREL
        GSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLS+YWE FKELCAS PH+HI DPSLIQYFYSGLL+ DRNTVDAA GGALADKTPTEAR+L
Subjt:  GSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEAREL

Query:  ISRMAENSQKFWQQSIRARQFLNK--------------------RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG
        ISRM ENSQ F  ++      L K                    +G  +KV KCGVCGL+GHPNDKCP+++E+ N++R+YDPHG
Subjt:  ISRMAENSQKFWQQSIRARQFLNK--------------------RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG

XP_008460738.1 PREDICTED: uncharacterized protein LOC103499500 [Cucumis melo]2.9e-12675.17Show/hide
Query:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE
        MG+VREKTLREL EPDEDQRPLCIV+PPTTQPFELK GLIHLL IF KGS GEDPHKHLKDFHMVCDSMRPHG+ EEQLNL+AFPFSL D AKRWLYYLE
Subjt:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE

Query:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE
          SITTWGS KKKFLEKFFPASRA+NIR     IRQAFGESLSEYWE FKELCASFPHHHI DPSLIQYFYSGLL+ DR TVDAAAGGAL +KTPTEARE
Subjt:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE

Query:  LISRMAENSQKFWQQSIRARQFLNKRGGN-------------------VKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWG
        LISRMAENSQ F  ++      L K                       +KV KCGVCGL+GH NDKCP+L+E+ N++R+YDPHGNTYN+GWRDNPNLRWG
Subjt:  LISRMAENSQKFWQQSIRARQFLNKRGGN-------------------VKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWG

Query:  ND
        ND
Subjt:  ND

TrEMBL top hitse value%identityAlignment
A0A1S3CD34 uncharacterized protein LOC1034995001.4e-12675.17Show/hide
Query:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE
        MG+VREKTLREL EPDEDQRPLCIV+PPTTQPFELK GLIHLL IF KGS GEDPHKHLKDFHMVCDSMRPHG+ EEQLNL+AFPFSL D AKRWLYYLE
Subjt:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE

Query:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE
          SITTWGS KKKFLEKFFPASRA+NIR     IRQAFGESLSEYWE FKELCASFPHHHI DPSLIQYFYSGLL+ DR TVDAAAGGAL +KTPTEARE
Subjt:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE

Query:  LISRMAENSQKFWQQSIRARQFLNKRGGN-------------------VKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWG
        LISRMAENSQ F  ++      L K                       +KV KCGVCGL+GH NDKCP+L+E+ N++R+YDPHGNTYN+GWRDNPNLRWG
Subjt:  LISRMAENSQKFWQQSIRARQFLNKRGGN-------------------VKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWG

Query:  ND
        ND
Subjt:  ND

A0A5A7SRF5 Retrotransposon gag protein8.2e-14371.55Show/hide
Query:  MSRSSNPHLEFFEDLNREVRRIRRERREEIN--QLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI
        M+RSSN    +FEDLN+EVRRIRRERREE N   L  Q PL   EP+L+   + NL R  MG+VREKT+R+L E DEDQRPLCIVI  TTQPFELK  LI
Subjt:  MSRSSNPHLEFFEDLNREVRRIRRERREEIN--QLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI

Query:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES
        HLLPIFKG+SGEDPHKHLKDFHMVCDSMRPHG+SEEQLNL+AFPFSLTD AKRWLYYLEPGSITTWGSLKKKFLEKFFPASR NNIRKEIYGIRQAFGES
Subjt:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES

Query:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNK----------------
        L +YWE FKELCA+FPHHHI  PSLIQYFY GLL++DRNTVDAAAGGALA+K PTEARELISRMA+NSQ+F  ++      L K                
Subjt:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNK----------------

Query:  ----RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWGND
            +G  +KV KC VCGL+GHPNDKCP+++EE N+++KYDP+ NTYN GWRDNP LRWGND
Subjt:  ----RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNPNLRWGND

A0A5A7USL5 Retrotrans_gag domain-containing protein2.7e-12275.7Show/hide
Query:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEP
        MG+VREK LREL+EP+ED RPLCIVIPPTTQPFELK GLIHLLPIFKGSSGEDPHKHLKDFHMVCDSMRP+ +SEEQLNL+AFPF LTD AK WLYYLEP
Subjt:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEP

Query:  GSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEAREL
        GSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLS+YWE FKELCAS PH+HI DPSLIQYFYSGLL+ DRNTVDAA GGALADKTPTEAR+L
Subjt:  GSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEAREL

Query:  ISRMAENSQKFWQQSIRARQFLNK--------------------RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG
        ISRM ENSQ F  ++      L K                    +G  +KV KCGVCGL+GHPNDKCP+++E+ N++R+YDPHG
Subjt:  ISRMAENSQKFWQQSIRARQFLNK--------------------RGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG

A0A5D3CZ23 Retrotrans_gag domain-containing protein4.7e-11478.11Show/hide
Query:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE
        MG+VREKTLREL EPDEDQRPLCIV+PPTTQPFELK GLIHLL IF KGS GEDPHKHLKDFHMVCDSMRPHG+ EEQLNL+AFPFSL D AKRWLYYLE
Subjt:  MGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIF-KGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLE

Query:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE
          SITTWGS KKKFLEKFFPASRA+NIR     IRQAFGESLSEYWE FKELCASFPHHHI DPSLIQYFYSGLL+ DR TVDAAAGGAL +KTPTEARE
Subjt:  PGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARE

Query:  LISRMAENSQKFWQQSIRARQFLNKRGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG
        LISRMAENSQ F  ++      L K        +CGVCGL+GH NDKCP+L+E+ N++R+YDPHG
Subjt:  LISRMAENSQKFWQQSIRARQFLNKRGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHG

A0A5D3DIP6 Retrotrans_gag domain-containing protein7.2e-13174.61Show/hide
Query:  MSRSSNPHLEFFEDLNREVRRIRRERREE--INQLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI
        M+RSSN    +FEDLNREVRRIRRERREE  I+ L  Q PL   EP+L+   + NL R  +G+VREKTLREL+EPDEDQRPLCIVIP TTQPFELK GLI
Subjt:  MSRSSNPHLEFFEDLNREVRRIRRERREE--INQLFLQVPLENQEPTLE---NQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLI

Query:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES
        HLL IFKGS GED HKHLKDFHMVC SMRPH +SEEQLNL+AFPF LTD AKRWLYYLEP  ITTW SLKKKFLEKFFPASRANNIRKEIYGIRQAFGES
Subjt:  HLLPIFKGSSGEDPHKHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGES

Query:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNKRGGN-VKVAKCGVCGL
        LSEYWE  KEL ASFPHHHISDPSLIQYFYSGLL++DRNTVD AAGGALADKT  E RELISRM ENSQ F  ++      L K     +KV KCGVCGL
Subjt:  LSEYWEHFKELCASFPHHHISDPSLIQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNKRGGN-VKVAKCGVCGL

Query:  IGHPNDKCPDLVEEANVIRKYDP
        +GHPNDKCP+++E+ N++++YDP
Subjt:  IGHPNDKCPDLVEEANVIRKYDP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCGCTCTTCCAATCCTCACTTAGAATTTTTTGAAGACCTTAATAGAGAAGTGCGTAGAATTAGGAGAGAAAGGAGAGAAGAAATTAATCAACTATTCCTTCAAGT
ACCTCTAGAAAACCAAGAGCCTACCCTAGAAAACCAAAACCTTGATAGACAAGCCATGGGCCAAGTTAGAGAAAAAACTCTTAGAGAGCTTTCTGAACCTGATGAAGACC
AAAGGCCCCTATGTATAGTAATTCCCCCAACAACTCAACCCTTCGAGCTAAAACTTGGACTAATCCACCTTTTACCTATATTTAAGGGTAGTTCAGGCGAAGACCCTCAT
AAGCACCTAAAAGACTTTCATATGGTTTGTGACTCTATGAGACCCCACGGAGTTTCAGAGGAACAACTTAACCTGAAAGCCTTTCCATTTTCTTTAACCGACACAGCCAA
AAGATGGCTTTACTATCTAGAGCCAGGGTCTATAACCACTTGGGGAAGCCTTAAAAAGAAGTTCCTAGAAAAGTTCTTCCCTGCTTCTAGAGCAAATAATATTAGGAAGG
AAATCTACGGAATTAGGCAAGCCTTTGGGGAGTCTTTATCAGAATATTGGGAACATTTCAAAGAGCTTTGCGCTAGCTTTCCCCACCATCATATTTCTGACCCCTCTTTA
ATACAGTATTTTTATTCTGGACTACTAACCACGGATAGAAATACAGTAGATGCTGCAGCAGGTGGTGCCTTAGCTGATAAAACCCCTACTGAAGCAAGAGAGCTAATTTC
ACGGATGGCTGAGAACTCACAAAAATTTTGGCAACAGAGCATCCGAGCTCGACAATTCCTTAACAAAAGAGGTGGAAATGTGAAAGTTGCAAAATGTGGAGTTTGTGGTC
TCATTGGACATCCTAATGACAAGTGTCCCGACCTTGTGGAAGAGGCAAATGTCATTAGGAAGTACGATCCTCATGGAAACACTTACAATGCCGGTTGGAGAGATAACCCT
AACCTTAGATGGGGAAATGACGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCGCTCTTCCAATCCTCACTTAGAATTTTTTGAAGACCTTAATAGAGAAGTGCGTAGAATTAGGAGAGAAAGGAGAGAAGAAATTAATCAACTATTCCTTCAAGT
ACCTCTAGAAAACCAAGAGCCTACCCTAGAAAACCAAAACCTTGATAGACAAGCCATGGGCCAAGTTAGAGAAAAAACTCTTAGAGAGCTTTCTGAACCTGATGAAGACC
AAAGGCCCCTATGTATAGTAATTCCCCCAACAACTCAACCCTTCGAGCTAAAACTTGGACTAATCCACCTTTTACCTATATTTAAGGGTAGTTCAGGCGAAGACCCTCAT
AAGCACCTAAAAGACTTTCATATGGTTTGTGACTCTATGAGACCCCACGGAGTTTCAGAGGAACAACTTAACCTGAAAGCCTTTCCATTTTCTTTAACCGACACAGCCAA
AAGATGGCTTTACTATCTAGAGCCAGGGTCTATAACCACTTGGGGAAGCCTTAAAAAGAAGTTCCTAGAAAAGTTCTTCCCTGCTTCTAGAGCAAATAATATTAGGAAGG
AAATCTACGGAATTAGGCAAGCCTTTGGGGAGTCTTTATCAGAATATTGGGAACATTTCAAAGAGCTTTGCGCTAGCTTTCCCCACCATCATATTTCTGACCCCTCTTTA
ATACAGTATTTTTATTCTGGACTACTAACCACGGATAGAAATACAGTAGATGCTGCAGCAGGTGGTGCCTTAGCTGATAAAACCCCTACTGAAGCAAGAGAGCTAATTTC
ACGGATGGCTGAGAACTCACAAAAATTTTGGCAACAGAGCATCCGAGCTCGACAATTCCTTAACAAAAGAGGTGGAAATGTGAAAGTTGCAAAATGTGGAGTTTGTGGTC
TCATTGGACATCCTAATGACAAGTGTCCCGACCTTGTGGAAGAGGCAAATGTCATTAGGAAGTACGATCCTCATGGAAACACTTACAATGCCGGTTGGAGAGATAACCCT
AACCTTAGATGGGGAAATGACGCCTAA
Protein sequenceShow/hide protein sequence
MSRSSNPHLEFFEDLNREVRRIRRERREEINQLFLQVPLENQEPTLENQNLDRQAMGQVREKTLRELSEPDEDQRPLCIVIPPTTQPFELKLGLIHLLPIFKGSSGEDPH
KHLKDFHMVCDSMRPHGVSEEQLNLKAFPFSLTDTAKRWLYYLEPGSITTWGSLKKKFLEKFFPASRANNIRKEIYGIRQAFGESLSEYWEHFKELCASFPHHHISDPSL
IQYFYSGLLTTDRNTVDAAAGGALADKTPTEARELISRMAENSQKFWQQSIRARQFLNKRGGNVKVAKCGVCGLIGHPNDKCPDLVEEANVIRKYDPHGNTYNAGWRDNP
NLRWGNDA