CuGenDBv2

Lag0036302 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036302
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr3:43682414..43683406
RNA-Seq Expression: Lag0036302
Synteny: Lag0036302
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
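
The genome location above can be split into its parts programmatically. The following is a minimal sketch, assuming the `chromosome:start..end` string uses 1-based inclusive coordinates (an assumption, though it agrees with the 993-nucleotide CDS listed for this gene). The function names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:43682414..43683406")
print(chrom, span_length(start, end))  # chr3 993
```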


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.6e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.6e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.6e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.6e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.6e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
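
The % identity figures relate to the middle line of each alignment block. The sketch below assumes the usual BLAST-style convention (a letter in the middle line marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap) and counts identical columns over the aligned length. The function name `percent_identity` is hypothetical, and the example uses only the first ten columns of the first alignment above.

```python
def percent_identity(blocks):
    """Percent of aligned columns whose midline character is a letter.

    blocks: iterable of (query_segment, midline_segment) pairs, one per
    alignment block. The midline is padded with spaces to the query
    length in case trailing whitespace was trimmed.
    """
    columns = 0
    identities = 0
    for query, midline in blocks:
        width = len(query)
        midline = midline.ljust(width)[:width]
        columns += width
        identities += sum(1 for ch in midline if ch.isalpha())
    if columns == 0:
        raise ValueError("no aligned columns supplied")
    return 100.0 * identities / columns


# First ten columns of the first GenBank alignment: M, S and L are identical.
example = [("MSSSIVSLLK", "M+S+ +++L ")]
print(percent_identity(example))  # 30.0
```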

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value 1.7e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

A0A5A7TU93 Gag/pol protein; e value 1.7e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

A0A5A7TWB9 Gag/pol protein; e value 1.7e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

A0A5A7U869 Gag/pol protein; e value 1.7e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

A0A5A7UGV2 Gag/pol protein; e value 1.7e-89; identity 66.42%
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDL FVL E+CP  PA NA++TV++ Y+RW KAN+K R YILASLSEVLAK+HESM+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNVAE NG VIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH
        Q +GEANV  S RKF +GS+SGTKS+ +SS  KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GH
Subjt:  QIEGEANVVHSKRKFEKGSSSGTKSVATSS--KKTQKKKGNKG-KAPSTAAKSKGKAKAMADKGKCFHCNVDGH

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 3.4e-05; identity 22.61%
Query:  KLTGEN-YATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQEMFGQPSYQLHH
        K  G+N ++TW+  +  +L+   L  VL  D        A        + W   +++    I   LS+ +        TAR I   L+ ++   +     
Subjt:  KLTGEN-YATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQEMFGQPSYQLHH

Query:  DALKYVYSCRMKEGTSVREHVLDMMVQFNVAEAN-GVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVH
           K +Y+  M EGT+   H L++        AN GV I+E  +   +L SLP S+    +  +  K T  L  + + L L + + K K + +G+A +  
Subjt:  DALKYVYSCRMKEGTSVREHVLDMMVQFNVAEAN-GVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVH

Query:  SKRKFEKGSSSGTKSVATSSKKTQKKKGNKGKAPSTAAKSKGKAKAMADKGKCFHCNVDGH
         +                  +  Q+   N G+   + A+ K K ++ +    C++CN  GH
Subjt:  SKRKFEKGSSSGTKSVATSSKKTQKKKGNKGKAPSTAAKSKGKAKAMADKGKCFHCNVDGH

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTTCCTCAATTGTCTCTCTGCTTAAAAATGAGAAATTAACCGGCGAAAATTATGCTACGTGGAAGTCGAACCTGAATACGATTCTTGTTGTCGATGATCTGTGGTT
CGTCTTAACGGAGGATTGTCCTCCAGCCCCTGCTCGTAATGCATCCCAGACAGTTAAGGATGCTTATGACCGCTGGACAAAGGCCAATGATAAGACTCGCGTCTATATCT
TAGCCAGCTTATCTGAAGTTTTGGCTAAAAGGCATGAGAGCATGGTAACAGCGAGGGAGATTATGAACTCTCTCCAGGAGATGTTTGGACAACCGTCCTACCAACTCCAC
CATGACGCTCTCAAATACGTTTATAGCTGTCGCATGAAAGAGGGCACGTCTGTTCGGGAGCATGTCCTGGATATGATGGTCCAATTCAACGTGGCAGAGGCAAACGGGGT
GGTCATAGATGAGCGTAGTCAGGTTGCATTCATCTTAGAATCTCTTCCGAAGAGTTTTCTACAGTTTAGAAGCAATGCAATGATGAATAAAATAACATTCAACCTGACTA
GCCTCCTGAATGAGCTACAACTCTATCAGTCACTTCTTAAGAACAAGGGACAGATAGAAGGAGAGGCAAACGTTGTCCACTCTAAAAGAAAGTTCGAGAAGGGTTCATCC
TCTGGAACTAAATCTGTAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAAAGGAAACAAGGGGAAAGCTCCCAGCACTGCTGCTAAAAGCAAGGGAAAAGCCAAAGCTAT
GGCAGACAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACATGCGTGTGGGTGTACAATCTCTAACTTTAATCCCATGATTCTCTCTCTTATGTGTGTGAGTGTGTTTC
ACTCGAGGGCTGGGACACTTGATCTCGAGTCTGAAGCTTCAATGGATTCTTGGAGCTTGAAGTCTTCCAGTCTTGAAGGAGTCTTCAATCTTCAAGAAGTGTTGATTTCC
TGA
mRNA sequence
ATGTCTTCCTCAATTGTCTCTCTGCTTAAAAATGAGAAATTAACCGGCGAAAATTATGCTACGTGGAAGTCGAACCTGAATACGATTCTTGTTGTCGATGATCTGTGGTT
CGTCTTAACGGAGGATTGTCCTCCAGCCCCTGCTCGTAATGCATCCCAGACAGTTAAGGATGCTTATGACCGCTGGACAAAGGCCAATGATAAGACTCGCGTCTATATCT
TAGCCAGCTTATCTGAAGTTTTGGCTAAAAGGCATGAGAGCATGGTAACAGCGAGGGAGATTATGAACTCTCTCCAGGAGATGTTTGGACAACCGTCCTACCAACTCCAC
CATGACGCTCTCAAATACGTTTATAGCTGTCGCATGAAAGAGGGCACGTCTGTTCGGGAGCATGTCCTGGATATGATGGTCCAATTCAACGTGGCAGAGGCAAACGGGGT
GGTCATAGATGAGCGTAGTCAGGTTGCATTCATCTTAGAATCTCTTCCGAAGAGTTTTCTACAGTTTAGAAGCAATGCAATGATGAATAAAATAACATTCAACCTGACTA
GCCTCCTGAATGAGCTACAACTCTATCAGTCACTTCTTAAGAACAAGGGACAGATAGAAGGAGAGGCAAACGTTGTCCACTCTAAAAGAAAGTTCGAGAAGGGTTCATCC
TCTGGAACTAAATCTGTAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAAAGGAAACAAGGGGAAAGCTCCCAGCACTGCTGCTAAAAGCAAGGGAAAAGCCAAAGCTAT
GGCAGACAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACATGCGTGTGGGTGTACAATCTCTAACTTTAATCCCATGATTCTCTCTCTTATGTGTGTGAGTGTGTTTC
ACTCGAGGGCTGGGACACTTGATCTCGAGTCTGAAGCTTCAATGGATTCTTGGAGCTTGAAGTCTTCCAGTCTTGAAGGAGTCTTCAATCTTCAAGAAGTGTTGATTTCC
TGA
Protein sequence
MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLWFVLTEDCPPAPARNASQTVKDAYDRWTKANDKTRVYILASLSEVLAKRHESMVTAREIMNSLQEMFGQPSYQLH
HDALKYVYSCRMKEGTSVREHVLDMMVQFNVAEANGVVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVHSKRKFEKGSS
SGTKSVATSSKKTQKKKGNKGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHACGCTISNFNPMILSLMCVSVFHSRAGTLDLESEASMDSWSLKSSSLEGVFNLQEVLIS
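
The relationship between the CDS and protein sequences can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame; the function name `translate` is hypothetical, and the example input is only the first 30 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTCTTCCTCAATTGTCTCTCTGCTTAAA"))  # MSSSIVSLLK
```

Passing the full CDS (with its line breaks) to the same function should give a protein of 330 residues, since the 993 nucleotides comprise 330 sense codons plus the terminal TGA stop.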