; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg017503 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg017503
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase
Genome locationscaffold4:33747774..33759274
RNA-Seq ExpressionSpg017503
SyntenySpg017503
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.2e-8553.41Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF
        + LS+A KE R +CLYNHHIS KGYANLA++L+L+ D SNRAILWKEARKGKN EY D+ T    AR++ +             + T EH GRVRGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI
        V+PS+Y+NV + KSK +Q+ Q   S+            ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN
        HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSMQ EDT+ IN
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        + + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]1.2e-8553.41Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF
        + LS+A KE R +CLYNHHIS KGYANLA++L+L+ D SNRAILWKEARKGKN EY D+ T    AR++ +             + T EH GRVRGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI
        V+PS+Y+NV + KSK +Q+ Q   S+            ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN
        HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSMQ EDT+ IN
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        + + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]1.2e-8553.41Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF
        + LS+A KE R +CLYNHHIS KGYANLA++L+L+ D SNRAILWKEARKGKN EY D+ T    AR++ +             + T EH GRVRGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI
        V+PS+Y+NV + KSK +Q+ Q   S+            ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN
        HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSMQ EDT+ IN
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        + + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]1.0e-8953.37Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVTIARVNRI----------------VSTLEHRGRVRGVGEF
        +  S AQ+ERR +C+YNHHIS KGYANLA++LEL+ D  NRA LWKEARK KN EY D  T     RI                + T EHRGR+RGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVTIARVNRI----------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEA------------------------SSVKTEAPRRKQPQSDASSATHKK-SKGKDVVREIPENKEAGTPCHLAMGSM
        V+P+++YNVA+ K KL Q+ Q+EA                        SSV  +  +RK+ Q   +    KK  KGK VV++ PE    G PCHLA+GS+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEA------------------------SSVKTEAPRRKQPQSDASSATHKK-SKGKDVVREIPENKEAGTPCHLAMGSM

Query:  DNIVVVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIR
        DNIV VGTM+ES +Q  +I+ +PLG +NVR +VD+V+G+D ALPIP  D+++TL QAIGNFV WPRKLVIT  +K+ P P  +K I QSSK+TDVHVTI+
Subjt:  DNIVVVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIR

Query:  LLNRYAMLSMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        LLNRYAM SMQ +D + IN+ E+ILGKE +I+L R+DI+QYCG  EIGYSCIL YI
Subjt:  LLNRYAMLSMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]1.0e-8953.37Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVTIARVNRI----------------VSTLEHRGRVRGVGEF
        +  S AQ+ERR +C+YNHHIS KGYANLA++LEL+ D  NRA LWKEARK KN EY D  T     RI                + T EHRGR+RGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVTIARVNRI----------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEA------------------------SSVKTEAPRRKQPQSDASSATHKK-SKGKDVVREIPENKEAGTPCHLAMGSM
        V+P+++YNVA+ K KL Q+ Q+EA                        SSV  +  +RK+ Q   +    KK  KGK VV++ PE    G PCHLA+GS+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEA------------------------SSVKTEAPRRKQPQSDASSATHKK-SKGKDVVREIPENKEAGTPCHLAMGSM

Query:  DNIVVVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIR
        DNIV VGTM+ES +Q  +I+ +PLG +NVR +VD+V+G+D ALPIP  D+++TL QAIGNFV WPRKLVIT  +K+ P P  +K I QSSK+TDVHVTI+
Subjt:  DNIVVVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIR

Query:  LLNRYAMLSMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        LLNRYAM SMQ +D + IN+ E+ILGKE +I+L R+DI+QYCG  EIGYSCIL YI
Subjt:  LLNRYAMLSMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X17.6e-8649.46Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVTIARVNRI----------------VSTLEHRGRVRGVGEF
        +  S AQ+ERR +C+YNHHIS KGYANLA++LEL+ D  NRA LWKEARK KN    D+ T   V RI                + T EHRGR+RGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVTIARVNRI----------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQ----------------PQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDV-----------------------VREIPEN
        V+P+++ NVAR   KLSQQ                 QS+A +   ++    + Q   SS + KK+KGK V                       V + PEN
Subjt:  VTPSVYYNVAREKSKLSQQ----------------PQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDV-----------------------VREIPEN

Query:  KEAGTPCHLAMGSMDNIVVVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVKA-KPI
           G PCHLA+GS+DN+V VG M+ES  Q  TIHG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QAIGNFV WPRKLVI   +K+ P + A +  
Subjt:  KEAGTPCHLAMGSMDNIVVVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVKA-KPI

Query:  VQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
         QSSK+TDVHVTI+LLNRYAM +MQ ED + I++ E I GKE +I+L R+DI+QYCG  EIGYSCILTYI
Subjt:  VQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X15.8e-8653.41Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF
        + LS+A KE R +CLYNHHIS KGYANLA++L+L+ D SNRAILWKEARKGKN EY D+ T    AR++ +             + T EH GRVRGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI
        V+PS+Y+NV + KSK +Q+ Q   S+            ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN
        HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSMQ EDT+ IN
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        + + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X45.8e-8653.41Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF
        + LS+A KE R +CLYNHHIS KGYANLA++L+L+ D SNRAILWKEARKGKN EY D+ T    AR++ +             + T EH GRVRGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI
        V+PS+Y+NV + KSK +Q+ Q   S+            ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN
        HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSMQ EDT+ IN
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        + + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C398 uncharacterized protein LOC111007859 isoform X35.8e-8653.41Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF
        + LS+A KE R +CLYNHHIS KGYANLA++L+L+ D SNRAILWKEARKGKN EY D+ T    AR++ +             + T EH GRVRGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI
        V+PS+Y+NV + KSK +Q+ Q   S+            ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN
        HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSMQ EDT+ IN
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        + + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X25.8e-8653.41Show/hide
Query:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF
        + LS+A KE R +CLYNHHIS KGYANLA++L+L+ D SNRAILWKEARKGKN EY D+ T    AR++ +             + T EH GRVRGVGEF
Subjt:  QALSKAQKERRERCLYNHHISHKGYANLAKDLELTDDSSNRAILWKEARKGKNKEYCDEVT---IARVNRI-------------VSTLEHRGRVRGVGEF

Query:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI
        V+PS+Y+NV + KSK +Q+ Q   S+            ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+
Subjt:  VTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN
        HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSMQ EDT+ IN
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        + + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAGGGGTGTCCACTCCTTGACATTGATCCTGAGATAGAAAGAACCTTTCGTCATCGTAGGAAGGAACAAAGACGAAAGAGAAGGGAACAACAAGAGTTGAGCGC
ACAGGAACCTCTAGAAGAAGCTTCTTACATACAAGAGTTTCCAATGGAACCTCCTGGAGTCGATCCTCAAGTTGATCCACAGAATCGTGGAAGGGAGCAAAATGGTGGGA
GAACTTCTCCTGTTCCTCCAGTTCCACCGAGTAATCCAACCGCCCACGAAGCTGAAGCAAGTGTTCAACGGCAAGAGGAGAACCTCGAAGCACCCATGCATGACACGAGA
AGGACGAGACCCACGGGTTTCTCGCCGGCGATCGTGAACCAAGGTACTTCCAACTCTCAAACTCCTTCTTCCTTGGCAATGCCGGTCAGCTCGAGGGAGAATCCGAGTTC
GTCTACACCTAGGAGGTCCACGCGCGCCACTGCCGTTCGCCAAACCCAAAAACCCGCAACTCAACAGTTCAAGAAACGTTCGCGGGAGTGGTTTTCAGTGATCCGGCCGA
TGGGAGCTCAGAGACGGGCTGCTCTTGAAGAAGAAGAAAATAGCCAAGATGAAGAAGAAGCCGCCAAAGCAGCAGGAAGCTCTCGGCAAGAGGGAACTTCAACGGGTAAA
ACGTCTGAACCTCAAGCTAACCCCTCCTCGTCTTGCAGGAACAAACCATTTGTTACCTATAACGCAAGGAGGAGGAGTCCCAAGAAAGTTGTGCTCGAGAAAGCACTTGT
AATCGAGCCTCTCAAAGTAGCAAGAATGCCCCTGGACGTGTTCGAGGACATAATTTGCCAAGCTGTGGCAAAGGCTCTCGTGATTGCTGAAGGGTATAAGGCTGAACAAG
AAGCCTTGAAGGATATTGAGGTTGAGAGAGAAATGGAAAATCAACATATGAGGGAAGAAGATGAAGGTGCAAGAGAAAGAGATCTTGAAGAAGAAAGGAAAAAGAAAGAG
GAAAGGCAAGAGGCCGATAGGGCCTTAAAAGCTGAAAAAGAGAGGAAGTTAGATGAAGACCTCAGGAGGGTAGCTGCTGATTTGCAACTCCTCAAGGAAGAAAAACGAAG
AAGGGAAGAATTGAAAGAAAACGAAGAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAAATTTTGAGCCACTCCAAAAGGCTCAAAGTGAAGTTGAATTGCTGCGAGGAA
GAGAAGAAAAGGCCCAACAGGGGCCAAGTAAAAAGAACCAAGAAAAAGAAAAAGAAAGAGAAGTAGAGGATGAAGGCCAAAATGCGACCACATCTGGGCCGCATTCTGAA
GAAGGCCTAGCAGAGGCCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCACCTCATCGGGAGAGAAGAGAGA
GATCAAGGAATTGGATGACGACCAAGTTCCTATCTCTGCGGCATTGAGGAGAAAGAGAAGAAGAGAGATTAAAGCTGAAAGGAGCACCAAAAATAAAAATGACCTCATAT
TTGCCAAGAGGCCGAGGACTAGGTCCATGGATGCCTCTCCAGCAGTTCCTCCAACCGTCTCACCCGCCAAGCCAAAGGCCAAATCACCGAAGGCTGCATCTCCTAAAAAT
CCATTTCCCGAGTATAAGTGGCAGGAGTTATGTGCTCACCCTCAGGAGGCTGTCGTGCCTTTAGTTCGAGAATTTTACGCTGACCTGAGGGAGGAAAGTATCAGTACGGC
GGTGGTGAGAGGCAAAATGGTTAGCTTCTCTTCTGTTGACATTAACCGGGTGTATAGACTCAAAGCACCCCTGAATCCAAAAGGGAACAACGTTATCAGGAACCCCTCGG
CCAAGCAGAAGAAAGAAACACTTAAACTCGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAAACGAAAGTGAAGACTCTAGTGCCAAGCGATCTAAAGCCAGAATCG
GCAGTTTGGCTTCACTTTCTAAAGAACCATTTGATGCCAACCAACACAATCTCAGTGGATAGAGTGATGCTACTCTATTGCATTATGAAGGGGTTGGAGATCAATATTGG
GACACTACAACAAACAAGTGGGTTTTCTCCAAACTTACAAACTGTCGGCCAGCACTTCTTCGTGGGTTCCAGATCTCTGTTTTTGGATAACAAATCAGTGGTGTTCGTCA
CTTTTTTTTTGTCATTCAAATTAAATATACAGGCGTTAAGTAAGGCTCAGAAAGAAAGACGAGAGAGATGCTTGTATAATCATCATATCTCTCATAAGGGATATGCAAAT
CTTGCCAAAGACTTAGAATTGACAGACGATTCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATATTGCGATGAGGTCACTATAGCACG
TGTCAATCGAATTGTAAGCACGCTAGAACACAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGA
GTCAGCAACCACAAAGCGAAGCTTCGAGTGTCAAGACCGAAGCCCCTCGACGAAAGCAACCACAAAGCGACGCTTCAAGTGCCACGCATAAAAAGTCAAAAGGAAAAGAT
GTCGTTCGTGAGATACCTGAGAATAAAGAGGCTGGAACACCTTGTCACCTAGCGATGGGCTCTATGGATAACATTGTTGTCGTAGGCACAATGTACGAGTCGCCTTCACA
AAATGCAACCATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTAC
AAACGTTGCATCAAGCGATCGGTAATTTTGTGGGATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTGTCAAAGCTAAGCCCATAGTACAATCA
AGCAAACATACAGATGTCCATGTTACTATTAGGCTCTTAAATAGATACGCGATGCTTTCAATGCAACAAGAAGATACACTAACGATCAATATGCACGAGCGTATCTTGGG
AAAGGAAGCATCAATATTTTTAAATCGTGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAGGGGTGTCCACTCCTTGACATTGATCCTGAGATAGAAAGAACCTTTCGTCATCGTAGGAAGGAACAAAGACGAAAGAGAAGGGAACAACAAGAGTTGAGCGC
ACAGGAACCTCTAGAAGAAGCTTCTTACATACAAGAGTTTCCAATGGAACCTCCTGGAGTCGATCCTCAAGTTGATCCACAGAATCGTGGAAGGGAGCAAAATGGTGGGA
GAACTTCTCCTGTTCCTCCAGTTCCACCGAGTAATCCAACCGCCCACGAAGCTGAAGCAAGTGTTCAACGGCAAGAGGAGAACCTCGAAGCACCCATGCATGACACGAGA
AGGACGAGACCCACGGGTTTCTCGCCGGCGATCGTGAACCAAGGTACTTCCAACTCTCAAACTCCTTCTTCCTTGGCAATGCCGGTCAGCTCGAGGGAGAATCCGAGTTC
GTCTACACCTAGGAGGTCCACGCGCGCCACTGCCGTTCGCCAAACCCAAAAACCCGCAACTCAACAGTTCAAGAAACGTTCGCGGGAGTGGTTTTCAGTGATCCGGCCGA
TGGGAGCTCAGAGACGGGCTGCTCTTGAAGAAGAAGAAAATAGCCAAGATGAAGAAGAAGCCGCCAAAGCAGCAGGAAGCTCTCGGCAAGAGGGAACTTCAACGGGTAAA
ACGTCTGAACCTCAAGCTAACCCCTCCTCGTCTTGCAGGAACAAACCATTTGTTACCTATAACGCAAGGAGGAGGAGTCCCAAGAAAGTTGTGCTCGAGAAAGCACTTGT
AATCGAGCCTCTCAAAGTAGCAAGAATGCCCCTGGACGTGTTCGAGGACATAATTTGCCAAGCTGTGGCAAAGGCTCTCGTGATTGCTGAAGGGTATAAGGCTGAACAAG
AAGCCTTGAAGGATATTGAGGTTGAGAGAGAAATGGAAAATCAACATATGAGGGAAGAAGATGAAGGTGCAAGAGAAAGAGATCTTGAAGAAGAAAGGAAAAAGAAAGAG
GAAAGGCAAGAGGCCGATAGGGCCTTAAAAGCTGAAAAAGAGAGGAAGTTAGATGAAGACCTCAGGAGGGTAGCTGCTGATTTGCAACTCCTCAAGGAAGAAAAACGAAG
AAGGGAAGAATTGAAAGAAAACGAAGAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAAATTTTGAGCCACTCCAAAAGGCTCAAAGTGAAGTTGAATTGCTGCGAGGAA
GAGAAGAAAAGGCCCAACAGGGGCCAAGTAAAAAGAACCAAGAAAAAGAAAAAGAAAGAGAAGTAGAGGATGAAGGCCAAAATGCGACCACATCTGGGCCGCATTCTGAA
GAAGGCCTAGCAGAGGCCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCACCTCATCGGGAGAGAAGAGAGA
GATCAAGGAATTGGATGACGACCAAGTTCCTATCTCTGCGGCATTGAGGAGAAAGAGAAGAAGAGAGATTAAAGCTGAAAGGAGCACCAAAAATAAAAATGACCTCATAT
TTGCCAAGAGGCCGAGGACTAGGTCCATGGATGCCTCTCCAGCAGTTCCTCCAACCGTCTCACCCGCCAAGCCAAAGGCCAAATCACCGAAGGCTGCATCTCCTAAAAAT
CCATTTCCCGAGTATAAGTGGCAGGAGTTATGTGCTCACCCTCAGGAGGCTGTCGTGCCTTTAGTTCGAGAATTTTACGCTGACCTGAGGGAGGAAAGTATCAGTACGGC
GGTGGTGAGAGGCAAAATGGTTAGCTTCTCTTCTGTTGACATTAACCGGGTGTATAGACTCAAAGCACCCCTGAATCCAAAAGGGAACAACGTTATCAGGAACCCCTCGG
CCAAGCAGAAGAAAGAAACACTTAAACTCGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAAACGAAAGTGAAGACTCTAGTGCCAAGCGATCTAAAGCCAGAATCG
GCAGTTTGGCTTCACTTTCTAAAGAACCATTTGATGCCAACCAACACAATCTCAGTGGATAGAGTGATGCTACTCTATTGCATTATGAAGGGGTTGGAGATCAATATTGG
GACACTACAACAAACAAGTGGGTTTTCTCCAAACTTACAAACTGTCGGCCAGCACTTCTTCGTGGGTTCCAGATCTCTGTTTTTGGATAACAAATCAGTGGTGTTCGTCA
CTTTTTTTTTGTCATTCAAATTAAATATACAGGCGTTAAGTAAGGCTCAGAAAGAAAGACGAGAGAGATGCTTGTATAATCATCATATCTCTCATAAGGGATATGCAAAT
CTTGCCAAAGACTTAGAATTGACAGACGATTCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATATTGCGATGAGGTCACTATAGCACG
TGTCAATCGAATTGTAAGCACGCTAGAACACAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGA
GTCAGCAACCACAAAGCGAAGCTTCGAGTGTCAAGACCGAAGCCCCTCGACGAAAGCAACCACAAAGCGACGCTTCAAGTGCCACGCATAAAAAGTCAAAAGGAAAAGAT
GTCGTTCGTGAGATACCTGAGAATAAAGAGGCTGGAACACCTTGTCACCTAGCGATGGGCTCTATGGATAACATTGTTGTCGTAGGCACAATGTACGAGTCGCCTTCACA
AAATGCAACCATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTAC
AAACGTTGCATCAAGCGATCGGTAATTTTGTGGGATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTGTCAAAGCTAAGCCCATAGTACAATCA
AGCAAACATACAGATGTCCATGTTACTATTAGGCTCTTAAATAGATACGCGATGCTTTCAATGCAACAAGAAGATACACTAACGATCAATATGCACGAGCGTATCTTGGG
AAAGGAAGCATCAATATTTTTAAATCGTGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTAA
Protein sequenceShow/hide protein sequence
MNKGCPLLDIDPEIERTFRHRRKEQRRKRREQQELSAQEPLEEASYIQEFPMEPPGVDPQVDPQNRGREQNGGRTSPVPPVPPSNPTAHEAEASVQRQEENLEAPMHDTR
RTRPTGFSPAIVNQGTSNSQTPSSLAMPVSSRENPSSSTPRRSTRATAVRQTQKPATQQFKKRSREWFSVIRPMGAQRRAALEEEENSQDEEEAAKAAGSSRQEGTSTGK
TSEPQANPSSSCRNKPFVTYNARRRSPKKVVLEKALVIEPLKVARMPLDVFEDIICQAVAKALVIAEGYKAEQEALKDIEVEREMENQHMREEDEGARERDLEEERKKKE
ERQEADRALKAEKERKLDEDLRRVAADLQLLKEEKRRREELKENEERRKEAEDFLANFEPLQKAQSEVELLRGREEKAQQGPSKKNQEKEKEREVEDEGQNATTSGPHSE
EGLAEATEVQPADEVFEPLFKDDPPAADSTSSGEKREIKELDDDQVPISAALRRKRRREIKAERSTKNKNDLIFAKRPRTRSMDASPAVPPTVSPAKPKAKSPKAASPKN
PFPEYKWQELCAHPQEAVVPLVREFYADLREESISTAVVRGKMVSFSSVDINRVYRLKAPLNPKGNNVIRNPSAKQKKETLKLVANKGVQWKESQTKVKTLVPSDLKPES
AVWLHFLKNHLMPTNTISVDRVMLLYCIMKGLEINIGTLQQTSGFSPNLQTVGQHFFVGSRSLFLDNKSVVFVTFFLSFKLNIQALSKAQKERRERCLYNHHISHKGYAN
LAKDLELTDDSSNRAILWKEARKGKNKEYCDEVTIARVNRIVSTLEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKKSKGKD
VVREIPENKEAGTPCHLAMGSMDNIVVVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPVKAKPIVQS
SKHTDVHVTIRLLNRYAMLSMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYIT