; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G013400 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G013400
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionC2H2-type domain-containing protein
Genome locationchr07:19575386..19576258
RNA-Seq ExpressionLsi07G013400
SyntenyLsi07G013400
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR021139 - NYN domain, limkain-b1-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577601.1 hypothetical protein SDJN03_25175, partial [Cucurbita argyrosperma subsp. sororia]4.6e-13286.76Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE+RRERKMLNQLE KGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTPK GYGLADELKRAGFFVKTVSDKPEAADVELRNDMVE+MDR+RAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                             GAWWDL SDAETE VSS S K
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK

TYK19480.1 Zinc finger family protein [Cucumis melo var. makuwa]9.2e-13385.22Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE++RERKMLNQLE KGVIKSIEPYLCRVCGRNFY  EKLVNHFKQIHESEHKKRLNQIESA+GSRRVKL+AKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTP+VGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDR+RAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                                 GAWWDLSSDAET+TVSSPSWK
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK

XP_008448905.1 PREDICTED: uncharacterized protein LOC103490929 [Cucumis melo]9.2e-13385.22Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE++RERKMLNQLE KGVIKSIEPYLCRVCGRNFY  EKLVNHFKQIHESEHKKRLNQIESA+GSRRVKL+AKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTP+VGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDR+RAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                                 GAWWDLSSDAET+TVSSPSWK
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK

XP_022965083.1 uncharacterized protein LOC111465052 [Cucurbita maxima]2.7e-13286.76Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE+RRERKMLNQLE KGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTPK GYGLADELKRAGFFVKTVSDKPEAADVELRNDMVE+MDR+RAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                             GAWWDL+SDAETE VSS S K
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK

XP_038876815.1 uncharacterized protein LOC120069203 [Benincasa hispida]1.1e-13386.21Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE+RRERKMLNQLE KGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDR+RAECLVLVSDDSDFVNVL EAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK--------------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKK+AVSV GKWKDRDVLKRLEWTY PQLEK                                GAWWDL+SDAETETVSSPSWK
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK--------------------------------GAWWDLSSDAETETVSSPSWK

TrEMBL top hitse value%identityAlignment
A0A0A0L708 C2H2-type domain-containing protein5.5e-13184.54Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE++RERKMLNQLE KGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKL+AKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVL P+VGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDR++AECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSW+
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNP LEK                                 GAWWDLSSDAET+TVSSPSW+
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK

A0A1S3BKT2 uncharacterized protein LOC1034909294.5e-13385.22Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE++RERKMLNQLE KGVIKSIEPYLCRVCGRNFY  EKLVNHFKQIHESEHKKRLNQIESA+GSRRVKL+AKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTP+VGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDR+RAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                                 GAWWDLSSDAET+TVSSPSWK
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK

A0A5D3D7P9 Zinc finger family protein4.5e-13385.22Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE++RERKMLNQLE KGVIKSIEPYLCRVCGRNFY  EKLVNHFKQIHESEHKKRLNQIESA+GSRRVKL+AKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTP+VGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDR+RAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                                 GAWWDLSSDAET+TVSSPSWK
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK---------------------------------GAWWDLSSDAETETVSSPSWK

A0A6J1E6T9 uncharacterized protein LOC1114311363.2e-13186.06Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE+RRERKMLNQLE KGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEH KRLNQIESAKGSRRVKLLAKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
         ARDVLTPK GYGLADELKRAGFFVKTVSDKPEAADVELRNDMVE+MDR+RAECLVLVS+DSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                             GAWWDLSSDAETE VSS S K
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK

A0A6J1HMT7 uncharacterized protein LOC1114650521.3e-13286.76Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        MVAYANRHAFSYVPQVVRE+RRERKMLNQLE KGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AARDVLTPK GYGLADELKRAGFFVKTVSDKPEAADVELRNDMVE+MDR+RAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK
        EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK                             GAWWDL+SDAETE VSS S K
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK-----------------------------GAWWDLSSDAETETVSSPSWK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G12240.1 zinc finger (C2H2 type) family protein3.2e-9967.05Show/hide
Query:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN
        M+AYANRHAFSYVP  VREQR++RK+LN+LE+KG++K  EPY C VC R FYTNEKL+NHFKQIHE+E++KR+ QIES+KG RRV+L+AKYSMKI+KYK 
Subjt:  MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKN

Query:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK
        AAR+VLTPK GYGLADELKRAGF+VK VSDKP+AAD  L+  +VE+MD++  EC+VLVSDDSDF  +L EAK RCLRTVV+GD N+G LKR AD  +SWK
Subjt:  AARDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWK

Query:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK------GAW---WDLSSDAETETVSSP
        E+ MGKAKKE   VVGKWKDRDVLK+LEWTY+P LEK      G W   +D   + E  T   P
Subjt:  EILMGKAKKEAVSVVGKWKDRDVLKRLEWTYNPQLEK------GAW---WDLSSDAETETVSSP

AT5G52010.1 C2H2-like zinc finger protein3.1e-4648.53Show/hide
Query:  AYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKNAA
        AYANRHAF ++P  V E+RRER+ L+ +E KG +  I+PY+C VCGR   TN  L  HFKQ+HE E +K++N++ S KG +R K   +Y    +KY  AA
Subjt:  AYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKNAA

Query:  RDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWKEI
        R +LTPKVGYGL  EL+RAG +VKTV DKP+AAD  ++  +   M R   + LVLVSDD DF ++L++A+   L T+VV D+ D  L R+AD    W  +
Subjt:  RDVLTPKVGYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWKEI

Query:  LMGK
          G+
Subjt:  LMGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCGTATGCGAATCGGCATGCGTTTAGCTATGTACCGCAAGTTGTTAGAGAGCAGAGAAGAGAGAGGAAGATGCTTAATCAATTGGAGAGTAAGGGTGTGATTAA
ATCGATTGAGCCGTATCTGTGTCGTGTTTGTGGGAGGAATTTTTATACGAATGAGAAGTTAGTGAATCATTTTAAGCAAATTCATGAGAGTGAGCATAAGAAGAGGTTGA
ATCAGATTGAATCTGCGAAGGGTAGTAGAAGAGTGAAGTTGCTCGCCAAGTATTCGATGAAAATACAAAAGTATAAGAATGCTGCTAGGGATGTTTTGACTCCCAAAGTG
GGATATGGTTTAGCAGATGAGTTGAAGAGGGCAGGGTTTTTTGTGAAGACTGTGTCGGATAAGCCTGAAGCTGCTGATGTAGAATTGAGAAACGACATGGTTGAGATTAT
GGATAGGAAAAGAGCAGAGTGCTTGGTTCTTGTATCAGATGATTCTGATTTTGTGAATGTTTTGAAGGAGGCCAAGTTGAGATGTCTCAGGACGGTTGTTGTAGGGGATT
TAAATGATGGGCCATTGAAGAGAAATGCTGATACTGGGTTTTCTTGGAAGGAGATTTTAATGGGGAAGGCTAAAAAGGAGGCTGTGTCTGTTGTGGGAAAATGGAAGGAT
CGGGATGTTTTGAAGAGATTGGAATGGACATACAATCCTCAGTTGGAGAAGGGTGCTTGGTGGGATCTCAGCTCTGATGCTGAAACTGAAACTGTTTCGTCACCATCATG
GAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCGTATGCGAATCGGCATGCGTTTAGCTATGTACCGCAAGTTGTTAGAGAGCAGAGAAGAGAGAGGAAGATGCTTAATCAATTGGAGAGTAAGGGTGTGATTAA
ATCGATTGAGCCGTATCTGTGTCGTGTTTGTGGGAGGAATTTTTATACGAATGAGAAGTTAGTGAATCATTTTAAGCAAATTCATGAGAGTGAGCATAAGAAGAGGTTGA
ATCAGATTGAATCTGCGAAGGGTAGTAGAAGAGTGAAGTTGCTCGCCAAGTATTCGATGAAAATACAAAAGTATAAGAATGCTGCTAGGGATGTTTTGACTCCCAAAGTG
GGATATGGTTTAGCAGATGAGTTGAAGAGGGCAGGGTTTTTTGTGAAGACTGTGTCGGATAAGCCTGAAGCTGCTGATGTAGAATTGAGAAACGACATGGTTGAGATTAT
GGATAGGAAAAGAGCAGAGTGCTTGGTTCTTGTATCAGATGATTCTGATTTTGTGAATGTTTTGAAGGAGGCCAAGTTGAGATGTCTCAGGACGGTTGTTGTAGGGGATT
TAAATGATGGGCCATTGAAGAGAAATGCTGATACTGGGTTTTCTTGGAAGGAGATTTTAATGGGGAAGGCTAAAAAGGAGGCTGTGTCTGTTGTGGGAAAATGGAAGGAT
CGGGATGTTTTGAAGAGATTGGAATGGACATACAATCCTCAGTTGGAGAAGGGTGCTTGGTGGGATCTCAGCTCTGATGCTGAAACTGAAACTGTTTCGTCACCATCATG
GAAATGA
Protein sequenceShow/hide protein sequence
MVAYANRHAFSYVPQVVREQRRERKMLNQLESKGVIKSIEPYLCRVCGRNFYTNEKLVNHFKQIHESEHKKRLNQIESAKGSRRVKLLAKYSMKIQKYKNAARDVLTPKV
GYGLADELKRAGFFVKTVSDKPEAADVELRNDMVEIMDRKRAECLVLVSDDSDFVNVLKEAKLRCLRTVVVGDLNDGPLKRNADTGFSWKEILMGKAKKEAVSVVGKWKD
RDVLKRLEWTYNPQLEKGAWWDLSSDAETETVSSPSWK