CuGenDBv2

Lsi05G001420 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G001420
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: S4 RNA-binding domain-containing protein
Genome location: chr05:2080029..2085857
RNA-Seq Expression: Lsi05G001420
Synteny: Lsi05G001420
Gene Ontology terms: GO:0003723 - RNA binding (molecular function)
InterPro domains:
IPR002942 - RNA-binding S4 domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR017506 - Photosystem II S4
IPR036986 - RNA-binding S4 domain superfamily
IPR040591 - YlmH, putative RNA-binding domain
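
The genome location field can be split into chromosome, start and end for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); the function names parse_location and span_length are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the genomic span in base pairs.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("chr05:2080029..2085857")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 5829 bp on chr05, which includes any introns and untranslated regions.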


Homology
GenBank top hits | e value | % identity
XP_004146247.1 uncharacterized protein LOC101221944 [Cucumis sativus] | 3.1e-139 | 96.65
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDI+LQEE GAQVVIVPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDA+A
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWT ITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

XP_008456024.1 PREDICTED: putative RNA-binding protein YlmH isoform X2 [Cucumis melo] | 3.7e-140 | 97.03
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGH+DELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDII+QEE GAQVVIVPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDALA
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

XP_016901905.1 PREDICTED: putative RNA-binding protein YlmH isoform X1 [Cucumis melo] | 3.7e-140 | 97.03
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGH+DELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDII+QEE GAQVVIVPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDALA
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

XP_022142743.1 uncharacterized protein LOC111012786 isoform X1 [Momordica charantia] | 3.1e-139 | 95.54
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDIILQ E GAQVV+VPELVDFL+SSLRKVGNVTVSCTRIPLTALDYEPPKTKTF TIEASLRVDA+A
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWTP+TKNGT LKTGD++SVSGKGRLKIGEINSTKKGKF+VELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

XP_038899605.1 putative RNA-binding protein YlmH [Benincasa hispida] | 1.1e-139 | 96.65
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRR VLHTNFLTPPVVKES+LA+QKLADVKAIAQGGYPEAERCRISVGHADEL SDPDI+S
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNF FHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVV+VPELVDFL SSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

TrEMBL top hits | e value | % identity
A0A0A0LC56 S4 RNA-binding domain-containing protein | 1.5e-139 | 96.65
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDI+LQEE GAQVVIVPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDA+A
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWT ITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

A0A1S3C3I2 putative RNA-binding protein YlmH isoform X2 | 1.8e-140 | 97.03
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGH+DELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDII+QEE GAQVVIVPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDALA
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

A0A1S4E1P4 putative RNA-binding protein YlmH isoform X1 | 1.8e-140 | 97.03
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGH+DELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDII+QEE GAQVVIVPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDALA
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

A0A6J1CLT4 uncharacterized protein LOC111012786 isoform X2 | 1.1e-137 | 95.17
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDIILQ E GAQVV+VPELVDFL+SSLRKVGNVTVSCTRIPLTALDYEPPKTKTF TIEASLRVDA+A
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLI SGDVRVNWTP+TKNGT LKTGD++SVSGKGRLKIGEINSTKKGKF+VELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

A0A6J1CP18 uncharacterized protein LOC111012786 isoform X1 | 1.5e-139 | 95.54
Query:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
        ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS
Subjt:  ICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIIS

Query:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA
        ALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDIILQ E GAQVV+VPELVDFL+SSLRKVGNVTVSCTRIPLTALDYEPPKTKTF TIEASLRVDA+A
Subjt:  ALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALA

Query:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        SAGFKISRSKLVDLISSGDVRVNWTP+TKNGT LKTGD++SVSGKGRLKIGEINSTKKGKF+VELIRYV
Subjt:  SAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV

SwissProt top hits | e value | % identity
P71020 Putative RNA-binding protein YlmH | 3.4e-19 | 30.67
Query:  TNFLTPPVVKESML--AIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDI-ISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGA
        T+FL P   +E ++  A+   ADV     GGY  AER R  +        + D  + A ++     F    H   LGA++G G+ R+K GDI+   E   
Subjt:  TNFLTPPVVKESML--AIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDI-ISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGA

Query:  QVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVS
        Q+++  +  DF+ + L + G   VS  +I L+ L+      +      +SLR+DA+ ++  + SR K   L+ +G V+VNW  +     I+  GD++S+ 
Subjt:  QVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVS

Query:  GKGRLKIGEI-NSTKKGKFAVELIR
        G GR  + +I   TKK K+ V   R
Subjt:  GKGRLKIGEI-NSTKKGKFAVELIR

Arabidopsis top hits | e value | % identity
AT1G53120.1 RNA-binding S4 domain-containing protein | 2.7e-117 | 74.91
Query:  MRICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDI
        +R C   +A+KGD+D LL GVGD+ V  +VK IL MA+R+ S+REVLHT+FLTPP+VKES+  ++K ADVK +AQGGYPEAERCRIS+GH D LTSDPDI
Subjt:  MRICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDI

Query:  ISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDA
        ++ALSITGNF F PCSHGDFLGAILGTGI+REKLGDI++QEEKGAQV+IVPELVDF+V++L KVGNV V+C++IPL AL+YEPP+T +FKT+EASLR+DA
Subjt:  ISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDA

Query:  LASAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
        +ASAGFKISRSKLVDLISS DVRVNW  +TKNGTI+KTGD++SVSGKGRLKIGEIN TKKGKFAVE+IRY+
Subjt:  LASAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
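
The % identity column can be related to the alignment blocks by counting positions on the match line (the unlabelled line between Query and Subjt). The following is a minimal sketch, assuming the usual BLAST convention that a letter marks an identical residue, "+" a conservative substitution and a space a mismatch or gap; the function names are hypothetical, and the database may compute its reported figure over a different alignment length.

```python
def identity_from_midline(midline: str) -> tuple[int, int]:
    """Return (identical positions, alignment length) for one match line.

    Assumption: the match line is already trimmed to the alignment columns,
    so it has the same length as the Query segment above it.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(midline)


def percent_identity(midlines: list[str]) -> float:
    """Percent identity pooled over all blocks of one alignment."""
    identical = 0
    length = 0
    for line in midlines:
        block_identical, block_length = identity_from_midline(line)
        identical += block_identical
        length += block_length
    return 100.0 * identical / length


# Last block of the first GenBank hit (XP_004146247.1), copied from above.
last_block = "SAGFKISRSKLVDLISSGDVRVNWT ITKNGTILKTGDI+SVSGKGRLKIGEINSTKKGKFAVELIRYV"
print(identity_from_midline(last_block))
```

Passing all blocks of one hit to percent_identity gives a single pooled value that can be compared with the tabulated figure.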


Sequences
CDS sequence
ATGAGAATATGCCAATTAGTGCAAGCTGTAAAGGGAGATATTGATGTTTTACTCAACGGAGTTGGAGATAAGGGTGTTATTGTAGACGTGAAACATATTCTTGTG
ATGGCCAAACGTTCATTATCAAGACGAGAAGTTCTCCATACAAACTTTCTCACCCCACCTGTGGTGAAAGAGTCAATGCTAGCTATACAAAAACTAGCTGACGTG
AAAGCAATAGCTCAGGGAGGATACCCAGAGGCAGAACGCTGCCGGATTTCTGTTGGGCATGCAGATGAACTAACAAGTGATCCAGACATAATTTCAGCATTGAGT
ATCACAGGAAATTTTACGTTTCACCCTTGCTCACATGGGGACTTCCTTGGAGCAATTCTTGGAACAGGCATTGCCAGGGAAAAGCTTGGTGATATCATACTCCAG
GAAGAAAAGGGAGCTCAAGTAGTCATTGTTCCAGAACTTGTTGACTTCCTGGTATCATCACTGCGCAAGGTTGGCAATGTCACGGTTTCTTGTACGAGGATTCCA
TTGACAGCTCTTGATTATGAACCACCGAAGACTAAGACATTTAAAACCATTGAGGCATCTCTTAGGGTGGATGCTCTAGCAAGTGCTGGATTTAAGATTTCAAGA
TCTAAACTAGTGGATTTAATCAGTAGCGGCGATGTTCGTGTCAATTGGACGCCAATTACAAAAAATGGAACCATTTTAAAGACTGGTGATATCATTTCTGTTAGT
GGAAAAGGGAGACTAAAGATTGGAGAAATAAATTCAACAAAAAAGGGAAAATTTGCGGTTGAGCTTATCAGGTATGTGTAA
mRNA sequence
ATGAGAATATGCCAATTAGTGCAAGCTGTAAAGGGAGATATTGATGTTTTACTCAACGGAGTTGGAGATAAGGGTGTTATTGTAGACGTGAAACATATTCTTGTG
ATGGCCAAACGTTCATTATCAAGACGAGAAGTTCTCCATACAAACTTTCTCACCCCACCTGTGGTGAAAGAGTCAATGCTAGCTATACAAAAACTAGCTGACGTG
AAAGCAATAGCTCAGGGAGGATACCCAGAGGCAGAACGCTGCCGGATTTCTGTTGGGCATGCAGATGAACTAACAAGTGATCCAGACATAATTTCAGCATTGAGT
ATCACAGGAAATTTTACGTTTCACCCTTGCTCACATGGGGACTTCCTTGGAGCAATTCTTGGAACAGGCATTGCCAGGGAAAAGCTTGGTGATATCATACTCCAG
GAAGAAAAGGGAGCTCAAGTAGTCATTGTTCCAGAACTTGTTGACTTCCTGGTATCATCACTGCGCAAGGTTGGCAATGTCACGGTTTCTTGTACGAGGATTCCA
TTGACAGCTCTTGATTATGAACCACCGAAGACTAAGACATTTAAAACCATTGAGGCATCTCTTAGGGTGGATGCTCTAGCAAGTGCTGGATTTAAGATTTCAAGA
TCTAAACTAGTGGATTTAATCAGTAGCGGCGATGTTCGTGTCAATTGGACGCCAATTACAAAAAATGGAACCATTTTAAAGACTGGTGATATCATTTCTGTTAGT
GGAAAAGGGAGACTAAAGATTGGAGAAATAAATTCAACAAAAAAGGGAAAATTTGCGGTTGAGCTTATCAGGTATGTGTAAGCTCCGGCGATATTGGATCAGTTG
GAGGTGAACTGGGTAAGATGTCACGGAACCTCTTTGTGGGGTGAAATCAGGGTTGCTTCCTTGGGAATAATAAAAGAATCTATTGGTTGAGAGAAGTTCATACAA
AAAGGGAGATCAAGATATCTCTTCAACAGCTAAAAGATTTCTGAGAAGGAAGTGATTGTCTGGAACCCGTGATTATCAGGCAGTGGACCGTTTCCTGGTCGGATT
GAGATCGCCACCTGCAGCCCTGAAACGTTGTGGATTGCATTCGAAGAACTTCCTGACAGACATCTGTGATGAACAATATCTCTGATGGTGTATCAACTCCAACAG
ACTCTGAAATCTCCACGTAACTGCTTATTTCTCTTTTGTTCCTGAGATACAAGAGCAGTTCCACCATCATTTTTGTGATATTGAGTATGGTTAAAACATGGAAAA
CAAAAAGAGCAGCAATTAGTTACCCCACAGCAGTGTCAAAGAATCCTGATAGAAATTTTGTTCAAATATCACCAAACTTCCAAATGAGAATGAAGGCAAATCAAT
CACAAGGCATCTATTATCAGACAGGCCTGTCATGTACCAACTCAAGTATAAAATCATCAGTCTCATTTAATACGAGTTTAGTTTTCCCTAGATAGATTCCATTGT
GAGGAAAATTGTTTACACATCTGCAAAAATTTTGCTTCAAAGTGATTTAAGTAAGCTTGAGTTTAGTCTTCTCTACATACCAATGCGATTTGAAGGACTCAAAAG
GACATCAATTCAAATCTTGGATCATTGAATTGGAAGAAATCAAGAAAGATCGTTTTGATCTTATTGGAAATTACTCCTCACTTGGGTCAATAGTTTGTTGACAAA
GGTTGGGTCATTCTTCCATTAAAGCAAGATCTGAGTTGGGTTCTAAACATCTAGAGATAGTTTGGTCTAAATCTGGTGTGGGAGTGCGAGGATTAGGAGGACCTT
TCCCCATTTAATTTAAAATAGCAACTTGGACAGGTTTTAACAAGAATCAAGAGGGGGGATGGAATTACCTTAATGCCCGAAGCATGCCATCTCTCGAGAGCCTCT
GGCACGTCGTCAAAAACTACTCCCTCCAATTCATTGCCTGAAAATCCTGTTCTCCATATGTGACCCTGCAAAGGTAGAAGTCAGCTCACTCAGAAAATCTGCCAA
ATGAATTGCAGTAAGATAATTAGAGAATTATTTTACTTGCAATTGTTTCAAGGCAGTTATCTTGCGGTCAGCTTTTATCATTGCTTCAACATTAGCAACCAATGC
TGCAATGACTTCCTCTTTCCCAGAATTATCAGGAGGAATAGGCACAGCTCCAGCAACTCCTTTTTCCAAGTCTTCTTCAACCTGAACATATTTTTGTTGATAAAC
AGAAAATTCAGAAAACAACTCCCAAATAGCCTCTCAACTCCCGTAGCACACAGGCTTCTTTCCGGTTTAATTAAGACCAGCAATGCACAAGTAAATACTGTAAAC
AGAATAGACATTCTACATATTGCAACATAATAAAACAATCTTACTTGTGAACGC
Protein sequence
MRICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIISALS
ITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISR
SKLVDLISSGDVRVNWTPITKNGTILKTGDIISVSGKGRLKIGEINSTKKGKFAVELIRYV
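
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the translate function and the codon table built here are illustrative and do not come from the database. Only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order per position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 12 codons and last 7 codons of the CDS listed above.
print(translate("ATGAGAATATGCCAATTAGTGCAAGCTGTAAAGGGA"))
print(translate("GAGCTTATCAGGTATGTGTAA"))
```

The two fragments translate to the start (MRICQLVQAVKG) and end (ELIRYV) of the listed protein sequence; pasting the full CDS into translate performs the same check over the whole gene.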