; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033199 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033199
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPeptide transporter family protein
Genome locationscaffold5:2018644..2022466
RNA-Seq ExpressionSpg033199
SyntenySpg033199
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580546.1 hypothetical protein SDJN03_20548, partial [Cucurbita argyrosperma subsp. sororia]6.6e-12681.44Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MA PI YSAIDDKDFDDAALWAVIDSAAAAA SSSSSS SRKSLALNC+NKSNPSPPPKFPKSPRTP+Q Q+NS V  EGEVVHEPWVFQPPRKI RTC 
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPA SG GL E +E+ RHSLSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

XP_004137774.1 uncharacterized protein LOC101205491 [Cucumis sativus]5.5e-12580.76Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MASP+ YSAIDDKDFDDAALWAVIDSAAAAAASSSSSSK RKSLALNCINKSNPSPPPKFPKSP+TPYQ QRNS V  EGEVVHEPWVFQPPRKI +T  
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        SEVS+SSPLAVVCNNALRTPPAPVYLSPEAYLSPQI SGSEGSP  S  G+N  +E++RH LSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF+KPNHD+PSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

XP_022934850.1 uncharacterized protein LOC111441889 [Cucurbita moschata]1.1e-12581.44Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MA PI YSAIDDKDFDDAALWAVIDSAAAAA SSSSSS SRKSLALNC+NKSNPSPPPKFPKSPRTP+Q Q+NS V  EGEVVHEPWVFQPPRKI RTC 
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPA SG GL E +E+ RHSLSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

XP_022983918.1 uncharacterized protein LOC111482396 [Cucurbita maxima]1.5e-12581.1Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MA PI YSAIDDKDFDDAALWAVIDSAAAAA SSSSSS SRKSLALNC+NKSNPSPPPKFPKSPRTP+Q Q+NS V  EG+VVHEPWVFQPPRKI RTC 
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPA SG GL E +E+ RHSLSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

XP_038903743.1 uncharacterized protein LOC120090255 [Benincasa hispida]5.9e-12782.47Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MASPI YSAIDDKDFDDAALWAVIDSAAAAA SSSSSSKSRKSLALNCINKSNPSPPPKFPKSP+TPYQ QRNS V  EGEVV EPWVFQPPRKI +TC 
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        S+VSE+SPLAVVCNNALRTPP PVYLSPEAYLSPQIASGSEGSPA S  G+NE +E+ RHSLSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

TrEMBL top hitse value%identityAlignment
A0A0A0LD66 Uncharacterized protein2.7e-12580.76Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MASP+ YSAIDDKDFDDAALWAVIDSAAAAAASSSSSSK RKSLALNCINKSNPSPPPKFPKSP+TPYQ QRNS V  EGEVVHEPWVFQPPRKI +T  
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        SEVS+SSPLAVVCNNALRTPPAPVYLSPEAYLSPQI SGSEGSP  S  G+N  +E++RH LSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF+KPNHD+PSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

A0A1S3B6Q7 uncharacterized protein LOC1034863903.5e-12581.1Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MASPI YS IDDKDFDDAALWAVIDSAAAAA+SSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQ QRNS V  EGEVV EPWVFQPPRKI +T  
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        +EVS+SSPLAVVCNNALRTPPAPVYLSPEAYLSPQI SGSEGSP  S  G+NE +E+++HSLSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHD+PSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

A0A6J1CWB4 uncharacterized protein LOC1110149946.0e-12583.33Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDS---AAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGR
        MASPI YSAIDDKDFDDAALWAVIDS   AAAAAASSSSSSKSRKSLA+N  +KSNPSPPP+FPKSPRTPYQ QRNS    EGEVVHEPWVFQPPRKI R
Subjt:  MASPINYSAIDDKDFDDAALWAVIDS---AAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGR

Query:  TCTSEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAVEYCILCPFVTTWTNR
        TC SEVSESSPLA+V NN LRTPPAPVYLSPEAYLSPQIASGSEGSPA SG GLNE KEITRHSLSG+FPSVSLFKEYQNAAMA+         + +T  
Subjt:  TCTSEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAVEYCILCPFVTTWTNR

Query:  CKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
             +  +GWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQ
Subjt:  CKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

A0A6J1F8X3 uncharacterized protein LOC1114418895.4e-12681.44Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MA PI YSAIDDKDFDDAALWAVIDSAAAAA SSSSSS SRKSLALNC+NKSNPSPPPKFPKSPRTP+Q Q+NS V  EGEVVHEPWVFQPPRKI RTC 
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPA SG GL E +E+ RHSLSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

A0A6J1J926 uncharacterized protein LOC1114823967.1e-12681.1Show/hide
Query:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT
        MA PI YSAIDDKDFDDAALWAVIDSAAAAA SSSSSS SRKSLALNC+NKSNPSPPPKFPKSPRTP+Q Q+NS V  EG+VVHEPWVFQPPRKI RTC 
Subjt:  MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCT

Query:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPA SG GL E +E+ RHSLSGQFPSVSLFKEYQNAAMA+    ++ ++   PF+   
Subjt:  SEVSESSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAV----EYCILC--PFVTTW

Query:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
                   +GWRKISFYFNLSFEIKDKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
Subjt:  TNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09430.1 unknown protein2.1e-6150.34Show/hide
Query:  SPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQ-PQRNSMVVAEGEVVHEPWVFQPPRKIGRT-CT
        S ++    D+KD DDA LWAVIDSAAAAA   + + KS K LA+   N ++P  P  +P       Q P RN  +   G  ++E      P K+ R+   
Subjt:  SPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQ-PQRNSMVVAEGEVVHEPWVFQPPRKIGRT-CT

Query:  SEVSESSPLAVVCNNALRTPP----APVYLSPEAYLSPQIASG---SEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAVEYCILCPFVTT
        SEV   +P+A+V      + P    +  + SPE+YLSP I      +E SP+ S C  N+     RHSLSG FPS +LFKEYQN AMA+         + 
Subjt:  SEVSESSPLAVVCNNALRTPP----APVYLSPEAYLSPQIASG---SEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAVEYCILCPFVTT

Query:  WTNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ
        +T    +  +  +GWRKISFYFN+S+EI+DKTIEFDENRNVQRAEF+VRA M GGRF DGWGSCERREK+F+KPNHDIPSTAETRAKN+ACQ
Subjt:  WTNRCKRGSLLYTGWRKISFYFNLSFEIKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCCCGATCAATTACTCTGCAATCGACGATAAAGATTTCGACGATGCGGCTTTGTGGGCCGTAATTGACTCCGCTGCTGCGGCTGCAGCTTCTTCCTCCTCTTC
CTCTAAATCTCGTAAGTCTCTAGCCCTTAATTGCATCAATAAATCAAACCCTTCCCCGCCGCCCAAATTCCCGAAAAGCCCTAGAACTCCGTACCAGCCGCAGAGGAATT
CTATGGTTGTTGCAGAGGGTGAAGTGGTGCACGAGCCTTGGGTGTTTCAACCTCCTCGGAAAATCGGGAGGACCTGTACATCGGAAGTGAGTGAGAGCAGTCCTCTTGCA
GTTGTCTGTAACAACGCGCTACGGACTCCCCCTGCGCCGGTGTATCTGTCTCCTGAAGCTTACTTGTCGCCGCAAATTGCTTCTGGTTCCGAAGGCTCTCCGGCTGGTAG
TGGATGTGGGCTGAACGAGGGGAAGGAAATTACAAGGCATAGCCTCTCTGGGCAGTTCCCTTCAGTTTCTCTCTTTAAGGAGTATCAAAATGCGGCAATGGCGGTCGAGT
ACTGTATTCTATGCCCTTTCGTTACAACATGGACGAATAGATGTAAAAGGGGCAGTTTACTTTATACAGGATGGAGGAAGATATCATTTTATTTCAATCTCTCTTTTGAA
ATTAAGGACAAGACAATTGAATTTGACGAGAACCGCAATGTCCAGCGTGCTGAGTTTGTTGTTCGAGCATATATGCAAGGTGGTAGATTTTGTGATGGATGGGGCTCGTG
TGAACGGCGAGAGAAGAGATTTATAAAACCAAATCATGATATTCCTAGCACAGCAGAAACCAGGGCAAAGAATAAGGCATGCCAATTCACCATTAAACTAAAGACCTGCT
TGGCATTGGAGAGTATCGACCTGGTGCATGCCAGGGCCAAAAGTAAAGACATGCCAGATTCATCCAGCCATAATATCTTACATGTATTGGAAGGATTTGGAGTTGGATTT
CAGGCCATTGGACGAAAAAGGAAAATGGAGCTCTGGATGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCCCGATCAATTACTCTGCAATCGACGATAAAGATTTCGACGATGCGGCTTTGTGGGCCGTAATTGACTCCGCTGCTGCGGCTGCAGCTTCTTCCTCCTCTTC
CTCTAAATCTCGTAAGTCTCTAGCCCTTAATTGCATCAATAAATCAAACCCTTCCCCGCCGCCCAAATTCCCGAAAAGCCCTAGAACTCCGTACCAGCCGCAGAGGAATT
CTATGGTTGTTGCAGAGGGTGAAGTGGTGCACGAGCCTTGGGTGTTTCAACCTCCTCGGAAAATCGGGAGGACCTGTACATCGGAAGTGAGTGAGAGCAGTCCTCTTGCA
GTTGTCTGTAACAACGCGCTACGGACTCCCCCTGCGCCGGTGTATCTGTCTCCTGAAGCTTACTTGTCGCCGCAAATTGCTTCTGGTTCCGAAGGCTCTCCGGCTGGTAG
TGGATGTGGGCTGAACGAGGGGAAGGAAATTACAAGGCATAGCCTCTCTGGGCAGTTCCCTTCAGTTTCTCTCTTTAAGGAGTATCAAAATGCGGCAATGGCGGTCGAGT
ACTGTATTCTATGCCCTTTCGTTACAACATGGACGAATAGATGTAAAAGGGGCAGTTTACTTTATACAGGATGGAGGAAGATATCATTTTATTTCAATCTCTCTTTTGAA
ATTAAGGACAAGACAATTGAATTTGACGAGAACCGCAATGTCCAGCGTGCTGAGTTTGTTGTTCGAGCATATATGCAAGGTGGTAGATTTTGTGATGGATGGGGCTCGTG
TGAACGGCGAGAGAAGAGATTTATAAAACCAAATCATGATATTCCTAGCACAGCAGAAACCAGGGCAAAGAATAAGGCATGCCAATTCACCATTAAACTAAAGACCTGCT
TGGCATTGGAGAGTATCGACCTGGTGCATGCCAGGGCCAAAAGTAAAGACATGCCAGATTCATCCAGCCATAATATCTTACATGTATTGGAAGGATTTGGAGTTGGATTT
CAGGCCATTGGACGAAAAAGGAAAATGGAGCTCTGGATGAAGTAA
Protein sequenceShow/hide protein sequence
MASPINYSAIDDKDFDDAALWAVIDSAAAAAASSSSSSKSRKSLALNCINKSNPSPPPKFPKSPRTPYQPQRNSMVVAEGEVVHEPWVFQPPRKIGRTCTSEVSESSPLA
VVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAGSGCGLNEGKEITRHSLSGQFPSVSLFKEYQNAAMAVEYCILCPFVTTWTNRCKRGSLLYTGWRKISFYFNLSFE
IKDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQFTIKLKTCLALESIDLVHARAKSKDMPDSSSHNILHVLEGFGVGF
QAIGRKRKMELWMK