CuGenDBv2

Lsi03G021060 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G021060
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Peptide transporter family protein
Genome location: chr03:32293389..32296624
RNA-Seq Expression: Lsi03G021060
Synteny: Lsi03G021060
Gene Ontology terms: NA
InterPro domains: NA
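
The genome location field can be split into chromosome, start, and end with a short script. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr03:32293389..32296624")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)  # chr03 3236
```

Under that assumption the gene spans 3236 bp on chr03, which is longer than the mRNA listed further down, as expected if the locus contains introns.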


Homology
GenBank top hits (e value, % identity, alignment)
KAG6580546.1 hypothetical protein SDJN03_20548, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.8e-129; % identity: 83.11)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MA PITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSS SRKSLALNC+NKSNPSPPPKFPKSP+TP+QAQ+NSRVFVEGEVVHEPW+FQ PRKI +TCA
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPAFS SGL EER+VARHSLSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGA QGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
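
In BLAST-style alignments such as the one above, the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities from that middle line. It assumes the query and middle lines are column-aligned and of equal length; the helper name `midline_identity` is hypothetical. The % identity reported for each hit is computed over the whole alignment, so a short excerpt will not reproduce it.

```python
def midline_identity(query, midline):
    """Percent of alignment columns whose midline character is a residue letter."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # Letters mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First 20 columns of the first GenBank alignment on this page.
query = "MASPITYSAIDDKDFDDAAL"
midline = "MA PITYSAIDDKDFDDAAL"
print(round(midline_identity(query, midline), 1))  # 95.0
```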

XP_004137774.1 uncharacterized protein LOC101205491 [Cucumis sativus] (e value: 4.0e-129; % identity: 82.43)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MASP+TYSAIDDKDFDDAALWAVIDSAAAAA SSSSSSK RKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVF+EGEVVHEPW+FQ PRKI KT A
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        SEVS++SPLAVVCNNALRTPPAPVYLSPEAYLSPQI SGSEGSP  SRSG+N ER+++RH LSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF+KPNHD+PSTAETRAKNKACQDLLGIGEYRPGACQGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

XP_008442555.1 PREDICTED: uncharacterized protein LOC103486390 [Cucumis melo] (e value: 8.8e-129; % identity: 82.43)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MASPITYS IDDKDFDDAALWAVIDSAAAAA+SSSSSSKSRKSLALNCINKSNPSPPPKFPKSP+TPYQAQRNSRVF+EGEVV EPW+FQ PRKI KT A
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        +EVS++SPLAVVCNNALRTPPAPVYLSPEAYLSPQI SGSEGSP  SRSG+NEER++++HSLSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHD+PSTAETRAKNKACQDLLGIGEYRPGACQGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

XP_022983918.1 uncharacterized protein LOC111482396 [Cucurbita maxima] (e value: 2.0e-128; % identity: 82.77)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MA PITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSS SRKSLALNC+NKSNPSPPPKFPKSP+TP+QAQ+NSRVFVEG+VVHEPW+FQ PRKI +TCA
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPAFS SGL EER+VARHSLSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGA QGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

XP_038903743.1 uncharacterized protein LOC120090255 [Benincasa hispida] (e value: 1.2e-133; % identity: 85.81)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MASPI YSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVV EPW+FQ PRKI+KTCA
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        S+VSENSPLAVVCNNALRTPP PVYLSPEAYLSPQIASGSEGSPAFSRSG+NEER++ARHSLSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQ QK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LD66 Uncharacterized protein (e value: 1.9e-129; % identity: 82.43)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MASP+TYSAIDDKDFDDAALWAVIDSAAAAA SSSSSSK RKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVF+EGEVVHEPW+FQ PRKI KT A
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        SEVS++SPLAVVCNNALRTPPAPVYLSPEAYLSPQI SGSEGSP  SRSG+N ER+++RH LSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF+KPNHD+PSTAETRAKNKACQDLLGIGEYRPGACQGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

A0A1S3B6Q7 uncharacterized protein LOC103486390 (e value: 4.3e-129; % identity: 82.43)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MASPITYS IDDKDFDDAALWAVIDSAAAAA+SSSSSSKSRKSLALNCINKSNPSPPPKFPKSP+TPYQAQRNSRVF+EGEVV EPW+FQ PRKI KT A
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        +EVS++SPLAVVCNNALRTPPAPVYLSPEAYLSPQI SGSEGSP  SRSG+NEER++++HSLSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHD+PSTAETRAKNKACQDLLGIGEYRPGACQGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

A0A6J1CWB4 uncharacterized protein LOC111014994 (e value: 3.5e-123; % identity: 79.93)
Query:  MASPITYSAIDDKDFDDAALWAVIDS---AAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMK
        MASPITYSAIDDKDFDDAALWAVIDS   AAAAA SSSSSSKSRKSLA+N  +KSNPSPPP+FPKSP+TPYQAQRNSR F EGEVVHEPW+FQ PRKI +
Subjt:  MASPITYSAIDDKDFDDAALWAVIDS---AAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMK

Query:  TCASEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----I
        TC SEVSE+SPLA+V NN LRTPPAPVYLSPEAYLSPQIASGSEGSPA S SGLNEE+++ RHSLSG+FPSVSLFKEYQNAAMA+    D   +     I
Subjt:  TCASEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----I

Query:  YIVGIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
           G ++ S         +DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
Subjt:  YIVGIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

A0A6J1F8X3 uncharacterized protein LOC111441889 (e value: 1.6e-128; % identity: 82.77)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MA PI YSAIDDKDFDDAALWAVIDSAAAAATSSSSSS SRKSLALNC+NKSNPSPPPKFPKSP+TP+QAQ+NSRVFVEGEVVHEPW+FQ PRKI +TCA
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPAFS SGL EER+VARHSLSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGA QGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

A0A6J1J926 uncharacterized protein LOC111482396 (e value: 9.5e-129; % identity: 82.77)
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA
        MA PITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSS SRKSLALNC+NKSNPSPPPKFPKSP+TP+QAQ+NSRVFVEG+VVHEPW+FQ PRKI +TCA
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCA

Query:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV
        SE+SE SPLAVVCNNALR PPAPVYLSPEAYLSPQIAS SEGSPAFS SGL EER+VARHSLSGQFPSVSLFKEYQNAAMA+    D   +     I   
Subjt:  SEVSENSPLAVVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV----DVFCVR----IYIV

Query:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
        G ++ S         +DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGA QGQK
Subjt:  GIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G09430.1 unknown protein (e value: 1.8e-55; % identity: 48.81)
Query:  SPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQ-AQRNSRVFVEGEVVHEPWMFQHPRKIMKT-CA
        S ++    D+KD DDA LWAVIDSAAAAAT    + KS K LA+   N ++P  P  +P       Q   RN  +   G  ++E      P K+ ++   
Subjt:  SPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQ-AQRNSRVFVEGEVVHEPWMFQHPRKIMKT-CA

Query:  SEVSENSPLAVVCNNALRTPP----APVYLSPEAYLSPQIASG---SEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV------DVFCV
        SEV   +P+A+V      + P    +  + SPE+YLSP I      +E SP+ S    N+  +  RHSLSG FPS +LFKEYQN AMA+       +   
Subjt:  SEVSENSPLAVVCNNALRTPP----APVYLSPEAYLSPQIASG---SEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAV------DVFCV

Query:  RIYI--VGIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYR
        + YI   G ++ S          DKTIEFDENRNVQRAEF+VRA M GGRF DGWGSCERREK+F+KPNHDIPSTAETRAKN+ACQDLLGIGEYR
Subjt:  RIYI--VGIQECS-------SQEDKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYR


Sequences
CDS sequence
ATGGCGTCTCCAATCACGTACTCTGCAATCGACGACAAAGATTTCGACGATGCGGCTTTGTGGGCTGTAATCGACTCTGCTGCTGCAGCTGCGACTTCTTCCTCCTCGTC
CTCTAAATCTCGTAAGTCTCTAGCACTTAATTGCATCAATAAATCAAACCCTTCCCCGCCACCAAAATTCCCAAAAAGCCCTAAAACTCCATACCAGGCGCAGAGAAATT
CTAGGGTTTTTGTAGAGGGTGAAGTGGTGCACGAGCCTTGGATGTTTCAACATCCTCGGAAAATTATGAAGACATGTGCATCGGAAGTCAGTGAGAACAGTCCTCTTGCG
GTCGTCTGTAACAACGCGCTACGGACTCCGCCTGCTCCGGTATATTTGTCTCCTGAAGCGTACTTGTCGCCGCAGATTGCTTCTGGTTCTGAAGGTTCTCCGGCTTTTAG
TAGAAGTGGGCTAAACGAGGAAAGGGATGTTGCAAGGCATAGCCTCTCTGGGCAGTTTCCTTCAGTCTCCCTCTTTAAGGAGTATCAAAATGCGGCAATGGCGGTAGATG
TGTTTTGTGTTAGAATATACATTGTGGGAATCCAAGAATGTAGCTCCCAAGAGGACAAGACGATTGAGTTTGACGAGAACCGCAATGTCCAGCGTGCTGAATTCGTTGTT
CGAGCGTATATGCAAGGTGGTAGATTTTGTGATGGATGGGGCTCATGTGAACGACGTGAGAAGAGATTTATAAAGCCAAATCATGATATTCCTAGCACAGCAGAAACCAG
GGCAAAGAATAAGGCATGTCAAGACCTGCTTGGCATTGGAGAGTATCGACCTGGTGCATGCCAGGGGCAAAAGTAA
mRNA sequence
TTTATTTCGGGGTAGTGATCGCTTCAGATCGATCAGATGGAGCCGGAGACGGCGAATGAGCCTTCTCCGATGGTTTCAGCGTTTTGATTTCTGTATGTTTTTTTCTTCTT
GAATAAGCGAATCTCTCATTGAAGTCGCAGTTACGAAGTTCAAGCTTTCACTACAATGGCGTCTCCAATCACGTACTCTGCAATCGACGACAAAGATTTCGACGATGCGG
CTTTGTGGGCTGTAATCGACTCTGCTGCTGCAGCTGCGACTTCTTCCTCCTCGTCCTCTAAATCTCGTAAGTCTCTAGCACTTAATTGCATCAATAAATCAAACCCTTCC
CCGCCACCAAAATTCCCAAAAAGCCCTAAAACTCCATACCAGGCGCAGAGAAATTCTAGGGTTTTTGTAGAGGGTGAAGTGGTGCACGAGCCTTGGATGTTTCAACATCC
TCGGAAAATTATGAAGACATGTGCATCGGAAGTCAGTGAGAACAGTCCTCTTGCGGTCGTCTGTAACAACGCGCTACGGACTCCGCCTGCTCCGGTATATTTGTCTCCTG
AAGCGTACTTGTCGCCGCAGATTGCTTCTGGTTCTGAAGGTTCTCCGGCTTTTAGTAGAAGTGGGCTAAACGAGGAAAGGGATGTTGCAAGGCATAGCCTCTCTGGGCAG
TTTCCTTCAGTCTCCCTCTTTAAGGAGTATCAAAATGCGGCAATGGCGGTAGATGTGTTTTGTGTTAGAATATACATTGTGGGAATCCAAGAATGTAGCTCCCAAGAGGA
CAAGACGATTGAGTTTGACGAGAACCGCAATGTCCAGCGTGCTGAATTCGTTGTTCGAGCGTATATGCAAGGTGGTAGATTTTGTGATGGATGGGGCTCATGTGAACGAC
GTGAGAAGAGATTTATAAAGCCAAATCATGATATTCCTAGCACAGCAGAAACCAGGGCAAAGAATAAGGCATGTCAAGACCTGCTTGGCATTGGAGAGTATCGACCTGGT
GCATGCCAGGGGCAAAAGTAAAGACTTGCCAGATTCATCCAACCCACATAACTAATCTAACTTGTATTGGAAGGAGTTTGAAGGTGGCAGTCTTTAACAACAACAACAAA
CATAACTACAAAAGTTCATTGAATGTAAGTTGTAACCAAACTGAGATCTCAAGGAACAACAGTTAGTAATGGAGAAACACCATCTACCAGAGAGAAGTTTATGGCTTCTT
GTAATGTTGTAGGCATTTTCTTATATTTGGTAAAACTTTCATATCTTTTTATAGAAAGTGATTTAGTTGGTTGCACCTTTTCTTATGTACTGGCATTACAATATGTAATA
ATGTATGATGTTGGTCCCACTCCCCTAGTTTGGAGGATCTTTTGTTTGTTGAGAAGTCACAATTACTTTCTTGGATGCCTGCTATGGAGCACTTTATTATTCCTCAGTCG
GTAGAGGCCATTGGTTTAGTA
Protein sequence
MASPITYSAIDDKDFDDAALWAVIDSAAAAATSSSSSSKSRKSLALNCINKSNPSPPPKFPKSPKTPYQAQRNSRVFVEGEVVHEPWMFQHPRKIMKTCASEVSENSPLA
VVCNNALRTPPAPVYLSPEAYLSPQIASGSEGSPAFSRSGLNEERDVARHSLSGQFPSVSLFKEYQNAAMAVDVFCVRIYIVGIQECSSQEDKTIEFDENRNVQRAEFVV
RAYMQGGRFCDGWGSCERREKRFIKPNHDIPSTAETRAKNKACQDLLGIGEYRPGACQGQK
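
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the function name `translate` is hypothetical, and in practice a library such as Biopython would normally be used instead.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS sequence on this page.
print(translate("ATGGCGTCTCCAATCACGTACTCTGCAATC"))  # MASPITYSAI
print(translate("GGGCAAAAGTAA"))  # GQK
```

Both fragments agree with the start (MASPITYSAI) and end (GQK) of the protein sequence shown on this page.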