; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006121 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006121
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold4:3129800..3134852
RNA-Seq ExpressionSpg006121
SyntenySpg006121
Gene Ontology termsNA
InterPro domainsIPR040306 - Os02g0753200-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031490.1 hypothetical protein E6C27_scaffold139G002510 [Cucumis melo var. makuwa]1.5e-11377.18Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG
        MSLALLQGYSSAEEEAE NSVFNHTSSDDDDE LAAAAASTVTVNLSIRD+SLFELPQPSS PGLPSAFDAFSEVSGPPEFLNNSVEEYA  RD DQPRG
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG

Query:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS
        GHGGRRNRKEKKDLPT                                                      GAVLEAKAQLVGIHERVRSDVESN  SN S
Subjt:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS

Query:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        ISN T EGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP+SETKKKGSTVKDKEK+KRMRGQSSHATWKSETEMQLRQQFD
Subjt:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

KAG6571683.1 putative CRM domain-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.8e-11175.17Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG
        MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDE L AAAAS+VT NLSIRD+SLFELPQPSS+PGLPSAFDAFSEVSGPPEFLNNSVEEYA  RDVDQPRG
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG

Query:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS
        GHG RRNRKEKKD PT                                                      GAV+EAKAQLVGIHERVRSD ESNQ SNPS
Subjt:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS

Query:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        +SNTTQ+GKR+ATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP+SETKKKGSTVKDKEK+KRMRGQSSHA+WKSETEM LRQQFD
Subjt:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

XP_008455215.2 PREDICTED: uncharacterized protein LOC103495434 [Cucumis melo]1.5e-11377.18Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG
        MSLALLQGYSSAEEEAE NSVFNHTSSDDDDE LAAAAASTVTVNLSIRD+SLFELPQPSS PGLPSAFDAFSEVSGPPEFLNNSVEEYA  RD DQPRG
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG

Query:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS
        GHGGRRNRKEKKDLPT                                                      GAVLEAKAQLVGIHERVRSDVESN  SN S
Subjt:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS

Query:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        ISN T EGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP+SETKKKGSTVKDKEK+KRMRGQSSHATWKSETEMQLRQQFD
Subjt:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

XP_022141948.1 uncharacterized protein LOC111012199 [Momordica charantia]8.8e-11175Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDE--GLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQP
        MSLAL+QGYSSAE+EAEDNS+ + TSSDDD++    AAAA+S+ TVNLSIRDRSLFELPQPSS+PGLPSAFDAFSEVSGPPEFLNNSVEEYA T+DVDQP
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDE--GLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQP

Query:  RGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSN
        RG HGGRRNRKEKKDLPT                                                      GAVLEAKAQLVGIHERVRSDVESNQPSN
Subjt:  RGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSN

Query:  PSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        PSISN TQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEM LRQQFD
Subjt:  PSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

XP_038888650.1 uncharacterized protein LOC120078451 [Benincasa hispida]2.3e-11176Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLA--AAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQP
        MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDE LA  AAAASTVTVNLSIRD+SLFELPQPSS+PGLPSAFDAFSEVSGPPEFLNNSVEEYA  R+VDQP
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLA--AAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQP

Query:  RGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSN
        RGGHGGRRNRKEKKDLPT                                                      GAVLEAK QLVGIHERVRSDVE+N  SN
Subjt:  RGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSN

Query:  PSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        PSISN T E KRVAT ANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP SETKKKGSTVKDKEK+KRMRGQSSHATWKSETEMQLRQQFD
Subjt:  PSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

TrEMBL top hitse value%identityAlignment
A0A1S3C0J1 uncharacterized protein LOC1034954347.0e-11477.18Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG
        MSLALLQGYSSAEEEAE NSVFNHTSSDDDDE LAAAAASTVTVNLSIRD+SLFELPQPSS PGLPSAFDAFSEVSGPPEFLNNSVEEYA  RD DQPRG
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG

Query:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS
        GHGGRRNRKEKKDLPT                                                      GAVLEAKAQLVGIHERVRSDVESN  SN S
Subjt:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS

Query:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        ISN T EGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP+SETKKKGSTVKDKEK+KRMRGQSSHATWKSETEMQLRQQFD
Subjt:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

A0A5A7SL87 Uncharacterized protein7.0e-11477.18Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG
        MSLALLQGYSSAEEEAE NSVFNHTSSDDDDE LAAAAASTVTVNLSIRD+SLFELPQPSS PGLPSAFDAFSEVSGPPEFLNNSVEEYA  RD DQPRG
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG

Query:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS
        GHGGRRNRKEKKDLPT                                                      GAVLEAKAQLVGIHERVRSDVESN  SN S
Subjt:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS

Query:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        ISN T EGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP+SETKKKGSTVKDKEK+KRMRGQSSHATWKSETEMQLRQQFD
Subjt:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

A0A6J1CM17 uncharacterized protein LOC1110121994.3e-11175Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDE--GLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQP
        MSLAL+QGYSSAE+EAEDNS+ + TSSDDD++    AAAA+S+ TVNLSIRDRSLFELPQPSS+PGLPSAFDAFSEVSGPPEFLNNSVEEYA T+DVDQP
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDE--GLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQP

Query:  RGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSN
        RG HGGRRNRKEKKDLPT                                                      GAVLEAKAQLVGIHERVRSDVESNQPSN
Subjt:  RGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSN

Query:  PSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        PSISN TQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEM LRQQFD
Subjt:  PSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

A0A6J1EIK4 uncharacterized protein LOC1114346943.1e-10974.83Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG
        MSLALLQGYSSAEEEAED SVFN+TSSDDDD+ L AAAASTVTVNLSIRDRSLFELPQPSS+PGLPSAFDAFSEVSGPPEFLNNSVEE+A  RDVDQPR 
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG

Query:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS
          GGRRNR+EKKDLPT                                                      GAVLEAKAQLVGIHERVRSDVE+NQPSNPS
Subjt:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS

Query:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPP----EPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQ
        ISNTTQ GKRVATA NPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPP    EP+SETKKKGSTVKDKEK+KRMRGQSSHATWKSETEMQLRQQ
Subjt:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPP----EPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQ

Query:  FD
        FD
Subjt:  FD

A0A6J1HK72 uncharacterized protein LOC1114637999.5e-11174.83Show/hide
Query:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG
        MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDE L AAAAS+VT NLSIRD+SLFELPQPSS+PGLPSAFDAFSEVSGPPEFLNNSVEEYA  RDVDQPRG
Subjt:  MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRG

Query:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS
        GHG RRNRKEK+D PT                                                      GAV+EAKAQLVGIHERVRSD ESNQ SNPS
Subjt:  GHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPS

Query:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD
        +SNTTQ+GKR+ATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP+SETKKKGSTVKDKEK+KRMRGQSSHA+WKSETEM LRQQFD
Subjt:  ISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G64160.1 unknown protein5.2e-5345.18Show/hide
Query:  MSLALLQGYSSA-EEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVE--EYATTRDVDQ
        MSL LLQGYSSA EEEAE+ +  ++ +SD+D +       S+   + S    S       + N GLPSA D FS++SGPPEFLNN  E    A+ RD + 
Subjt:  MSLALLQGYSSA-EEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVE--EYATTRDVDQ

Query:  PRGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPS
            H  R +RK+KK  P                                                       G V+EAK QLVGIHERVR+D+++  PS
Subjt:  PRGGHGGRRNRKEKKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPS

Query:  NPSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQF
        + S        KR++TA NPNAE++A+LLRMC+ CG+PKT+++ARGM CP+CGDR P P+ + KKKGST+KDKEK KRMRGQSSHA+WKSETEMQLRQ F
Subjt:  NPSISNTTQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQF

Query:  D
        D
Subjt:  D


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTGGCACTTCTCCAGGGCTATTCTTCCGCCGAAGAAGAAGCTGAAGACAACTCTGTCTTCAACCACACCTCTTCCGACGATGACGACGAGGGTCTTGCCGCCGC
CGCCGCCTCTACGGTCACCGTCAATCTTTCTATACGCGACAGGTCGCTTTTCGAACTTCCGCAGCCCTCCTCTAATCCCGGCCTCCCATCCGCATTCGACGCTTTCTCCG
AAGTTTCAGGACCGCCGGAGTTTCTGAATAATTCGGTTGAGGAGTACGCTACAACGAGAGATGTCGATCAGCCGCGTGGGGGCCATGGGGGCCGCAGGAATCGTAAGGAG
AAGAAGGATTTGCCTACTGGGGGGATGGAAATTGACATAAGAAAACTGTTGGGAGTGTGGCTAAAGTCCTACATTGCTGCGAAAGGAAAAGATCATGAGTATATAAGTAT
GGGACAACATCTTCATTGTAGTTTGTATTTGGCATATTCCCTTGCTCATGCACAGGCTGATGGTTATGATGGTGCTGTATTGGAGGCAAAAGCTCAATTAGTTGGGATTC
ATGAGCGAGTGAGGAGTGACGTTGAGAGTAATCAACCTTCAAATCCATCCATTTCAAATACAACACAGGAAGGCAAGCGTGTGGCGACTGCAGCCAATCCAAATGCCGAA
GATGCTGCAGAGCTACTGAGAATGTGCCTGCATTGTGGCATTCCCAAGACCTTTTCGAATGCACGAGGGATGTTTTGCCCACTATGTGGCGATCGTCCTCCAGAGCCGAA
CAGTGAGACCAAAAAGAAGGGTTCTACTGTGAAAGATAAGGAAAAGGTAAAGAGAATGAGGGGACAGTCTTCTCATGCTACATGGAAGAGTGAAACAGAGATGCAGCTTA
GGCAACAGTTTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTGGCACTTCTCCAGGGCTATTCTTCCGCCGAAGAAGAAGCTGAAGACAACTCTGTCTTCAACCACACCTCTTCCGACGATGACGACGAGGGTCTTGCCGCCGC
CGCCGCCTCTACGGTCACCGTCAATCTTTCTATACGCGACAGGTCGCTTTTCGAACTTCCGCAGCCCTCCTCTAATCCCGGCCTCCCATCCGCATTCGACGCTTTCTCCG
AAGTTTCAGGACCGCCGGAGTTTCTGAATAATTCGGTTGAGGAGTACGCTACAACGAGAGATGTCGATCAGCCGCGTGGGGGCCATGGGGGCCGCAGGAATCGTAAGGAG
AAGAAGGATTTGCCTACTGGGGGGATGGAAATTGACATAAGAAAACTGTTGGGAGTGTGGCTAAAGTCCTACATTGCTGCGAAAGGAAAAGATCATGAGTATATAAGTAT
GGGACAACATCTTCATTGTAGTTTGTATTTGGCATATTCCCTTGCTCATGCACAGGCTGATGGTTATGATGGTGCTGTATTGGAGGCAAAAGCTCAATTAGTTGGGATTC
ATGAGCGAGTGAGGAGTGACGTTGAGAGTAATCAACCTTCAAATCCATCCATTTCAAATACAACACAGGAAGGCAAGCGTGTGGCGACTGCAGCCAATCCAAATGCCGAA
GATGCTGCAGAGCTACTGAGAATGTGCCTGCATTGTGGCATTCCCAAGACCTTTTCGAATGCACGAGGGATGTTTTGCCCACTATGTGGCGATCGTCCTCCAGAGCCGAA
CAGTGAGACCAAAAAGAAGGGTTCTACTGTGAAAGATAAGGAAAAGGTAAAGAGAATGAGGGGACAGTCTTCTCATGCTACATGGAAGAGTGAAACAGAGATGCAGCTTA
GGCAACAGTTTGATTAG
Protein sequenceShow/hide protein sequence
MSLALLQGYSSAEEEAEDNSVFNHTSSDDDDEGLAAAAASTVTVNLSIRDRSLFELPQPSSNPGLPSAFDAFSEVSGPPEFLNNSVEEYATTRDVDQPRGGHGGRRNRKE
KKDLPTGGMEIDIRKLLGVWLKSYIAAKGKDHEYISMGQHLHCSLYLAYSLAHAQADGYDGAVLEAKAQLVGIHERVRSDVESNQPSNPSISNTTQEGKRVATAANPNAE
DAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMQLRQQFD