; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036779 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036779
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRho_N domain-containing protein
Genome locationscaffold5:40924083..40925531
RNA-Seq ExpressionSpg036779
SyntenySpg036779
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446702.1 PREDICTED: SAP-like protein BP-73 [Cucumis melo]8.4e-9275.85Show/hide
Query:  SGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPK
        SGKSDGSSSFP S SNP+SQF L  KKTH RFESLLNLADIAD YPSK+IQLSVS+NRPDGNA +RPPRR+S PG+TR++E  SRK +A KNEE++KK K
Subjt:  SGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPK

Query:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVRPPSKFVKRSP
         N+QEEIIALFRKIQ SIAKESAS+ DE+S KDEHGA SILETLRE RKQ+KGK+SKK GAKV R KGTSEEKE +  S  P ADFKLVRPPSKFVKRSP
Subjt:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVRPPSKFVKRSP

Query:  IPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        IP          KV+ASQAIAE+ ELKFPSIE+MKL ELKALAKSRG KGYSKLKKNEL+E+LRS
Subjt:  IPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

XP_011648505.1 uncharacterized protein LOC105434499, partial [Cucumis sativus]4.3e-8871.64Show/hide
Query:  NIWIFEELHFSGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAH
        N+ I     F  KSDGSSSFP SNSNP SQF L  KKTH RFESLLNLAD+AD YPSK+IQ SVS +RPDGNAG+RPPRR+S PG+ R++E  SRKT+  
Subjt:  NIWIFEELHFSGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAH

Query:  KNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVR
        K+EE +KK ++N+QEE+IALFRKIQTSIAKESAS+ DE+S+KDE+   SILETLRESRKQ+KGK+SKK GAKVLR KG SEEKE +  S  P ADFKLVR
Subjt:  KNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVR

Query:  PPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        PPSKFVKRSPIP         LKV+ASQAIAE+ ELKFPS E+MKLTELKALAKSRG KGYSKLKKNEL+E+LRS
Subjt:  PPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

XP_022139842.1 uncharacterized protein LOC111010657 isoform X1 [Momordica charantia]7.6e-9376.23Show/hide
Query:  FSGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPK
        F GKSDG SSFP SNS  +SQFGL WK+THFRFESLLNLA+IA    SK IQ+SV+SN   GNAG RPPRRSS PGRTR+NEP+R   A    EDLK PK
Subjt:  FSGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPK

Query:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKFVKRSPIP
        SNNQEEIIALFRKIQTSIAK+SA+T DEDS +DE GAESILE+LRESRKQVKG++SKK G KVLRRKG SEE E  HTSP A+FKLVRPPSKFVKRSPIP
Subjt:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKFVKRSPIP

Query:  SPSGGNG--SHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        SP G NG  S L+ E SQAIAE+ E+KFPS+E+MKLTELKA+AKSRG KGYSKLKKNELLELLRS
Subjt:  SPSGGNG--SHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

XP_038893910.1 SAP-like protein BP-73 isoform X1 [Benincasa hispida]1.4e-8680.43Show/hide
Query:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKD
        +LLNLADIAD YPSK IQLSVS+NRPDG AG+RPPRR SAPG+TR+NEP SRKT+AH NEEDLKK K NNQEEIIALFRKI+TSIAKESAS+NDE+S KD
Subjt:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKD

Query:  EHGAESILETLRESRKQVK----GKSSKKTGAKVLRRKGTSEEKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPS
        EHGAESILETLRESRKQVK    GKSSKK GAK LR +GTSEEKE +  S P ADF+LVRPPSKFVKRSPIPSP  GNGSH +V+A+QAIAE+ ELKFPS
Subjt:  EHGAESILETLRESRKQVK----GKSSKKTGAKVLRRKGTSEEKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPS

Query:  IEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        I++MKLTELKALAKSRG KGYSKLKKNEL+ELL S
Subjt:  IEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

XP_038893911.1 SAP-like protein BP-73 isoform X2 [Benincasa hispida]2.5e-8881.82Show/hide
Query:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKD
        +LLNLADIAD YPSK IQLSVS+NRPDG AG+RPPRR SAPG+TR+NEP SRKT+AH NEEDLKK K NNQEEIIALFRKI+TSIAKESAS+NDE+S KD
Subjt:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKD

Query:  EHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDM
        EHGAESILETLRESRKQVKGKSSKK GAK LR +GTSEEKE +  S P ADF+LVRPPSKFVKRSPIPSP  GNGSH +V+A+QAIAE+ ELKFPSI++M
Subjt:  EHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDM

Query:  KLTELKALAKSRGFKGYSKLKKNELLELLRS
        KLTELKALAKSRG KGYSKLKKNEL+ELL S
Subjt:  KLTELKALAKSRGFKGYSKLKKNELLELLRS

TrEMBL top hitse value%identityAlignment
A0A1S3BFQ9 SAP-like protein BP-734.1e-9275.85Show/hide
Query:  SGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPK
        SGKSDGSSSFP S SNP+SQF L  KKTH RFESLLNLADIAD YPSK+IQLSVS+NRPDGNA +RPPRR+S PG+TR++E  SRK +A KNEE++KK K
Subjt:  SGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPK

Query:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVRPPSKFVKRSP
         N+QEEIIALFRKIQ SIAKESAS+ DE+S KDEHGA SILETLRE RKQ+KGK+SKK GAKV R KGTSEEKE +  S  P ADFKLVRPPSKFVKRSP
Subjt:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVRPPSKFVKRSP

Query:  IPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        IP          KV+ASQAIAE+ ELKFPSIE+MKL ELKALAKSRG KGYSKLKKNEL+E+LRS
Subjt:  IPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

A0A5A7UMB7 SAP-like protein BP-734.1e-9275.85Show/hide
Query:  SGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPK
        SGKSDGSSSFP S SNP+SQF L  KKTH RFESLLNLADIAD YPSK+IQLSVS+NRPDGNA +RPPRR+S PG+TR++E  SRK +A KNEE++KK K
Subjt:  SGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPK

Query:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVRPPSKFVKRSP
         N+QEEIIALFRKIQ SIAKESAS+ DE+S KDEHGA SILETLRE RKQ+KGK+SKK GAKV R KGTSEEKE +  S  P ADFKLVRPPSKFVKRSP
Subjt:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTS--PVADFKLVRPPSKFVKRSP

Query:  IPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        IP          KV+ASQAIAE+ ELKFPSIE+MKL ELKALAKSRG KGYSKLKKNEL+E+LRS
Subjt:  IPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

A0A6J1CF22 uncharacterized protein LOC111010657 isoform X13.7e-9376.23Show/hide
Query:  FSGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPK
        F GKSDG SSFP SNS  +SQFGL WK+THFRFESLLNLA+IA    SK IQ+SV+SN   GNAG RPPRRSS PGRTR+NEP+R   A    EDLK PK
Subjt:  FSGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPK

Query:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKFVKRSPIP
        SNNQEEIIALFRKIQTSIAK+SA+T DEDS +DE GAESILE+LRESRKQVKG++SKK G KVLRRKG SEE E  HTSP A+FKLVRPPSKFVKRSPIP
Subjt:  SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKFVKRSPIP

Query:  SPSGGNG--SHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        SP G NG  S L+ E SQAIAE+ E+KFPS+E+MKLTELKA+AKSRG KGYSKLKKNELLELLRS
Subjt:  SPSGGNG--SHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

A0A6J1CGN3 uncharacterized protein LOC111010657 isoform X21.2e-7569.6Show/hide
Query:  LIWKKTHFRFESLLN--------LADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQ
        ++++    RF +L++        L +IA    SK IQ+SV+SN   GNAG RPPRRSS PGRTR+NEP+R   A    EDLK PKSNNQEEIIALFRKIQ
Subjt:  LIWKKTHFRFESLLN--------LADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQ

Query:  TSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKFVKRSPIPSPSGGNG--SHLKVE
        TSIAK+SA+T DEDS +DE GAESILE+LRESRKQVKG++SKK G KVLRRKG SEE E  HTSP A+FKLVRPPSKFVKRSPIPSP G NG  S L+ E
Subjt:  TSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKFVKRSPIPSPSGGNG--SHLKVE

Query:  ASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
         SQAIAE+ E+KFPS+E+MKLTELKA+AKSRG KGYSKLKKNELLELLRS
Subjt:  ASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

A0A6J1GSF6 uncharacterized protein LOC1114567107.0e-7665.23Show/hide
Query:  IWIFEELH------FSGKSDGSSSFPDSNSNPISQFGLI-WKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPS-
        +WI   L+      F  KS+GSSS   SN NP  QFGL   ++THF FESLLNLA+IADGY SKSIQL+VSSN  DG  GH+P RRSSAPGRTR+N  S 
Subjt:  IWIFEELH------FSGKSDGSSSFPDSNSNPISQFGLI-WKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPS-

Query:  RKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADF
        RKTD HKN ED+KKPKSNNQEEIIALFRKIQTSIA+E+AS+ DE+S KDE G ESILE L ESRKQVKGK+ K  G K LRR GTSE          A+F
Subjt:  RKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADF

Query:  KLVRPPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        KLVRPPS FVKRSPIPSP+GGNG+HL                 ++E+MKL ELKA+AKSRG KGYSKLKKNELLELL S
Subjt:  KLVRPPSKFVKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor1.8e-0447.06Show/hide
Query:  EASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        +A +   E  E     + ++KL EL+ +AKSRG KG SK+KK EL+ELL S
Subjt:  EASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

AT4G18740.1 Rho termination factor4.1e-2040.46Show/hide
Query:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKF
        P  +NQEEII+L ++IQ+SI+K  +   +E+   DE   E     +IL+ L +SRK+ +G +S K                     P    +L RPPS F
Subjt:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKF

Query:  VKRSPIPSPSGGNGSHLKVEASQAIAETM---ELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        VKR+P+ S + G    L V  S      +   E K   IE MKL ELK +AK+RG KGYSKL+K+ELLEL+RS
Subjt:  VKRSPIPSPSGGNGSHLKVEASQAIAETM---ELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

AT4G18740.2 Rho termination factor2.6e-1435.29Show/hide
Query:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKF
        P  +NQEEII+L ++IQ+SI+K  +   +E+   DE   E     +IL+ L +SRK+ +G +S K                     P    +L RPPS F
Subjt:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKF

Query:  VKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        VKR+P+ S + G                              ELK +AK+RG KGYSKL+K+ELLEL+RS
Subjt:  VKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS

AT4G18740.3 Rho termination factor2.6e-1435.29Show/hide
Query:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKF
        P  +NQEEII+L ++IQ+SI+K  +   +E+   DE   E     +IL+ L +SRK+ +G +S K                     P    +L RPPS F
Subjt:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKF

Query:  VKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS
        VKR+P+ S + G                              ELK +AK+RG KGYSKL+K+ELLEL+RS
Subjt:  VKRSPIPSPSGGNGSHLKVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACATTTGGATATTCGAAGAATTGCATTTTTCTGGAAAATCAGATGGAAGCAGTAGTTTTCCAGACTCGAACTCTAATCCGATTTCCCAATTTGGTCTCATTTGGAA
GAAGACCCATTTTCGCTTTGAAAGCTTGTTAAATTTGGCAGACATTGCAGATGGTTATCCCAGCAAGAGCATTCAATTATCTGTTTCAAGCAATAGGCCAGATGGAAATG
CAGGGCATCGACCTCCTCGTAGAAGTTCTGCGCCTGGAAGAACCAGGAGGAATGAACCCTCGAGGAAAACAGATGCCCATAAGAATGAGGAAGACCTAAAAAAACCCAAA
TCAAATAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGAATCCGCAAGCACCAATGATGAAGATTCCCAGAAGGACGAACATGGAGC
CGAGTCTATTTTGGAGACTCTTCGTGAATCAAGGAAGCAAGTGAAAGGCAAATCTTCAAAGAAGACAGGAGCTAAAGTGTTGAGAAGAAAAGGCACGTCTGAAGAGAAGG
AAACGAATCATACTTCGCCGGTGGCAGATTTCAAATTAGTACGACCACCATCTAAATTTGTGAAGAGATCACCAATCCCGTCTCCATCAGGAGGAAATGGTTCACATCTT
AAAGTGGAGGCCTCTCAGGCCATAGCTGAAACCATGGAGTTGAAGTTCCCAAGTATAGAGGATATGAAACTTACCGAGCTGAAAGCACTAGCAAAATCAAGAGGATTTAA
GGGTTACTCCAAATTGAAGAAAAATGAGCTCCTGGAACTCCTGAGATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACATTTGGATATTCGAAGAATTGCATTTTTCTGGAAAATCAGATGGAAGCAGTAGTTTTCCAGACTCGAACTCTAATCCGATTTCCCAATTTGGTCTCATTTGGAA
GAAGACCCATTTTCGCTTTGAAAGCTTGTTAAATTTGGCAGACATTGCAGATGGTTATCCCAGCAAGAGCATTCAATTATCTGTTTCAAGCAATAGGCCAGATGGAAATG
CAGGGCATCGACCTCCTCGTAGAAGTTCTGCGCCTGGAAGAACCAGGAGGAATGAACCCTCGAGGAAAACAGATGCCCATAAGAATGAGGAAGACCTAAAAAAACCCAAA
TCAAATAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGAATCCGCAAGCACCAATGATGAAGATTCCCAGAAGGACGAACATGGAGC
CGAGTCTATTTTGGAGACTCTTCGTGAATCAAGGAAGCAAGTGAAAGGCAAATCTTCAAAGAAGACAGGAGCTAAAGTGTTGAGAAGAAAAGGCACGTCTGAAGAGAAGG
AAACGAATCATACTTCGCCGGTGGCAGATTTCAAATTAGTACGACCACCATCTAAATTTGTGAAGAGATCACCAATCCCGTCTCCATCAGGAGGAAATGGTTCACATCTT
AAAGTGGAGGCCTCTCAGGCCATAGCTGAAACCATGGAGTTGAAGTTCCCAAGTATAGAGGATATGAAACTTACCGAGCTGAAAGCACTAGCAAAATCAAGAGGATTTAA
GGGTTACTCCAAATTGAAGAAAAATGAGCTCCTGGAACTCCTGAGATCCTAA
Protein sequenceShow/hide protein sequence
MNIWIFEELHFSGKSDGSSSFPDSNSNPISQFGLIWKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPK
SNNQEEIIALFRKIQTSIAKESASTNDEDSQKDEHGAESILETLRESRKQVKGKSSKKTGAKVLRRKGTSEEKETNHTSPVADFKLVRPPSKFVKRSPIPSPSGGNGSHL
KVEASQAIAETMELKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELLELLRS