CuGenDBv2

Lag0025013 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025013
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Rho_N domain-containing protein
Genome location: chr10:7781426..7782767
RNA-Seq Expression: Lag0025013
Synteny: Lag0025013
Gene Ontology terms: GO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domains: IPR011112 - Rho termination factor, N-terminal
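
The genome location above follows a `chrom:start..end` pattern. As a minimal sketch, the helper below splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` and `span_length` are hypothetical names introduced only for this illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr10:7781426..7782767")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus covers 1342 bp, which is longer than the 822 nt coding sequence listed further down.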


Homology
GenBank top hits | e value | % identity
XP_008446702.1 PREDICTED: SAP-like protein BP-73 [Cucumis melo] | 4.2e-91 | 75
Query:  GKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPKS
        GKSDGSSSFP S SNP+SQF L  KKTH RFESLLNLADIAD YPSK+IQLSVS+NRPDGNA +RPPRR+S PG+TR++E  SRK +A KNEE++KK K 
Subjt:  GKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPKS

Query:  NNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRSPI
        N+QEEIIALFRKIQ SIAK+SAS+ DE+S KDEHGA SILETLRE RKQ+KGKTSKK GAKV R KGTSE+KE +  S  P ADFKLVRPPSKFVKRSPI
Subjt:  NNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRSPI

Query:  PSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        P          +V+ASQAIAE+ E KFPSIE+MKL ELKALAKSRG KGYSKLKKNELME+LRS
Subjt:  PSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

XP_011648505.1 uncharacterized protein LOC105434499, partial [Cucumis sativus] | 9.6e-88 | 72.56
Query:  FLGKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKP
        F  KSDGSSSFP SNSNP SQF L  KKTH RFESLLNLAD+AD YPSK+IQ SVS +RPDGNAG+RPPRR+S PG+ R++E  SRKT+  K+EE +KK 
Subjt:  FLGKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKP

Query:  KSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRS
        ++N+QEE+IALFRKIQTSIAK+SAS+ DE+S+KDE+   SILETLRESRKQ+KGKTSKK GAKVLR KG SE+KE +  S  P ADFKLVRPPSKFVKRS
Subjt:  KSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRS

Query:  PIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        PIP         L+V+ASQAIAE+ E KFPS E+MKLTELKALAKSRG KGYSKLKKNELME+LRS
Subjt:  PIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

XP_022139842.1 uncharacterized protein LOC111010657 isoform X1 [Momordica charantia] | 7.1e-99 | 77.45
Query:  MQFGYSKNCIFLGKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAH
        MQFGYSKNCIF GKSDG SSFPVSNS  +SQFGL  K+THFRFESLLNLA+IA    SK IQ+SV+SN   GNAG RPPRRSS PGRTR+NEP+R   A 
Subjt:  MQFGYSKNCIFLGKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAH

Query:  KNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPP
           EDLK PKSNNQEEIIALFRKIQTSIAKDSA+T DEDS +DE GAESILE+LRESRKQVKG+TSKK G KVLRRKG SE+ E  HTSP A+FKLVRPP
Subjt:  KNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPP

Query:  SKFVKRSPIPSPSGGNG--SHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        SKFVKRSPIPSP G NG  S LR E SQAIAE+ E KFPS+E+MKLTELKA+AKSRG KGYSKLKKNEL+ELLRS
Subjt:  SKFVKRSPIPSPSGGNG--SHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

XP_038893910.1 SAP-like protein BP-73 isoform X1 [Benincasa hispida] | 4.5e-85 | 79.15
Query:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKD
        +LLNLADIAD YPSK IQLSVS+NRPDG AG+RPPRR SAPG+TR+NEP SRKT+AH NEEDLKK K NNQEEIIALFRKI+TSIAK+SAS+NDE+S KD
Subjt:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKD

Query:  EHGAESILETLRESRKQVK----GKTSKKTGAKVLRRKGTSEDKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPS
        EHGAESILETLRESRKQVK    GK+SKK GAK LR +GTSE+KE +  S P ADF+LVRPPSKFVKRSPIPSP  GNGSH RV+A+QAIAE+ E KFPS
Subjt:  EHGAESILETLRESRKQVK----GKTSKKTGAKVLRRKGTSEDKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPS

Query:  IEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        I++MKLTELKALAKSRG KGYSKLKKNEL+ELL S
Subjt:  IEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

XP_038893911.1 SAP-like protein BP-73 isoform X2 [Benincasa hispida] | 8.1e-87 | 80.52
Query:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKD
        +LLNLADIAD YPSK IQLSVS+NRPDG AG+RPPRR SAPG+TR+NEP SRKT+AH NEEDLKK K NNQEEIIALFRKI+TSIAK+SAS+NDE+S KD
Subjt:  SLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEP-SRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKD

Query:  EHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDM
        EHGAESILETLRESRKQVKGK+SKK GAK LR +GTSE+KE +  S P ADF+LVRPPSKFVKRSPIPSP  GNGSH RV+A+QAIAE+ E KFPSI++M
Subjt:  EHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS-PVADFKLVRPPSKFVKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDM

Query:  KLTELKALAKSRGFKGYSKLKKNELMELLRS
        KLTELKALAKSRG KGYSKLKKNEL+ELL S
Subjt:  KLTELKALAKSRGFKGYSKLKKNELMELLRS

TrEMBL top hits | e value | % identity
A0A1S3BFQ9 SAP-like protein BP-73 | 2.0e-91 | 75
Query:  GKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPKS
        GKSDGSSSFP S SNP+SQF L  KKTH RFESLLNLADIAD YPSK+IQLSVS+NRPDGNA +RPPRR+S PG+TR++E  SRK +A KNEE++KK K 
Subjt:  GKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPKS

Query:  NNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRSPI
        N+QEEIIALFRKIQ SIAK+SAS+ DE+S KDEHGA SILETLRE RKQ+KGKTSKK GAKV R KGTSE+KE +  S  P ADFKLVRPPSKFVKRSPI
Subjt:  NNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRSPI

Query:  PSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        P          +V+ASQAIAE+ E KFPSIE+MKL ELKALAKSRG KGYSKLKKNELME+LRS
Subjt:  PSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

A0A5A7UMB7 SAP-like protein BP-73 | 2.0e-91 | 75
Query:  GKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPKS
        GKSDGSSSFP S SNP+SQF L  KKTH RFESLLNLADIAD YPSK+IQLSVS+NRPDGNA +RPPRR+S PG+TR++E  SRK +A KNEE++KK K 
Subjt:  GKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNE-PSRKTDAHKNEEDLKKPKS

Query:  NNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRSPI
        N+QEEIIALFRKIQ SIAK+SAS+ DE+S KDEHGA SILETLRE RKQ+KGKTSKK GAKV R KGTSE+KE +  S  P ADFKLVRPPSKFVKRSPI
Subjt:  NNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTS--PVADFKLVRPPSKFVKRSPI

Query:  PSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        P          +V+ASQAIAE+ E KFPSIE+MKL ELKALAKSRG KGYSKLKKNELME+LRS
Subjt:  PSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

A0A6J1CF22 uncharacterized protein LOC111010657 isoform X1 | 3.4e-99 | 77.45
Query:  MQFGYSKNCIFLGKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAH
        MQFGYSKNCIF GKSDG SSFPVSNS  +SQFGL  K+THFRFESLLNLA+IA    SK IQ+SV+SN   GNAG RPPRRSS PGRTR+NEP+R   A 
Subjt:  MQFGYSKNCIFLGKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAH

Query:  KNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPP
           EDLK PKSNNQEEIIALFRKIQTSIAKDSA+T DEDS +DE GAESILE+LRESRKQVKG+TSKK G KVLRRKG SE+ E  HTSP A+FKLVRPP
Subjt:  KNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPP

Query:  SKFVKRSPIPSPSGGNG--SHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        SKFVKRSPIPSP G NG  S LR E SQAIAE+ E KFPS+E+MKLTELKA+AKSRG KGYSKLKKNEL+ELLRS
Subjt:  SKFVKRSPIPSPSGGNG--SHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

A0A6J1CGN3 uncharacterized protein LOC111010657 isoform X2 | 9.1e-76 | 75.77
Query:  LADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE
        L +IA    SK IQ+SV+SN   GNAG RPPRRSS PGRTR+NEP+R   A    EDLK PKSNNQEEIIALFRKIQTSIAKDSA+T DEDS +DE GAE
Subjt:  LADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE

Query:  SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKFVKRSPIPSPSGGNG--SHLRVEASQAIAETTESKFPSIEDMKLTE
        SILE+LRESRKQVKG+TSKK G KVLRRKG SE+ E  HTSP A+FKLVRPPSKFVKRSPIPSP G NG  S LR E SQAIAE+ E KFPS+E+MKLTE
Subjt:  SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKFVKRSPIPSPSGGNG--SHLRVEASQAIAETTESKFPSIEDMKLTE

Query:  LKALAKSRGFKGYSKLKKNELMELLRS
        LKA+AKSRG KGYSKLKKNEL+ELLRS
Subjt:  LKALAKSRGFKGYSKLKKNELMELLRS

A0A6J1GSF6 uncharacterized protein LOC111456710 | 8.2e-77 | 67.55
Query:  FLGKSDGSSSFPVSNSNPISQFGLI-RKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPS-RKTDAHKNEEDLKK
        F  KS+GSSS  VSN NP  QFGL  +++THF FESLLNLA+IADGY SKSIQL+VSSN  DG  GH+P RRSSAPGRTR+N  S RKTD HKN ED+KK
Subjt:  FLGKSDGSSSFPVSNSNPISQFGLI-RKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPS-RKTDAHKNEEDLKK

Query:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKFVKRSP
        PKSNNQEEIIALFRKIQTSIA+++AS+ DE+S KDE G ESILE L ESRKQVKGKT K  G K LRR GTSE          A+FKLVRPPS FVKRSP
Subjt:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKFVKRSP

Query:  IPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        IPSP+GGNG+HL                 ++E+MKL ELKA+AKSRG KGYSKLKKNEL+ELL S
Subjt:  IPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G06190.1 Rho termination factor | 8.3e-05 | 47.06
Query:  EASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        +A +   E  E     + ++KL EL+ +AKSRG KG SK+KK EL+ELL S
Subjt:  EASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
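
The % identity column can be cross-checked against an alignment block. In the usual BLAST-style layout, the middle row shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below counts the letters in that middle row and divides by the number of aligned columns. Whether the database computes its figure exactly this way is an assumption, and `percent_identity` is a hypothetical helper. It deliberately reads the middle row rather than comparing the Query and Subjt rows, because in this listing the Subjt rows repeat the Query rows.

```python
def percent_identity(query_row: str, match_row: str) -> float:
    """Identical columns (letters in the match row) over aligned columns.

    Assumes the match row marks identities with the residue letter,
    conservative substitutions with '+', and everything else with spaces.
    """
    identical = sum(1 for ch in match_row if ch.isalpha())
    aligned_columns = len(query_row.strip())  # gap characters count as columns
    return 100.0 * identical / aligned_columns


# Rows copied from the AT1G06190.1 alignment above.
query = "EASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS"
match = "+A +   E  E     + ++KL EL+ +AKSRG KG SK+KK EL+ELL S"
print(round(percent_identity(query, match), 2))  # 24 identities / 51 columns -> 47.06
```

The result agrees with the 47.06 reported for this hit.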

AT4G18740.1 Rho termination factor | 2.8e-21 | 41.62
Query:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKF
        P  +NQEEII+L ++IQ+SI+K  +   +E+   DE   E     +IL+ L +SRK+ +G TS K                     P    +L RPPS F
Subjt:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKF

Query:  VKRSPIPSPSGGNGSHLRVEAS-QAIAETT--ESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        VKR+P+ S + G    L V  S +A+ + T  E K   IE MKL ELK +AK+RG KGYSKL+K+EL+EL+RS
Subjt:  VKRSPIPSPSGGNGSHLRVEAS-QAIAETT--ESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

AT4G18740.2 Rho termination factor | 1.2e-14 | 35.29
Query:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKF
        P  +NQEEII+L ++IQ+SI+K  +   +E+   DE   E     +IL+ L +SRK+ +G TS K                     P    +L RPPS F
Subjt:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKF

Query:  VKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        VKR+P+ S + G                              ELK +AK+RG KGYSKL+K+EL+EL+RS
Subjt:  VKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS

AT4G18740.3 Rho termination factor | 1.2e-14 | 35.29
Query:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKF
        P  +NQEEII+L ++IQ+SI+K  +   +E+   DE   E     +IL+ L +SRK+ +G TS K                     P    +L RPPS F
Subjt:  PKSNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAE-----SILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKF

Query:  VKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
        VKR+P+ S + G                              ELK +AK+RG KGYSKL+K+EL+EL+RS
Subjt:  VKRSPIPSPSGGNGSHLRVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS


Sequences
CDS sequence
ATGCAGTTTGGATATTCAAAGAATTGCATTTTTCTGGGAAAATCAGATGGAAGCAGTAGTTTTCCAGTCTCGAACTCTAATCCGATTTCCCAATTTGGTCTCATTCGGAA
GAAGACCCATTTTCGCTTTGAAAGCTTGTTAAATTTGGCAGACATTGCAGATGGTTATCCCAGCAAGAGCATTCAATTATCTGTTTCAAGCAATAGGCCAGATGGAAATG
CAGGGCATCGACCTCCTCGTAGAAGTTCTGCGCCTGGAAGAACCAGGAGGAATGAACCCTCGAGGAAAACAGATGCCCATAAGAATGAGGAAGACCTAAAAAAACCCAAA
TCAAATAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGATTCCGCAAGCACCAATGATGAAGATTCCCAGAAGGATGAACATGGAGC
CGAGTCTATTTTGGAGACTCTTCGTGAATCAAGGAAGCAAGTGAAAGGCAAAACTTCAAAGAAGACAGGAGCTAAAGTGTTGAGAAGAAAAGGCACGTCTGAAGATAAGG
AAACGAATCATACTTCGCCGGTGGCAGATTTCAAATTAGTACGACCACCATCTAAATTTGTGAAGAGATCACCCATCCCGTCTCCCTCAGGAGGAAATGGTTCACATCTT
AGAGTGGAGGCCTCTCAGGCCATAGCTGAAACCACAGAGTCGAAGTTCCCAAGTATAGAGGATATGAAACTTACCGAGCTGAAAGCACTAGCAAAATCAAGAGGATTTAA
GGGTTACTCCAAATTGAAGAAAAATGAGCTCATGGAACTCCTGAGATCCTAA
mRNA sequence
ATGCAGTTTGGATATTCAAAGAATTGCATTTTTCTGGGAAAATCAGATGGAAGCAGTAGTTTTCCAGTCTCGAACTCTAATCCGATTTCCCAATTTGGTCTCATTCGGAA
GAAGACCCATTTTCGCTTTGAAAGCTTGTTAAATTTGGCAGACATTGCAGATGGTTATCCCAGCAAGAGCATTCAATTATCTGTTTCAAGCAATAGGCCAGATGGAAATG
CAGGGCATCGACCTCCTCGTAGAAGTTCTGCGCCTGGAAGAACCAGGAGGAATGAACCCTCGAGGAAAACAGATGCCCATAAGAATGAGGAAGACCTAAAAAAACCCAAA
TCAAATAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGATTCCGCAAGCACCAATGATGAAGATTCCCAGAAGGATGAACATGGAGC
CGAGTCTATTTTGGAGACTCTTCGTGAATCAAGGAAGCAAGTGAAAGGCAAAACTTCAAAGAAGACAGGAGCTAAAGTGTTGAGAAGAAAAGGCACGTCTGAAGATAAGG
AAACGAATCATACTTCGCCGGTGGCAGATTTCAAATTAGTACGACCACCATCTAAATTTGTGAAGAGATCACCCATCCCGTCTCCCTCAGGAGGAAATGGTTCACATCTT
AGAGTGGAGGCCTCTCAGGCCATAGCTGAAACCACAGAGTCGAAGTTCCCAAGTATAGAGGATATGAAACTTACCGAGCTGAAAGCACTAGCAAAATCAAGAGGATTTAA
GGGTTACTCCAAATTGAAGAAAAATGAGCTCATGGAACTCCTGAGATCCTAA
Protein sequence
MQFGYSKNCIFLGKSDGSSSFPVSNSNPISQFGLIRKKTHFRFESLLNLADIADGYPSKSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRRNEPSRKTDAHKNEEDLKKPK
SNNQEEIIALFRKIQTSIAKDSASTNDEDSQKDEHGAESILETLRESRKQVKGKTSKKTGAKVLRRKGTSEDKETNHTSPVADFKLVRPPSKFVKRSPIPSPSGGNGSHL
RVEASQAIAETTESKFPSIEDMKLTELKALAKSRGFKGYSKLKKNELMELLRS
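
The protein sequence above should follow from the CDS by codon-wise translation. As a minimal sketch, the code below builds the standard genetic code (NCBI translation table 1) and translates a CDS up to the first stop codon. Using the standard nuclear code for this gene is an assumption, and `translate` is a hypothetical helper rather than anything the database provides.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Whitespace and line breaks are ignored; ambiguous bases such as N
    raise KeyError because they are not in the table.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGCAGTTTGGATATTCAAAGAATTGCATT"))  # MQFGYSKNCI
print(translate("CTCCTGAGATCCTAA"))                 # LLRS
```

Both fragments match the start and end of the protein sequence above. Pasting the full CDS, line breaks included, into `translate` performs the same check over the whole sequence.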