; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020626 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020626
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed RNA polymerase II subunit 1-like
Genome locationchr7:897106..898617
RNA-Seq ExpressionLag0020626
SyntenyLag0020626
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048864.1 WW domain-binding protein 11-like [Cucumis melo var. makuwa]2.1e-7161.84Show/hide
Query:  MANLPRFGRAWQRISA-PRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS-------PAKKALSPAVSPKYNATVTRVA--------SP
        MANLPR+GR WQR+S+ PR APAAA+   EPEPEILPLAPT Q+LQ FEP PPPVAA   S       P+ +   PA SPKY ATV   A        SP
Subjt:  MANLPRFGRAWQRISA-PRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS-------PAKKALSPAVSPKYNATVTRVA--------SP

Query:  PVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEV
        PVSP RKS + R+SISPNSYQ+ +KPTTP LS L LPKS++VT I S I PEVE+K GP KK     D Q EY SG PP   +A+AINLAG N+GAVM++
Subjt:  PVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEV

Query:  KQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVD
         QFSDK  GGEV RKIETE+ V     +EK  R T FPMTP  NSNFQEVNNSV+YNSSC+ RDPGLHLDFSG+ KD+ A V+
Subjt:  KQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVD

KAG7028529.1 hypothetical protein SDJN02_09710 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-4346.38Show/hide
Query:  MANLPRFGRAWQRIS-APRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKALSPAVSPKYNATVTRV---------ASPPVSPSR
        M+N PRFG + QR S A  P      PA+E +PE  P    +QSLQ               PA+K  SP  SPKY  +VTRV          SPPVSP++
Subjt:  MANLPRFGRAWQRIS-APRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKALSPAVSPKYNATVTRV---------ASPPVSPSR

Query:  KSPDRRHSISP-NSYQKTVKPTTPPLSPLALP-----KSANVTAIQSKILPEVEKKIGPYKKPVE---KVDRQTEYGSGKPPQK---AEAINLAGHNVGA
        K PDR    SP  S  ++++ + PP  PLALP      + N T  Q +I PEVEKK   Y K VE   K DR +EYGSGKP +K   AE+INLAGHNVGA
Subjt:  KSPDRRHSISP-NSYQKTVKPTTPPLSPLALP-----KSANVTAIQSKILPEVEKKIGPYKKPVE---KVDRQTEYGSGKPPQK---AEAINLAGHNVGA

Query:  VMEVKQFS--DKRGGEVIRKIETE-SDVTAGAKE----------KGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGG
        VME+ + S   + GGE +RK ETE  D+    KE          K  +    PMT +MNSNFQ VNNSV+Y+SSCS RDPGLHL F+  +  DGA VDG 
Subjt:  VMEVKQFS--DKRGGEVIRKIETE-SDVTAGAKE----------KGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGG

Query:  EKSK
        +  K
Subjt:  EKSK

XP_008437853.1 PREDICTED: WW domain-binding protein 11-like [Cucumis melo]3.2e-5159.11Show/hide
Query:  PPPVAAWISSPAKKALSPAVSPKYNATVTRVA--------SPPVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIG
        PP       +P+ +   PA SPKY ATV   A        SPPVSP RKS + R+SISPNSYQ+ +KPTTP LS L LPKS++VT I S I PEVE+K G
Subjt:  PPPVAAWISSPAKKALSPAVSPKYNATVTRVA--------SPPVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIG

Query:  PYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNS
        P KK     D Q EY SG PP   +A+AINLAG N+GAVM++ QFSDK  GGEV RKIETE+ V     +EK  R T FPMTP  NSNFQEVNNSV+YNS
Subjt:  PYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNS

Query:  SCSGRDPGLHLDFSGESKDDGAIVD
        SC+ RDPGLHLDFSG+ KD+ A V+
Subjt:  SCSGRDPGLHLDFSGESKDDGAIVD

XP_011650663.1 gibberellin-regulated protein 14 [Cucumis sativus]4.5e-7461.89Show/hide
Query:  MANLPRFGRAWQRISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS----PAKKALSPAVSPKYNATVTRVA--------SPPVSP
        MANLPR+GR W+  +  R AP A Q   EPEPEILPLAPT Q+LQPFEP PP  AA  SS    P  +   PA SPKY ATV R A        SPPVSP
Subjt:  MANLPRFGRAWQRISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS----PAKKALSPAVSPKYNATVTRVA--------SPPVSP

Query:  SRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFS
         RKS D RHSISPNSYQ+ +KPT P LS LALPKSA+VT + S I PEVE+K   +KK     DRQ +  S KPP   +A+AINL G N+GAVM++ QFS
Subjt:  SRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFS

Query:  DK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGGEKSKY
        DK  GGEV RKIET++ V      EK  R T FPMTP  NSNFQEVNNSVMYNSSCSGRDPGLHLDFSG+ KD+ A VDG +KSKY
Subjt:  DK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGGEKSKY

XP_022147458.1 vegetative cell wall protein gp1-like [Momordica charantia]7.5e-5352.4Show/hide
Query:  MANLPRFGRAWQR--ISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKA---LSPAVSPKYNATVTRVA--------SPPVS
        MANLPRFGR WQR  + AP PAPAA  P+           PTTQ+LQPFEPN PP A+  SSP K +     P+ SPKY A   RV         SPP S
Subjt:  MANLPRFGRAWQR--ISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKA---LSPAVSPKYNATVTRVA--------SPPVS

Query:  PSRKSPDRRH--SISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVE---KVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEV
        P+ K  DR H  +  P S  K+ +  TPPLSPLALP++AN  A+ S +LPEVEKK   Y K VE   K  R +E+GSGKPPQ AE INL GHNVGAVME+
Subjt:  PSRKSPDRRH--SISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVE---KVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEV

Query:  KQFSDKR-GGEVIRKIETESDVTAGAKEKGDRATG----FPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESK-DDGAIVDGGEKS
         Q S K   GE+I+K E+E+++        +R TG     P T +MNSNFQ VNNSV+Y+SSCS RDPGLHL FS  +    GAIVDG  K+
Subjt:  KQFSDKR-GGEVIRKIETESDVTAGAKEKGDRATG----FPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESK-DDGAIVDGGEKS

TrEMBL top hitse value%identityAlignment
A0A0A0L8G8 Uncharacterized protein2.2e-7461.89Show/hide
Query:  MANLPRFGRAWQRISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS----PAKKALSPAVSPKYNATVTRVA--------SPPVSP
        MANLPR+GR W+  +  R AP A Q   EPEPEILPLAPT Q+LQPFEP PP  AA  SS    P  +   PA SPKY ATV R A        SPPVSP
Subjt:  MANLPRFGRAWQRISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS----PAKKALSPAVSPKYNATVTRVA--------SPPVSP

Query:  SRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFS
         RKS D RHSISPNSYQ+ +KPT P LS LALPKSA+VT + S I PEVE+K   +KK     DRQ +  S KPP   +A+AINL G N+GAVM++ QFS
Subjt:  SRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFS

Query:  DK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGGEKSKY
        DK  GGEV RKIET++ V      EK  R T FPMTP  NSNFQEVNNSVMYNSSCSGRDPGLHLDFSG+ KD+ A VDG +KSKY
Subjt:  DK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGGEKSKY

A0A1S3AVL7 WW domain-binding protein 11-like1.5e-5159.11Show/hide
Query:  PPPVAAWISSPAKKALSPAVSPKYNATVTRVA--------SPPVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIG
        PP       +P+ +   PA SPKY ATV   A        SPPVSP RKS + R+SISPNSYQ+ +KPTTP LS L LPKS++VT I S I PEVE+K G
Subjt:  PPPVAAWISSPAKKALSPAVSPKYNATVTRVA--------SPPVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIG

Query:  PYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNS
        P KK     D Q EY SG PP   +A+AINLAG N+GAVM++ QFSDK  GGEV RKIETE+ V     +EK  R T FPMTP  NSNFQEVNNSV+YNS
Subjt:  PYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEVKQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNS

Query:  SCSGRDPGLHLDFSGESKDDGAIVD
        SC+ RDPGLHLDFSG+ KD+ A V+
Subjt:  SCSGRDPGLHLDFSGESKDDGAIVD

A0A5D3DB96 WW domain-binding protein 11-like1.0e-7161.84Show/hide
Query:  MANLPRFGRAWQRISA-PRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS-------PAKKALSPAVSPKYNATVTRVA--------SP
        MANLPR+GR WQR+S+ PR APAAA+   EPEPEILPLAPT Q+LQ FEP PPPVAA   S       P+ +   PA SPKY ATV   A        SP
Subjt:  MANLPRFGRAWQRISA-PRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISS-------PAKKALSPAVSPKYNATVTRVA--------SP

Query:  PVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEV
        PVSP RKS + R+SISPNSYQ+ +KPTTP LS L LPKS++VT I S I PEVE+K GP KK     D Q EY SG PP   +A+AINLAG N+GAVM++
Subjt:  PVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQ--KAEAINLAGHNVGAVMEV

Query:  KQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVD
         QFSDK  GGEV RKIETE+ V     +EK  R T FPMTP  NSNFQEVNNSV+YNSSC+ RDPGLHLDFSG+ KD+ A V+
Subjt:  KQFSDK-RGGEVIRKIETESDV-TAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVD

A0A6J1D1D2 vegetative cell wall protein gp1-like3.6e-5352.4Show/hide
Query:  MANLPRFGRAWQR--ISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKA---LSPAVSPKYNATVTRVA--------SPPVS
        MANLPRFGR WQR  + AP PAPAA  P+           PTTQ+LQPFEPN PP A+  SSP K +     P+ SPKY A   RV         SPP S
Subjt:  MANLPRFGRAWQR--ISAPRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKA---LSPAVSPKYNATVTRVA--------SPPVS

Query:  PSRKSPDRRH--SISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVE---KVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEV
        P+ K  DR H  +  P S  K+ +  TPPLSPLALP++AN  A+ S +LPEVEKK   Y K VE   K  R +E+GSGKPPQ AE INL GHNVGAVME+
Subjt:  PSRKSPDRRH--SISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVE---KVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEV

Query:  KQFSDKR-GGEVIRKIETESDVTAGAKEKGDRATG----FPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESK-DDGAIVDGGEKS
         Q S K   GE+I+K E+E+++        +R TG     P T +MNSNFQ VNNSV+Y+SSCS RDPGLHL FS  +    GAIVDG  K+
Subjt:  KQFSDKR-GGEVIRKIETESDVTAGAKEKGDRATG----FPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESK-DDGAIVDGGEKS

A0A6J1GHE5 sulfated surface glycoprotein 185-like2.0e-4346.18Show/hide
Query:  MANLPRFGRAWQRIS-APRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKALSPAVSPKYNATVTRV---------ASPPVSPSR
        M+N PRFG + QR S A  P      PA+E +PE  P    +QSLQ               PA+K  SP  SPKY  +VTRV          SPPVSP++
Subjt:  MANLPRFGRAWQRIS-APRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKALSPAVSPKYNATVTRV---------ASPPVSPSR

Query:  KSPDRRHSISP-NSYQKTVKPT-TPPLSPLALP----KSANVTAIQSKILPEVEKKIGPYKKPVEKV---DRQTEYGSGKPPQK---AEAINLAGHNVGA
        K PDR    SP  S  ++++ +  PP  PLALP     + N T  Q +I PEVEKK   Y K VEK+   DR +EYGSGKP +K    E+INLAGHNVGA
Subjt:  KSPDRRHSISP-NSYQKTVKPT-TPPLSPLALP----KSANVTAIQSKILPEVEKKIGPYKKPVEKV---DRQTEYGSGKPPQK---AEAINLAGHNVGA

Query:  VMEVKQFS--DKRGGEVIRKIETESDVTAG--------AKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGGEKS
        VME+ + S   + GGE +RK ETE     G         K+K  +    PMT +MNSNFQ VNNSV+Y+SSCS RDPGLHL F+  +  DGA VDG +  
Subjt:  VMEVKQFS--DKRGGEVIRKIETESDVTAG--------AKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGAIVDGGEKS

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G63310.1 unknown protein8.7e-0737.93Show/hide
Query:  INLAGHNVGAVMEVKQFSDKRGGEVIRKIETESDVTAGAKEK-GDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGE
        I L+G N+GA M                 +TE D   G +++ GD    F ++ Y+NSNFQ VNNS+M  +     DPG+HLD SG+
Subjt:  INLAGHNVGAVMEVKQFSDKRGGEVIRKIETESDVTAGAKEK-GDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGE

AT1G75260.1 oxidoreductases, acting on NADH or NADPH1.4e-0427.53Show/hide
Query:  PSRKSPDRRHSISP--NSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEVKQF
        P +K+  +  +  P  +++QKT    T  L    +      +++  KI  ++           + + + T   S    +      L G N GA M +   
Subjt:  PSRKSPDRRHSISP--NSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKKIGPYKKPVEKVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEVKQF

Query:  SDKRGGEV-IRK-----IETESDVTAGAKE--KGDRA-TGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGE
         DK+ GEV IR+      +  S+ TA   E  K D A      T Y+N N Q +NNS++  SS S  DPG+H+ F  E
Subjt:  SDKRGGEV-IRK-----IETESDVTAGAKE--KGDRA-TGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGE

AT2G46630.1 unknown protein1.5e-0623.71Show/hide
Query:  PRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKALSPAVSPKYNATVTRVASPPVSPSRKSPDRRHSISPNSYQKTVKPTTP--P
        PR  P    P S P  +  PL P  Q   P   +PP   +   SP  + +SP   PK        A+PP  P R S       SP   Q+ + P  P  P
Subjt:  PRPAPAAAQPASEPEPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKALSPAVSPKYNATVTRVASPPVSPSRKSPDRRHSISPNSYQKTVKPTTP--P

Query:  LSPLALPKSANVTAIQSKILPEVE--------KKIGPYKKPV----------------------------------------------------------
         SP    +S    +++++   E E        + + PY  P                                                           
Subjt:  LSPLALPKSANVTAIQSKILPEVE--------KKIGPYKKPV----------------------------------------------------------

Query:  --EKVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEVKQF--SDKRG-------------GEVIRKIETESDVTAGAKEKGDRAT---------GFPMTPY
          +K+ RQ      +       I +AG N GAVME+ +    +K G             GE  R++++ S  ++   E   + T           PM  +
Subjt:  --EKVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEVKQF--SDKRG-------------GEVIRKIETESDVTAGAKEKGDRAT---------GFPMTPY

Query:  MNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGA--IVDGGEKSKY
        MNSN Q +NNS++YNS+ S  DPG+HL  S +   D    + D G    Y
Subjt:  MNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGESKDDGA--IVDGGEKSKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCTCTCTCTCTCTCAAGTCTCAAGCTTTCCCATTTCTCAGTTCTTGCCTCCTGCAAATCTTCCTCTCTTCAGAATCGCTTCATCTTCTCCTTGAATTCTTCATC
ATTTTCAGCCTTACGCCTTCTCTTACAGAGATCTTCACTGGTTCCAACTTCCAACTATAAATTGTTGCAGAGACAGAGCTTCCATTATCACTGTCTTCTCAATCTCAAAC
TTTCTATTTCTCTGTTCATAATGGCCAATCTTCCTCGCTTTGGTCGGGCATGGCAACGTATTTCCGCCCCCCGCCCTGCTCCGGCCGCCGCACAGCCTGCTTCAGAACCA
GAGCCTGAGATTCTGCCCTTAGCTCCGACCACCCAATCTCTGCAACCTTTCGAACCCAACCCGCCGCCTGTGGCTGCTTGGATTTCCTCTCCGGCGAAGAAAGCGCTCTC
ACCCGCCGTATCGCCGAAATATAATGCTACCGTCACACGTGTGGCCAGCCCGCCGGTCTCCCCTTCACGAAAATCTCCCGATCGGAGACATTCAATTAGCCCTAATTCGT
ATCAGAAAACCGTCAAGCCAACTACTCCACCACTTTCCCCTCTGGCTCTGCCGAAATCTGCGAATGTGACGGCGATTCAATCCAAAATCCTGCCGGAGGTGGAGAAGAAA
ATCGGTCCGTACAAGAAGCCCGTCGAGAAGGTGGATCGGCAGACGGAGTACGGCTCCGGTAAGCCGCCGCAGAAGGCGGAGGCTATAAACCTCGCCGGACATAACGTCGG
CGCGGTCATGGAAGTAAAACAATTCTCCGATAAACGAGGCGGAGAAGTCATCAGGAAGATCGAAACAGAAAGCGACGTAACGGCGGGGGCAAAGGAGAAAGGTGACAGAG
CAACCGGATTTCCGATGACGCCATACATGAACAGCAATTTTCAAGAAGTGAACAATTCTGTAATGTATAATTCGTCGTGCAGTGGCCGTGATCCAGGGCTGCACCTTGAT
TTCTCCGGCGAGTCGAAGGATGATGGAGCCATTGTCGACGGCGGCGAGAAATCTAAGTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCTCTCTCTCTCTCAAGTCTCAAGCTTTCCCATTTCTCAGTTCTTGCCTCCTGCAAATCTTCCTCTCTTCAGAATCGCTTCATCTTCTCCTTGAATTCTTCATC
ATTTTCAGCCTTACGCCTTCTCTTACAGAGATCTTCACTGGTTCCAACTTCCAACTATAAATTGTTGCAGAGACAGAGCTTCCATTATCACTGTCTTCTCAATCTCAAAC
TTTCTATTTCTCTGTTCATAATGGCCAATCTTCCTCGCTTTGGTCGGGCATGGCAACGTATTTCCGCCCCCCGCCCTGCTCCGGCCGCCGCACAGCCTGCTTCAGAACCA
GAGCCTGAGATTCTGCCCTTAGCTCCGACCACCCAATCTCTGCAACCTTTCGAACCCAACCCGCCGCCTGTGGCTGCTTGGATTTCCTCTCCGGCGAAGAAAGCGCTCTC
ACCCGCCGTATCGCCGAAATATAATGCTACCGTCACACGTGTGGCCAGCCCGCCGGTCTCCCCTTCACGAAAATCTCCCGATCGGAGACATTCAATTAGCCCTAATTCGT
ATCAGAAAACCGTCAAGCCAACTACTCCACCACTTTCCCCTCTGGCTCTGCCGAAATCTGCGAATGTGACGGCGATTCAATCCAAAATCCTGCCGGAGGTGGAGAAGAAA
ATCGGTCCGTACAAGAAGCCCGTCGAGAAGGTGGATCGGCAGACGGAGTACGGCTCCGGTAAGCCGCCGCAGAAGGCGGAGGCTATAAACCTCGCCGGACATAACGTCGG
CGCGGTCATGGAAGTAAAACAATTCTCCGATAAACGAGGCGGAGAAGTCATCAGGAAGATCGAAACAGAAAGCGACGTAACGGCGGGGGCAAAGGAGAAAGGTGACAGAG
CAACCGGATTTCCGATGACGCCATACATGAACAGCAATTTTCAAGAAGTGAACAATTCTGTAATGTATAATTCGTCGTGCAGTGGCCGTGATCCAGGGCTGCACCTTGAT
TTCTCCGGCGAGTCGAAGGATGATGGAGCCATTGTCGACGGCGGCGAGAAATCTAAGTATTAA
Protein sequenceShow/hide protein sequence
MLSLSLSSLKLSHFSVLASCKSSSLQNRFIFSLNSSSFSALRLLLQRSSLVPTSNYKLLQRQSFHYHCLLNLKLSISLFIMANLPRFGRAWQRISAPRPAPAAAQPASEP
EPEILPLAPTTQSLQPFEPNPPPVAAWISSPAKKALSPAVSPKYNATVTRVASPPVSPSRKSPDRRHSISPNSYQKTVKPTTPPLSPLALPKSANVTAIQSKILPEVEKK
IGPYKKPVEKVDRQTEYGSGKPPQKAEAINLAGHNVGAVMEVKQFSDKRGGEVIRKIETESDVTAGAKEKGDRATGFPMTPYMNSNFQEVNNSVMYNSSCSGRDPGLHLD
FSGESKDDGAIVDGGEKSKY