; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027041 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027041
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionKAT8 regulatory NSL complex subunit 1 like
Genome locationchr10:44533629..44537607
RNA-Seq ExpressionLag0027041
SyntenyLag0027041
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580620.1 hypothetical protein SDJN03_20622, partial [Cucurbita argyrosperma subsp. sororia]1.7e-10688.98Show/hide
Query:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ
        MAD SLSLCFSS    F ISRSL L          SSPRF +S HHRPSRLLR SVKSSASGSF GDDSFGLFPWTDGD+EIHWVPEERVTLFTPDGLVQ
Subjt:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ

Query:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
        IGGSIVPRRIS SDKK GKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
Subjt:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ

Query:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

XP_008442709.1 PREDICTED: uncharacterized protein LOC103486501 [Cucumis melo]1.2e-10788.84Show/hide
Query:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGD-DSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG
        MAD SLS  FSSFS   SL LSPSF+ HPF+ SP+FP+S HHRPS LLR S+KSS+SG F GD DSFGLFPW DGDSEIHWVPEERVTLFTPDGLVQIGG
Subjt:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGD-DSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRISSSDKK GKSK  QRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        TMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

XP_022934142.1 uncharacterized protein LOC111441404 isoform X1 [Cucurbita moschata]1.3e-10688.98Show/hide
Query:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ
        MAD SLSLCFSS    F ISRSL L          SSPRF +S HHRPSRLLR SVKSSASGSF GDDSFGLFPWTDGD+EIHWVPEERVTLFTPDGLVQ
Subjt:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ

Query:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
        IGGSIVPRRIS SDKK GKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
Subjt:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ

Query:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

XP_022983093.1 uncharacterized protein LOC111481743 [Cucurbita maxima]1.3e-10688.56Show/hide
Query:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ
        M D SLSLCFSS    F ISRSL L          SSPRF +S HHRPSRLLR S+KSSASGSF GDDSFGLFPWTDGD+EIHWVPEERVTLFTPDGLVQ
Subjt:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ

Query:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
        IGGSIVPRRISSSDKK GKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
Subjt:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ

Query:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

XP_038905101.1 uncharacterized protein LOC120091234 [Benincasa hispida]2.3e-11493.53Show/hide
Query:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGGS
        MAD SLSLCFSSFSISRSL LSPSF+LHPF+ SPRF VS HHRPSRLLR S+KSS SGSF GDDSFGLFPW+DGDSEIHWVPEERVTLFTPDGLVQIGGS
Subjt:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGGS

Query:  IVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
        IVPRRISSSDKK GKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCI+GFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
Subjt:  IVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT

Query:  MTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        MTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  MTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

TrEMBL top hitse value%identityAlignment
A0A1S3B735 uncharacterized protein LOC1034865015.8e-10888.84Show/hide
Query:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGD-DSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG
        MAD SLS  FSSFS   SL LSPSF+ HPF+ SP+FP+S HHRPS LLR S+KSS+SG F GD DSFGLFPW DGDSEIHWVPEERVTLFTPDGLVQIGG
Subjt:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGD-DSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRISSSDKK GKSK  QRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        TMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

A0A5D3DPB2 Uncharacterized protein5.8e-10888.84Show/hide
Query:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGD-DSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG
        MAD SLS  FSSFS   SL LSPSF+ HPF+ SP+FP+S HHRPS LLR S+KSS+SG F GD DSFGLFPW DGDSEIHWVPEERVTLFTPDGLVQIGG
Subjt:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGD-DSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRISSSDKK GKSK  QRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        TMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

A0A6J1CTU7 uncharacterized protein LOC111014232 isoform X18.4e-10788.19Show/hide
Query:  MADASLSLCFSSFS----ISRSLDLSPSFVLHPFVSSPRFPVS-HHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLV
        MA+AS +LCFSSFS    ISRSLDLSPS     F+SSPRF  S  HHRPSRLLR SV+SS SGSF GDDS GLFPW DG SEIHWVPEERVTLFTPDGLV
Subjt:  MADASLSLCFSSFS----ISRSLDLSPSFVLHPFVSSPRFPVS-HHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLV

Query:  QIGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
        QIGGSIVPRRISSSDKK GKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
Subjt:  QIGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL

Query:  QEKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        QEKLTMTVAVPLLWGVPPASETLH AVQSGGGIVEK+
Subjt:  QEKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

A0A6J1F1V4 uncharacterized protein LOC111441404 isoform X16.4e-10788.98Show/hide
Query:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ
        MAD SLSLCFSS    F ISRSL L          SSPRF +S HHRPSRLLR SVKSSASGSF GDDSFGLFPWTDGD+EIHWVPEERVTLFTPDGLVQ
Subjt:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ

Query:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
        IGGSIVPRRIS SDKK GKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
Subjt:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ

Query:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

A0A6J1J6S7 uncharacterized protein LOC1114817436.4e-10788.56Show/hide
Query:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ
        M D SLSLCFSS    F ISRSL L          SSPRF +S HHRPSRLLR S+KSSASGSF GDDSFGLFPWTDGD+EIHWVPEERVTLFTPDGLVQ
Subjt:  MADASLSLCFSS----FSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQ

Query:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
        IGGSIVPRRISSSDKK GKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ
Subjt:  IGGSIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQ

Query:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEK+
Subjt:  EKLTMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36895.1 unknown protein8.9e-7765.24Show/hide
Query:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRF-PVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG
        MA+ S +L FS+F  S  L +SP    HP  S+ RF  +    RPS   R +VK+S  G+F  DD+F  FPW+D ++EI WVPEER+TLFT DGLVQIGG
Subjt:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRF-PVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        ++VPRRI SS+KK G+S++ ++ Q+F ES YMDP Q +CLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVED VLE GGE+VA E  S  GLQEKL
Subjt:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        TMTVAVP LWGVPPA+E LHLAV++GGGIV+K+
Subjt:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL

AT2G36895.2 unknown protein6.4e-7564.81Show/hide
Query:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRF-PVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG
        MA+ S +L FS+F  S  L +SP    HP  S+ RF  +    RPS   R +VK+S  G+F  DD+F  FPW+D ++EI WVPEER+TLFT DGLVQIGG
Subjt:  MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRF-PVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        ++VPRRI SS+K  G+S++ ++ Q+F ES YMDP Q +CLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVED VLE GGE+VA E  S  GLQEKL
Subjt:  SIVPRRISSSDKKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL
        TMTVAVP LWGVPPA+E LHLAV++GGGIV+K+
Subjt:  TMTVAVPLLWGVPPASETLHLAVQSGGGIVEKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGACGCTTCTCTTTCCTTGTGTTTCTCTTCCTTCTCCATTTCCCGCTCCCTCGACCTTTCCCCCTCTTTCGTCTTACACCCTTTTGTTTCCTCTCCTAGATTCCC
CGTCTCCCATCATCATCGCCCATCTCGTCTTCTTCGTCTCTCCGTCAAATCCTCCGCCTCTGGAAGCTTCCCCGGCGACGATTCTTTCGGATTGTTTCCTTGGACCGATG
GCGATAGCGAAATCCATTGGGTTCCTGAAGAGAGAGTCACATTGTTCACTCCTGATGGGCTTGTTCAGATTGGAGGCTCCATTGTTCCTAGACGCATCTCTTCTTCAGAT
AAAAAACTAGGGAAATCAAAAGCTTACCAAAGATTCCAACGGTTTCAAGAGAGTGATTACATGGATCCAAAGCAGAGCATATGTCTTGGTGCACTGTTTGATATTGCAGC
CACCAATGGACTCGACATGGGACGAAGACTTTGTATCTTTGGTTTCTGCCGTTCGGTCGAGATGCTCAGTGATGTTGTGGAAGACATTGTTTTGGAGCAAGGTGGAGAGG
TTGTAGCAGCAGAGAAAGCAAGTAAAGGGGGTTTGCAGGAGAAGCTAACCATGACAGTTGCTGTGCCACTTCTCTGGGGGGTTCCTCCTGCTTCTGAAACTCTTCACTTA
GCTGTTCAGAGTGGTGGAGGAATTGTGGAGAAACTGACTGTTTGGAGTGATATGGTTGTTTGGGCTGATTCTACACTTTGTTGGGTTGCTAGTGTGTTATCAAAAGCCCA
TAATGGATTTGTCATTTTAGTCAGAGATGTGCTCCATAAAAGAGCAAACGAAGGCGACACGACACGAACCCGAAGACACGGCAAGAAGACGGACCCTGGAGGAGAAACAG
ACCAGAGGTTGGGCCAAGGTCGGAGGGATTGGGCCTTGGTCCAACCCCCATGCATGGTCTGGGCCTCGGCCGACCCAGCGAACCGAGTTCCCTTCCCTCCGTTTGGTCCC
TGGTGCTCGTGGGTCGCCCCAATCCCACCTGGTTCAGCTCGAATAGTCTCCAAGAGCCTAGAAACCCTAGAACAGGAACAAGTATATAGACCATTCTTCGTCACTGAAGA
AGGGATCCCCGAAACTCATTCTCTGAAGCCTATTCCTCTTCTTGCTCTCTTGCTCACTTCCTTCAGCTTTCTGACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGACGCTTCTCTTTCCTTGTGTTTCTCTTCCTTCTCCATTTCCCGCTCCCTCGACCTTTCCCCCTCTTTCGTCTTACACCCTTTTGTTTCCTCTCCTAGATTCCC
CGTCTCCCATCATCATCGCCCATCTCGTCTTCTTCGTCTCTCCGTCAAATCCTCCGCCTCTGGAAGCTTCCCCGGCGACGATTCTTTCGGATTGTTTCCTTGGACCGATG
GCGATAGCGAAATCCATTGGGTTCCTGAAGAGAGAGTCACATTGTTCACTCCTGATGGGCTTGTTCAGATTGGAGGCTCCATTGTTCCTAGACGCATCTCTTCTTCAGAT
AAAAAACTAGGGAAATCAAAAGCTTACCAAAGATTCCAACGGTTTCAAGAGAGTGATTACATGGATCCAAAGCAGAGCATATGTCTTGGTGCACTGTTTGATATTGCAGC
CACCAATGGACTCGACATGGGACGAAGACTTTGTATCTTTGGTTTCTGCCGTTCGGTCGAGATGCTCAGTGATGTTGTGGAAGACATTGTTTTGGAGCAAGGTGGAGAGG
TTGTAGCAGCAGAGAAAGCAAGTAAAGGGGGTTTGCAGGAGAAGCTAACCATGACAGTTGCTGTGCCACTTCTCTGGGGGGTTCCTCCTGCTTCTGAAACTCTTCACTTA
GCTGTTCAGAGTGGTGGAGGAATTGTGGAGAAACTGACTGTTTGGAGTGATATGGTTGTTTGGGCTGATTCTACACTTTGTTGGGTTGCTAGTGTGTTATCAAAAGCCCA
TAATGGATTTGTCATTTTAGTCAGAGATGTGCTCCATAAAAGAGCAAACGAAGGCGACACGACACGAACCCGAAGACACGGCAAGAAGACGGACCCTGGAGGAGAAACAG
ACCAGAGGTTGGGCCAAGGTCGGAGGGATTGGGCCTTGGTCCAACCCCCATGCATGGTCTGGGCCTCGGCCGACCCAGCGAACCGAGTTCCCTTCCCTCCGTTTGGTCCC
TGGTGCTCGTGGGTCGCCCCAATCCCACCTGGTTCAGCTCGAATAGTCTCCAAGAGCCTAGAAACCCTAGAACAGGAACAAGTATATAGACCATTCTTCGTCACTGAAGA
AGGGATCCCCGAAACTCATTCTCTGAAGCCTATTCCTCTTCTTGCTCTCTTGCTCACTTCCTTCAGCTTTCTGACTTAA
Protein sequenceShow/hide protein sequence
MADASLSLCFSSFSISRSLDLSPSFVLHPFVSSPRFPVSHHHRPSRLLRLSVKSSASGSFPGDDSFGLFPWTDGDSEIHWVPEERVTLFTPDGLVQIGGSIVPRRISSSD
KKLGKSKAYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLTMTVAVPLLWGVPPASETLHL
AVQSGGGIVEKLTVWSDMVVWADSTLCWVASVLSKAHNGFVILVRDVLHKRANEGDTTRTRRHGKKTDPGGETDQRLGQGRRDWALVQPPCMVWASADPANRVPFPPFGP
WCSWVAPIPPGSARIVSKSLETLEQEQVYRPFFVTEEGIPETHSLKPIPLLALLLTSFSFLT