; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028511 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028511
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptiontranscription factor bHLH110
Genome locationtig00153204:1788112..1794618
RNA-Seq ExpressionSgr028511
SyntenySgr028511
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020490.1 Transcription factor bHLH, partial [Cucurbita argyrosperma subsp. argyrosperma]7.2e-5660.66Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEV-KEGEASASSPKTT----
        MDFSAN HHQ LQLENQLLL + + HAW  DVT SPSCDFEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  +E EASASSPKT     
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEV-KEGEASASSPKTT----

Query:  ------EDLNKISENCYC---SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKI
              EDLNKI+++C C   +KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KI
Subjt:  ------EDLNKISENCYC---SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKI

Query:  RKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI
        RKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE+
Subjt:  RKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI

XP_022144323.1 transcription factor bHLH110-like isoform X1 [Momordica charantia]8.2e-6866.53Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPGID-AESFTSLQHSHGLGSQDQKFGEVKEGEASASSPKTTEDLNKISEN
        MD SANLHHQLLQLENQLL+ + SSHAWT D+ LSPS DFEP+FH QDLNPGID AESFTSLQ  H      QKFGE KE EAS SSPK TEDL+K S++
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPGID-AESFTSLQHSHGLGSQDQKFGEVKEGEASASSPKTTEDLNKISEN

Query:  CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASSHHLLQQSRTSSL----------------------------------------PTKIRKEKL
        C CSKIH TVNVSSRKPL MAE SSSLGFD S L FASSH+L+QQSR SSL                                        P KIRKEKL
Subjt:  CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASSHHLLQQSRTSSL----------------------------------------PTKIRKEKL

Query:  GDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI
        GDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE+
Subjt:  GDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI

XP_022951393.1 uncharacterized protein LOC111454232 isoform X1 [Cucurbita moschata]5.0e-5761.16Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----
        MDFSAN HHQ LQLENQLLL + + HAW  DVT SPSCDFEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  E EASASSPKT      
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----

Query:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK
             EDLNKI+++C C  +KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KIRK
Subjt:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK

Query:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI
        EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE+
Subjt:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI

XP_022951394.1 uncharacterized protein LOC111454232 isoform X2 [Cucurbita moschata]5.5e-5660.83Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----
        MDFSAN HHQ LQLENQLLL + + HAW  DVT SPSCDFEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  E EASASSPKT      
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----

Query:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK
             EDLNKI+++C C  +KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KIRK
Subjt:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK

Query:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQV
        EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQ+
Subjt:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQV

XP_023002435.1 transcription factor bHLH69-like isoform X1 [Cucurbita maxima]1.1e-5661Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----
        MDFSANLHHQ LQLENQLLL + + HAW  DVT SPSC+FEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  E EASASSPKT      
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----

Query:  -----EDLNKISEN-CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRKE
             EDLNK++++ C C+KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KIRKE
Subjt:  -----EDLNKISEN-CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRKE

Query:  KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI
        KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE+
Subjt:  KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI

TrEMBL top hitse value%identityAlignment
A0A6J1CSZ7 transcription factor bHLH110-like isoform X14.0e-6866.53Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPGID-AESFTSLQHSHGLGSQDQKFGEVKEGEASASSPKTTEDLNKISEN
        MD SANLHHQLLQLENQLL+ + SSHAWT D+ LSPS DFEP+FH QDLNPGID AESFTSLQ  H      QKFGE KE EAS SSPK TEDL+K S++
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPGID-AESFTSLQHSHGLGSQDQKFGEVKEGEASASSPKTTEDLNKISEN

Query:  CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASSHHLLQQSRTSSL----------------------------------------PTKIRKEKL
        C CSKIH TVNVSSRKPL MAE SSSLGFD S L FASSH+L+QQSR SSL                                        P KIRKEKL
Subjt:  CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASSHHLLQQSRTSSL----------------------------------------PTKIRKEKL

Query:  GDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI
        GDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE+
Subjt:  GDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI

A0A6J1GHK6 uncharacterized protein LOC111454232 isoform X12.4e-5761.16Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----
        MDFSAN HHQ LQLENQLLL + + HAW  DVT SPSCDFEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  E EASASSPKT      
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----

Query:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK
             EDLNKI+++C C  +KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KIRK
Subjt:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK

Query:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI
        EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE+
Subjt:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI

A0A6J1GIL8 uncharacterized protein LOC111454232 isoform X22.7e-5660.83Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----
        MDFSAN HHQ LQLENQLLL + + HAW  DVT SPSCDFEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  E EASASSPKT      
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----

Query:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK
             EDLNKI+++C C  +KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KIRK
Subjt:  -----EDLNKISENCYC--SKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRK

Query:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQV
        EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQ+
Subjt:  EKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQV

A0A6J1KLA7 transcription factor bHLH69-like isoform X15.4e-5761Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----
        MDFSANLHHQ LQLENQLLL + + HAW  DVT SPSC+FEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  E EASASSPKT      
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----

Query:  -----EDLNKISEN-CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRKE
             EDLNK++++ C C+KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KIRKE
Subjt:  -----EDLNKISEN-CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRKE

Query:  KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI
        KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE+
Subjt:  KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEI

A0A6J1KTJ7 transcription factor bHLH69-like isoform X26.0e-5660.67Show/hide
Query:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----
        MDFSANLHHQ LQLENQLLL + + HAW  DVT SPSC+FEPSFHLQD+NP   ID ES   +TSLQH +  GS+D+KF E  E EASASSPKT      
Subjt:  MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPG--IDAES---FTSLQHSHGLGSQDQKFGEVKEGEASASSPKTT-----

Query:  -----EDLNKISEN-CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRKE
             EDLNK++++ C C+KIH T+NVSSR+PL MAELSS  GFDH+ +PF  S     H + Q  R  SL                      P KIRKE
Subjt:  -----EDLNKISEN-CYCSKIHHTVNVSSRKPLMMAELSSSLGFDHSLLPFASS-----HHLLQQSRTSSL----------------------PTKIRKE

Query:  KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQV
        KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQ+
Subjt:  KLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQV

SwissProt top hitse value%identityAlignment
F4IHJ0 Phosphatidylinositol/phosphatidylcholine transfer protein SFH82.3e-2056Show/hide
Query:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRK-GRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR
        EGF   D K  R+SD E +EDE+++ I  LK KA NAS+K + SL++K GRRKSDG     S+EDV DVEEL  V  FR +L++DELLP +HDDYHM+ R
Subjt:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRK-GRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR

F4JVA6 Phosphatidylinositol/phosphatidylcholine transfer protein SFH67.3e-1955.56Show/hide
Query:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRKGRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR
        EG    D K  R+SD E +EDE+++ I  LK KA NAS+K + SL++K RRKSD      S+EDV DVEEL  V EFR AL+++ELLP KHDDYHM+ R
Subjt:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRKGRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR

Q8GXC6 Phosphatidylinositol/phosphatidylcholine transfer protein SFH59.9e-1647.57Show/hide
Query:  EGFSCHDGKGPRQSDCEIAEDEKQS--GIVKLKNKAKNASSKVRPSLQRKG--RRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHM
        EG S +D +  R+SD E++EDEK++  G    K KA  ASSK+R SL++KG  RR+S   +   ++ED+ DVEEL  V EFR+ L+ + LLPP  DDYH+
Subjt:  EGFSCHDGKGPRQSDCEIAEDEKQS--GIVKLKNKAKNASSKVRPSLQRKG--RRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHM

Query:  LWR
        + R
Subjt:  LWR

Q8S3D1 Transcription factor bHLH682.1e-1371.43Show/hide
Query:  LQQSRTSSLPTKIRKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE
        LQ S +S    K+RKEKLG RIAAL QLV+PFGKTDTASVL EAI YI FLQ+Q+E
Subjt:  LQQSRTSSLPTKIRKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE

Q9SFZ3 Transcription factor bHLH1103.1e-1750.45Show/hide
Query:  QSRTSSLPTKIRKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE-IQFPAKRRRRSLPRGGGFPATSGHTSGTLAKTQLDGGLRHLRFVD
        +SR+S  P K+RKEKLGDRIAALQQLV+PFGKTDTASVLMEAI YI FLQ+Q+E +  P  R  R+ P         G  S  ++++Q +G     R + 
Subjt:  QSRTSSLPTKIRKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE-IQFPAKRRRRSLPRGGGFPATSGHTSGTLAKTQLDGGLRHLRFVD

Query:  HRHIRLEGFSC
         R + L   SC
Subjt:  HRHIRLEGFSC

Arabidopsis top hitse value%identityAlignment
AT1G27660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.2e-1850.45Show/hide
Query:  QSRTSSLPTKIRKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE-IQFPAKRRRRSLPRGGGFPATSGHTSGTLAKTQLDGGLRHLRFVD
        +SR+S  P K+RKEKLGDRIAALQQLV+PFGKTDTASVLMEAI YI FLQ+Q+E +  P  R  R+ P         G  S  ++++Q +G     R + 
Subjt:  QSRTSSLPTKIRKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVE-IQFPAKRRRRSLPRGGGFPATSGHTSGTLAKTQLDGGLRHLRFVD

Query:  HRHIRLEGFSC
         R + L   SC
Subjt:  HRHIRLEGFSC

AT2G21520.1 Sec14p-like phosphatidylinositol transfer family protein1.6e-2156Show/hide
Query:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRK-GRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR
        EGF   D K  R+SD E +EDE+++ I  LK KA NAS+K + SL++K GRRKSDG     S+EDV DVEEL  V  FR +L++DELLP +HDDYHM+ R
Subjt:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRK-GRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR

AT2G21520.2 Sec14p-like phosphatidylinositol transfer family protein1.6e-2156Show/hide
Query:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRK-GRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR
        EGF   D K  R+SD E +EDE+++ I  LK KA NAS+K + SL++K GRRKSDG     S+EDV DVEEL  V  FR +L++DELLP +HDDYHM+ R
Subjt:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRK-GRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR

AT4G39170.1 Sec14p-like phosphatidylinositol transfer family protein5.2e-2055.56Show/hide
Query:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRKGRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR
        EG    D K  R+SD E +EDE+++ I  LK KA NAS+K + SL++K RRKSD      S+EDV DVEEL  V EFR AL+++ELLP KHDDYHM+ R
Subjt:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRKGRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR

AT4G39170.2 Sec14p-like phosphatidylinositol transfer family protein5.2e-2055.56Show/hide
Query:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRKGRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR
        EG    D K  R+SD E +EDE+++ I  LK KA NAS+K + SL++K RRKSD      S+EDV DVEEL  V EFR AL+++ELLP KHDDYHM+ R
Subjt:  EGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRKGRRKSDGPSDHASVEDVWDVEELHIVGEFRDALIVDELLPPKHDDYHMLWR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCTCTGCAAATCTCCATCACCAACTTCTTCAACTTGAAAACCAGTTGCTTCTTAGTGCTGCAAGCAGCCATGCCTGGACCTCCGACGTCACGTTGTCCCCAAG
CTGTGACTTTGAGCCAAGTTTCCATCTTCAAGACCTGAATCCAGGCATTGATGCAGAGAGCTTCACAAGCCTGCAGCATTCCCATGGCCTTGGTAGCCAAGACCAGAAGT
TTGGAGAAGTGAAAGAGGGGGAGGCCTCAGCTTCTTCTCCCAAAACCACTGAAGATTTGAACAAGATTTCTGAAAACTGTTACTGCAGCAAGATTCATCATACTGTTAAT
GTTTCTTCAAGGAAGCCATTGATGATGGCAGAGTTGTCAAGCTCCTTGGGGTTTGACCATAGCCTTCTTCCTTTTGCTTCTTCACATCATCTCTTGCAGCAATCAAGGAC
CTCATCTCTTCCTACTAAAATTAGGAAAGAGAAATTAGGAGATAGAATTGCAGCACTTCAGCAATTAGTGGCACCCTTTGGTAAGACTGACACAGCATCAGTTCTGATGG
AGGCCATTAGTTATATAAATTTCCTTCAAAATCAGGTTGAGATACAATTTCCCGCCAAAAGGCGGCGGAGATCATTGCCACGCGGCGGCGGCTTCCCGGCGACGAGTGGC
CATACCAGCGGGACACTTGCCAAGACGCAGTTAGACGGTGGTTTACGACACCTTCGATTCGTCGATCATCGGCATATACGGCTTGAAGGGTTTTCTTGCCATGATGGGAA
AGGACCACGACAATCTGATTGTGAAATTGCAGAGGATGAGAAGCAATCAGGAATTGTGAAACTGAAGAATAAAGCAAAAAATGCATCAAGTAAAGTCAGGCCGTCTCTGC
AGAGAAAAGGCCGGAGAAAGAGTGATGGCCCAAGTGACCATGCTTCAGTTGAGGATGTTTGGGATGTTGAAGAGCTTCATATTGTAGGAGAATTTCGCGATGCTCTTATT
GTGGATGAGTTGCTACCTCCAAAACATGATGACTATCATATGTTATGGAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCTCTGCAAATCTCCATCACCAACTTCTTCAACTTGAAAACCAGTTGCTTCTTAGTGCTGCAAGCAGCCATGCCTGGACCTCCGACGTCACGTTGTCCCCAAG
CTGTGACTTTGAGCCAAGTTTCCATCTTCAAGACCTGAATCCAGGCATTGATGCAGAGAGCTTCACAAGCCTGCAGCATTCCCATGGCCTTGGTAGCCAAGACCAGAAGT
TTGGAGAAGTGAAAGAGGGGGAGGCCTCAGCTTCTTCTCCCAAAACCACTGAAGATTTGAACAAGATTTCTGAAAACTGTTACTGCAGCAAGATTCATCATACTGTTAAT
GTTTCTTCAAGGAAGCCATTGATGATGGCAGAGTTGTCAAGCTCCTTGGGGTTTGACCATAGCCTTCTTCCTTTTGCTTCTTCACATCATCTCTTGCAGCAATCAAGGAC
CTCATCTCTTCCTACTAAAATTAGGAAAGAGAAATTAGGAGATAGAATTGCAGCACTTCAGCAATTAGTGGCACCCTTTGGTAAGACTGACACAGCATCAGTTCTGATGG
AGGCCATTAGTTATATAAATTTCCTTCAAAATCAGGTTGAGATACAATTTCCCGCCAAAAGGCGGCGGAGATCATTGCCACGCGGCGGCGGCTTCCCGGCGACGAGTGGC
CATACCAGCGGGACACTTGCCAAGACGCAGTTAGACGGTGGTTTACGACACCTTCGATTCGTCGATCATCGGCATATACGGCTTGAAGGGTTTTCTTGCCATGATGGGAA
AGGACCACGACAATCTGATTGTGAAATTGCAGAGGATGAGAAGCAATCAGGAATTGTGAAACTGAAGAATAAAGCAAAAAATGCATCAAGTAAAGTCAGGCCGTCTCTGC
AGAGAAAAGGCCGGAGAAAGAGTGATGGCCCAAGTGACCATGCTTCAGTTGAGGATGTTTGGGATGTTGAAGAGCTTCATATTGTAGGAGAATTTCGCGATGCTCTTATT
GTGGATGAGTTGCTACCTCCAAAACATGATGACTATCATATGTTATGGAGGTGA
Protein sequenceShow/hide protein sequence
MDFSANLHHQLLQLENQLLLSAASSHAWTSDVTLSPSCDFEPSFHLQDLNPGIDAESFTSLQHSHGLGSQDQKFGEVKEGEASASSPKTTEDLNKISENCYCSKIHHTVN
VSSRKPLMMAELSSSLGFDHSLLPFASSHHLLQQSRTSSLPTKIRKEKLGDRIAALQQLVAPFGKTDTASVLMEAISYINFLQNQVEIQFPAKRRRRSLPRGGGFPATSG
HTSGTLAKTQLDGGLRHLRFVDHRHIRLEGFSCHDGKGPRQSDCEIAEDEKQSGIVKLKNKAKNASSKVRPSLQRKGRRKSDGPSDHASVEDVWDVEELHIVGEFRDALI
VDELLPPKHDDYHMLWR