; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g1096 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g1096
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionF-box domain-containing protein
Genome locationMC03:17248421..17249712
RNA-Seq ExpressionMC03g1096
SyntenyMC03g1096
Gene Ontology termsGO:0031146 - SCF-dependent proteasomal ubiquitin-dependent protein catabolic process (biological process)
GO:0019005 - SCF ubiquitin ligase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001810 - F-box domain
IPR036047 - F-box-like domain superfamily
IPR045118 - F-box only protein FBXO9/FBXO48


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449214.1 PREDICTED: probable F-box protein At5g04010 [Cucumis melo]2.07e-12266.44Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLA-VPSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII
        S S PPWEV++LVAE LDP++LATASCVCKSWS+SMASDHLWEPIFTANFPSLFNLA  P+SS   SFRRLFGLG+ AAARRR    KPTLSLSDLVF+I
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLA-VPSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII

Query:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP
        S+  T KD + I      K+EK+  PQS ++ VK+G EL V  + NG FKF +NL +++G AP+V GA +EV+VVWN+VL GWRGVFTMIECGGRVG AP
Subjt:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP

Query:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
          DGWFSEELP AGCCPN VSGGIV DL+LGLCG+G   N++              R+ES SVGM+SVV+WRYVS++DGLMYLQHFLFN
Subjt:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

XP_022153920.1 probable F-box protein At5g04010 isoform X1 [Momordica charantia]5.73e-204100Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
        SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV

Query:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
        VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
Subjt:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE

Query:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
        ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
Subjt:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

XP_022153921.1 probable F-box protein At5g04010 isoform X2 [Momordica charantia]5.32e-204100Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
        SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV

Query:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
        VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
Subjt:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE

Query:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
        ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
Subjt:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

XP_031740421.1 probable F-box protein At5g04010 [Cucumis sativus]1.51e-12266.09Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAV-PSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII
        S S PPWEV++LVAE LDP++LATASCVCKSWS+SMASDHLWEPIFTANFPSLFNLA  P++S   SFRRLFGLG+ AAARRR   SKP+LSLSDLVF+I
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAV-PSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII

Query:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP
        S+  T+KD   I      K+EK+  PQS ++ VK+G EL V  D NG FKF +NL  +   AP+V GA +EV+VVWN+VL GWRGVFTMIECGGRVG AP
Subjt:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP

Query:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
          DGWFSEELP AGCCPN VSGGIV DL+LGLCG+G   N++              R+ES SVGM+SVV+WRYVS++DGLMYLQHFLFN
Subjt:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

XP_038883618.1 probable F-box protein At5g04010 [Benincasa hispida]2.22e-12266.78Show/hide
Query:  SSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAV----PSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII
        S PPWEVL+LVAE LDP++LATASCVCK WSISMASDHLWEPIFTANFPSL NLA+    P+S   SFRRLFGLGHTAAARRR  PSKPTLSLSDL+F+I
Subjt:  SSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAV----PSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII

Query:  SVVSTHKDDAFIRKK--KEKSAPPQSI-SCVKYGSELVVAADPNGRFKFYVNLASDSGD--APLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP
        S+ ST KD   +  +  ++  APPQSI + VK+G EL V  D NG FKF +NL  D+G+   P+V G  +EV+VVWN+VL GWRGVFTMIECGGRVG AP
Subjt:  SVVSTHKDDAFIRKK--KEKSAPPQSI-SCVKYGSELVVAADPNGRFKFYVNLASDSGD--APLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP

Query:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
          DGWFSEELP AGCC N VSGGIVADLKLGL G+        NG       G  +R+ES SVGM+S+V+WRYVS++DGLMYLQHFLFN
Subjt:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

TrEMBL top hitse value%identityAlignment
A0A0A0LDE5 F-box domain-containing protein7.30e-12366.09Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAV-PSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII
        S S PPWEV++LVAE LDP++LATASCVCKSWS+SMASDHLWEPIFTANFPSLFNLA  P++S   SFRRLFGLG+ AAARRR   SKP+LSLSDLVF+I
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAV-PSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII

Query:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP
        S+  T+KD   I      K+EK+  PQS ++ VK+G EL V  D NG FKF +NL  +   AP+V GA +EV+VVWN+VL GWRGVFTMIECGGRVG AP
Subjt:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP

Query:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
          DGWFSEELP AGCCPN VSGGIV DL+LGLCG+G   N++              R+ES SVGM+SVV+WRYVS++DGLMYLQHFLFN
Subjt:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

A0A1S3BLJ1 probable F-box protein At5g040101.00e-12266.44Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLA-VPSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII
        S S PPWEV++LVAE LDP++LATASCVCKSWS+SMASDHLWEPIFTANFPSLFNLA  P+SS   SFRRLFGLG+ AAARRR    KPTLSLSDLVF+I
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLA-VPSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII

Query:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP
        S+  T KD + I      K+EK+  PQS ++ VK+G EL V  + NG FKF +NL +++G AP+V GA +EV+VVWN+VL GWRGVFTMIECGGRVG AP
Subjt:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP

Query:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
          DGWFSEELP AGCCPN VSGGIV DL+LGLCG+G   N++              R+ES SVGM+SVV+WRYVS++DGLMYLQHFLFN
Subjt:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

A0A5D3DEJ6 Putative F-box protein1.00e-12266.44Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLA-VPSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII
        S S PPWEV++LVAE LDP++LATASCVCKSWS+SMASDHLWEPIFTANFPSLFNLA  P+SS   SFRRLFGLG+ AAARRR    KPTLSLSDLVF+I
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLA-VPSSSA-ASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFII

Query:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP
        S+  T KD + I      K+EK+  PQS ++ VK+G EL V  + NG FKF +NL +++G AP+V GA +EV+VVWN+VL GWRGVFTMIECGGRVG AP
Subjt:  SVVSTHKDDAFIR----KKKEKSAPPQS-ISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLV-GAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAP

Query:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
          DGWFSEELP AGCCPN VSGGIV DL+LGLCG+G   N++              R+ES SVGM+SVV+WRYVS++DGLMYLQHFLFN
Subjt:  GVDGWFSEELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

A0A6J1DI77 probable F-box protein At5g04010 isoform X12.77e-204100Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
        SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV

Query:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
        VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
Subjt:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE

Query:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
        ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
Subjt:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

A0A6J1DM50 probable F-box protein At5g04010 isoform X22.58e-204100Show/hide
Query:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
        SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV
Subjt:  SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISV

Query:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
        VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE
Subjt:  VSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSE

Query:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
        ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN
Subjt:  ELPLAGCCPNTVSGGIVADLKLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN

SwissProt top hitse value%identityAlignment
Q5EAF6 Probable F-box protein At5g040102.8e-2135.27Show/hide
Query:  PSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSA----ASFRRLFGLGHTAAARRRST-PSKPTLSLSDLVF
        PS P WE+L LV  ++DP SLA ASCV  +WS   +S+ LW+ +  A   S+F  A+    A     S++RL     +AA RRR+  P++P +SLSDLVF
Subjt:  PSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSA----ASFRRLFGLGHTAAARRRST-PSKPTLSLSDLVF

Query:  IISVVSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDG
        I+ V                SA  +    VK G +LV  +  N RF+    + +D  D+       +V + WNVVL  +  +F M E    +     + G
Subjt:  IISVVSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDG

Query:  WFSEELP
        WF++ELP
Subjt:  WFSEELP

Arabidopsis top hitse value%identityAlignment
AT5G04010.1 F-box family protein2.0e-2235.27Show/hide
Query:  PSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSA----ASFRRLFGLGHTAAARRRST-PSKPTLSLSDLVF
        PS P WE+L LV  ++DP SLA ASCV  +WS   +S+ LW+ +  A   S+F  A+    A     S++RL     +AA RRR+  P++P +SLSDLVF
Subjt:  PSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSA----ASFRRLFGLGHTAAARRRST-PSKPTLSLSDLVF

Query:  IISVVSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDG
        I+ V                SA  +    VK G +LV  +  N RF+    + +D  D+       +V + WNVVL  +  +F M E    +     + G
Subjt:  IISVVSTHKDDAFIRKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDG

Query:  WFSEELP
        WF++ELP
Subjt:  WFSEELP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCGCCGTCATCGCCGCCGTGGGAGGTGCTTGTTCTGGTGGCGGAACATCTGGACCCGAGAAGCCTGGCGACGGCGTCGTGCGTGTGCAAGTCGTGGTCCATTTCCATGGC
CTCGGACCACCTCTGGGAGCCCATTTTCACCGCCAATTTCCCTTCTCTTTTCAACCTCGCCGTCCCCTCCTCCTCCGCCGCCTCCTTCCGCCGCCTCTTTGGCCTCGGCC
ACACCGCCGCTGCCCGCCGCCGCTCCACTCCATCAAAACCAACCCTCTCCCTCTCCGACCTCGTCTTCATCATCAGCGTCGTCTCGACTCACAAAGACGACGCGTTCATC
CGAAAGAAGAAAGAAAAGAGCGCTCCGCCACAATCCATCAGTTGCGTGAAGTATGGCAGTGAGCTGGTGGTGGCCGCAGATCCAAACGGACGATTCAAATTCTACGTAAA
TCTAGCCAGCGACAGCGGCGATGCTCCGTTGGTCGGGGCCGGGGACGAGGTGAAGGTGGTGTGGAACGTGGTTTTGGGAGGGTGGCGGGGGGTGTTTACGATGATCGAAT
GTGGAGGCCGAGTGGGGTTTGCTCCGGGAGTCGACGGGTGGTTCTCGGAGGAGCTACCATTGGCGGGATGTTGCCCGAACACGGTGAGCGGTGGAATAGTAGCGGATCTT
AAGTTGGGATTGTGTGGGAGCGGAAGGAATGAGAATGAGATTGACAACGGAGAAAATATTAGGGTTGACAAGGGAGAAAATATTAGGGTTGAGAGCGCGAGCGTGGGAAT
GTTGAGTGTTGTGAATTGGAGATATGTGAGTGTGGAAGATGGACTGATGTATTTGCAACACTTTTTGTTCAAT
mRNA sequenceShow/hide mRNA sequence
TCGCCGTCATCGCCGCCGTGGGAGGTGCTTGTTCTGGTGGCGGAACATCTGGACCCGAGAAGCCTGGCGACGGCGTCGTGCGTGTGCAAGTCGTGGTCCATTTCCATGGC
CTCGGACCACCTCTGGGAGCCCATTTTCACCGCCAATTTCCCTTCTCTTTTCAACCTCGCCGTCCCCTCCTCCTCCGCCGCCTCCTTCCGCCGCCTCTTTGGCCTCGGCC
ACACCGCCGCTGCCCGCCGCCGCTCCACTCCATCAAAACCAACCCTCTCCCTCTCCGACCTCGTCTTCATCATCAGCGTCGTCTCGACTCACAAAGACGACGCGTTCATC
CGAAAGAAGAAAGAAAAGAGCGCTCCGCCACAATCCATCAGTTGCGTGAAGTATGGCAGTGAGCTGGTGGTGGCCGCAGATCCAAACGGACGATTCAAATTCTACGTAAA
TCTAGCCAGCGACAGCGGCGATGCTCCGTTGGTCGGGGCCGGGGACGAGGTGAAGGTGGTGTGGAACGTGGTTTTGGGAGGGTGGCGGGGGGTGTTTACGATGATCGAAT
GTGGAGGCCGAGTGGGGTTTGCTCCGGGAGTCGACGGGTGGTTCTCGGAGGAGCTACCATTGGCGGGATGTTGCCCGAACACGGTGAGCGGTGGAATAGTAGCGGATCTT
AAGTTGGGATTGTGTGGGAGCGGAAGGAATGAGAATGAGATTGACAACGGAGAAAATATTAGGGTTGACAAGGGAGAAAATATTAGGGTTGAGAGCGCGAGCGTGGGAAT
GTTGAGTGTTGTGAATTGGAGATATGTGAGTGTGGAAGATGGACTGATGTATTTGCAACACTTTTTGTTCAAT
Protein sequenceShow/hide protein sequence
SPSSPPWEVLVLVAEHLDPRSLATASCVCKSWSISMASDHLWEPIFTANFPSLFNLAVPSSSAASFRRLFGLGHTAAARRRSTPSKPTLSLSDLVFIISVVSTHKDDAFI
RKKKEKSAPPQSISCVKYGSELVVAADPNGRFKFYVNLASDSGDAPLVGAGDEVKVVWNVVLGGWRGVFTMIECGGRVGFAPGVDGWFSEELPLAGCCPNTVSGGIVADL
KLGLCGSGRNENEIDNGENIRVDKGENIRVESASVGMLSVVNWRYVSVEDGLMYLQHFLFN