; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021351 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021351
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein SPOROCYTELESS
Genome locationscaffold358:1293094..1294285
RNA-Seq ExpressionMS021351
SyntenyMS021351
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR014855 - Plant transcription factor NOZZLE
IPR040356 - SPEAR family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591999.1 hypothetical protein SDJN03_14345, partial [Cucurbita argyrosperma subsp. sororia]4.8e-4345.21Show/hide
Query:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----
        MA+PM+++          +ETKP EP KTR +R  K+A K P HKKPPQRGLGVAQLERLRLQERWKKMTE+ PP     H+    + P L F A     
Subjt:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----

Query:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF
         D  TG  G   F G G  GG   G GG   +EP+ HGGGA+ D R+LIG+   E  RELSSIP +         P PCVSDRCDICFK      R+N  
Subjt:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF

Query:  GFRENFRDEKTVIHIHLRFPTPKS-----IKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKEL------
            N   EK +I      P P +     + L T++       S  FNQ          +  G GGGGG   + LMEYEFFP KNGRGTE +E       
Subjt:  GFRENFRDEKTVIHIHLRFPTPKS-----IKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKEL------

Query:  --------EMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
                E  +EEEEEE   A +AVDHGE  SCITT+ +  I   NG + + S+ +DLSLKLSF
Subjt:  --------EMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

KAG7024875.1 hypothetical protein SDJN02_13694, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-4345.86Show/hide
Query:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----
        MA+PM+++          +ETKP EP KTR +R  K+A K P HKKPPQRGLGVAQLERLRLQERWKKMTE+ PP     H+    + P L F A     
Subjt:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----

Query:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF
         D  TG  G   F G G  GG   G GG   +EP+ HGGGA+ DPR+LIG+   E  RELSSIP +         P PCVSDRCDICFK      R+N  
Subjt:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF

Query:  GFRENFRDEKTVIHIHLRFPTPKS-----IKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE-------
            N   EK +I      P P +     + L T++       S  FNQ          +  G GGGGG   + LMEYEFFP KNGRGTE +E       
Subjt:  GFRENFRDEKTVIHIHLRFPTPKS-----IKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE-------

Query:  ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
             E  +EEEEEE   A +AVDHGE  SCITT+ +  I   NG + + S+ +DLSLKLSF
Subjt:  ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

XP_022936226.1 protein virilizer homolog [Cucurbita moschata]2.0e-4446.13Show/hide
Query:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----
        MA+PM+++          +ETKP EP KTR +R  K+A K P HKKPPQRGLGVAQLERLRLQERWKKMTE+ PP     H+    + P L FSA     
Subjt:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----

Query:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF
         D  TG  G   F G G  GG   G GG   +EP+ HGGGA+ DPR+LIG+   E  RELSSIP +         P PCVSDRCDICFK      R+N  
Subjt:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF

Query:  GFRENFRDEKTVIHIHLRFPTP-----KSIKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE-------
            N   EK +I      P P       + L T++       S  FNQ          +  G GGGGG   + LMEYEFFP KNGRGTE +E       
Subjt:  GFRENFRDEKTVIHIHLRFPTP-----KSIKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE-------

Query:  ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
             E  +EEEEEE   A +AVDHGE  SCITT+ +  I   NG + + S+ +DLSLKLSF
Subjt:  ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

XP_022975733.1 protein SPOROCYTELESS [Cucurbita maxima]1.6e-4646.86Show/hide
Query:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHFA---NIPGLHFSAVDDCT
        MATPM+++          +ETKP EP KTR +R  K+A K P  KKPPQRGLGVAQLERLRLQERWKKMT++ PP      F     + G   + V +  
Subjt:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHFA---NIPGLHFSAVDDCT

Query:  GEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM-----PMPCVSDRCDICFKVIFIILRINLFGFRENFRDE
        G G   V   +GN  GF +G GG   +EP+ HGGGA+ DPR+LIG+   E  RELSSIP +     P PCVSDRCDICFK      R+N      N   E
Subjt:  GEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM-----PMPCVSDRCDICFKVIFIILRINLFGFRENFRDE

Query:  KTVIHIHLRFPTPKS---IKLITHA----IFLSSRFNQT--------EETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMP-----------KEEE
        K +  I    P P S   + L T++       S  FNQ          +  G GGGGGE + LMEYEFFP KNGRGTEF+EL+ P           +EE+
Subjt:  KTVIHIHLRFPTPKS---IKLITHA----IFLSSRFNQT--------EETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMP-----------KEEE

Query:  EEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
        EEE   A +AVDHGE  SCITT+ +  I   NG + + S+ +DLSLKLSF
Subjt:  EEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

XP_038900102.1 protein SPOROCYTELESS-like [Benincasa hispida]5.0e-4043.44Show/hide
Query:  QETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHF--------------ANIPGLHFSA------VDDCTG
        +ETKP EP KTRP R  K+  + P  KKPPQRGLGVAQLERLRLQ++W K+TEM PP    HHF               N P L F A        D  G
Subjt:  QETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHF--------------ANIPGLHFSA------VDDCTG

Query:  EGG-------GLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMP-----CVSDRCDICFKVIFIILRINLFGFRE
         GG       GLV Q +GN GGF+AG       EPY+HGGG     VLIG+   E  RELSSIPK+P P     C SD CDICFK      R+N      
Subjt:  EGG-------GLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMP-----CVSDRCDICFKVIFIILRINLFGFRE

Query:  NFRDEKTVIHIHLRFP-------------TPKSIKLITHAIFLSS------------RFNQ----TEETLGNGGGG----GENILMEYEFFPGKNGRGTE
        N   EK +       P             T  S   + ++  ++              FNQ         G+GGGG    G + LMEYEFFP KN RGTE
Subjt:  NFRDEKTVIHIHLRFP-------------TPKSIKLITHAIFLSS------------RFNQ----TEETLGNGGGG----GENILMEYEFFPGKNGRGTE

Query:  FKELEMPKEE-----EEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
         +EL+MPKEE     EE E      AVDHGE  SCITT+ N  I   NG + + S+A+DLSLKLSF
Subjt:  FKELEMPKEE-----EEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

TrEMBL top hitse value%identityAlignment
A0A0A0LDJ3 Uncharacterized protein7.8e-3944.65Show/hide
Query:  EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------DDCTG-EGGGLVFQGMGNVGGFVA
        EP KTR  R  K  PK P  KKPPQRGLGVAQLERLRLQE WK +TE+ PP    H+   N P LHF          DC G +  G V Q +GN GGF+ 
Subjt:  EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------DDCTG-EGGGLVFQGMGNVGGFVA

Query:  GAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFRENFRDEKTVI--------HIHLRFPTPKSIK
         +G                  VLIG+   E  RELSSIPK+P+ C SDRCD CFK      R+N      N   EK +I           L   T  S +
Subjt:  GAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFRENFRDEKTVI--------HIHLRFPTPKSIK

Query:  LITHAIFLSSRFNQTEETL--------------GNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE------EEEEEFSFAEEAVDHGEASSCITTT
        L TH    +   + T   L              G GG GGE + LMEYEFFP KNGRGTE +EL+MPKE      EE EE      A+DHGE  SCITT+
Subjt:  LITHAIFLSSRFNQTEETL--------------GNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE------EEEEEFSFAEEAVDHGEASSCITTT

Query:  YNTAIVNNNGSSSSGSSAVDLSLKLSF
         N  I   NG + + S+A+DLSLKLSF
Subjt:  YNTAIVNNNGSSSSGSSAVDLSLKLSF

A0A1S3BZ18 uncharacterized protein LOC103494987 isoform X23.0e-3841.81Show/hide
Query:  MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------D
        MATP+     Q+  +N S   KP   +       K  PK P  KKPPQRGLGVAQLERLRLQE WK +TE+ PP    H+   N P LHF          
Subjt:  MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------D

Query:  DCTGEGG---------GLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFR
        DC   G          G V Q +GN GGF+   G                  VLIG+   E  RELSSIPK+P+ C SDRCD CFK      R+N     
Subjt:  DCTGEGG---------GLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFR

Query:  ENFRDEKTVIHIHLRFP--------TPKSIKLITHAIF---------LSSRFN-QTEETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE----
         N   EK +I      P        T  + +L TH++          L   F+   ++  G GG GGE + LMEYEFFP KNGRGTE +EL+MPKE    
Subjt:  ENFRDEKTVIHIHLRFP--------TPKSIKLITHAIF---------LSSRFN-QTEETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE----

Query:  --EEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
          EE EE      A+DHGE  SCITT+ N  I   NG + + S+A+DLSLKLSF
Subjt:  --EEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

A0A5A7T067 Protein SPOROCYTELESS3.9e-3841.81Show/hide
Query:  MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------D
        MATP+     Q+  +N S   KP   +       K  PK P  KKPPQRGLGVAQLERLRLQE WK +TE+ PP    H+   N P LHF          
Subjt:  MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHH-FANIPGLHFSAV------D

Query:  DCTGEGG---------GLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFR
        DC   G          G V Q +GN GGF+   G                  VLIG+   E  RELSSIPK+P+ C SDRCD CFK      R+N     
Subjt:  DCTGEGG---------GLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFR

Query:  ENFRDEKTVIHIHLRFP--------TPKSIKLITHAIF---------LSSRFN-QTEETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE----
         N   EK +I      P        T  + +L TH++          L   F+   ++  G GG GGE + LMEYEFFP KNGRGTE +EL+MPKE    
Subjt:  ENFRDEKTVIHIHLRFP--------TPKSIKLITHAIF---------LSSRFN-QTEETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMPKE----

Query:  --EEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
          EE EE      A+DHGE  SCITT+ N  I   NG + + S+A+DLSLKLSF
Subjt:  --EEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

A0A6J1F6X7 protein virilizer homolog9.5e-4546.13Show/hide
Query:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----
        MA+PM+++          +ETKP EP KTR +R  K+A K P HKKPPQRGLGVAQLERLRLQERWKKMTE+ PP     H+    + P L FSA     
Subjt:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPP---PVHAHHFANIPGLHFSAV----

Query:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF
         D  TG  G   F G G  GG   G GG   +EP+ HGGGA+ DPR+LIG+   E  RELSSIP +         P PCVSDRCDICFK      R+N  
Subjt:  -DDCTGEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM---------PMPCVSDRCDICFKVIFIILRINLF

Query:  GFRENFRDEKTVIHIHLRFPTP-----KSIKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE-------
            N   EK +I      P P       + L T++       S  FNQ          +  G GGGGG   + LMEYEFFP KNGRGTE +E       
Subjt:  GFRENFRDEKTVIHIHLRFPTP-----KSIKLITHA----IFLSSRFNQT--------EETLGNGGGGG--ENILMEYEFFPGKNGRGTEFKE-------

Query:  ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
             E  +EEEEEE   A +AVDHGE  SCITT+ +  I   NG + + S+ +DLSLKLSF
Subjt:  ----LEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

A0A6J1IK53 protein SPOROCYTELESS7.8e-4746.86Show/hide
Query:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHFA---NIPGLHFSAVDDCT
        MATPM+++          +ETKP EP KTR +R  K+A K P  KKPPQRGLGVAQLERLRLQERWKKMT++ PP      F     + G   + V +  
Subjt:  MATPMLLMAAQKSNANASQETKP-EPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHFA---NIPGLHFSAVDDCT

Query:  GEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM-----PMPCVSDRCDICFKVIFIILRINLFGFRENFRDE
        G G   V   +GN  GF +G GG   +EP+ HGGGA+ DPR+LIG+   E  RELSSIP +     P PCVSDRCDICFK      R+N      N   E
Subjt:  GEGGGLVFQGMGNVGGFVAGAGGFTVVEPYTHGGGAL-DPRVLIGSYGEEDLRELSSIPKM-----PMPCVSDRCDICFKVIFIILRINLFGFRENFRDE

Query:  KTVIHIHLRFPTPKS---IKLITHA----IFLSSRFNQT--------EETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMP-----------KEEE
        K +  I    P P S   + L T++       S  FNQ          +  G GGGGGE + LMEYEFFP KNGRGTEF+EL+ P           +EE+
Subjt:  KTVIHIHLRFPTPKS---IKLITHA----IFLSSRFNQT--------EETLGNGGGGGE-NILMEYEFFPGKNGRGTEFKELEMP-----------KEEE

Query:  EEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF
        EEE   A +AVDHGE  SCITT+ +  I   NG + + S+ +DLSLKLSF
Subjt:  EEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACTCCAATGCTTCTCATGGCCGCCCAAAAATCCAACGCCAACGCATCACAAGAAACCAAGCCAGAGCCCACCAAAACCAGACCCTCAAGATCGTCAAAGTCCGC
CCCCAAAGCCCCCGCCCACAAGAAGCCGCCGCAGCGCGGCCTCGGCGTCGCCCAGCTCGAGCGCCTCCGCCTCCAGGAACGCTGGAAGAAAATGACCGAAATGCCCCCGC
CGCCCGTCCACGCCCACCACTTCGCCAACATTCCCGGCCTCCACTTCTCCGCCGTTGATGACTGTACTGGCGAAGGCGGTGGTCTCGTCTTTCAGGGGATGGGGAATGTT
GGAGGGTTTGTCGCCGGCGCCGGAGGGTTCACGGTGGTGGAGCCGTACACGCACGGCGGCGGAGCCCTGGATCCGAGGGTTCTGATCGGAAGTTACGGCGAGGAGGATTT
GAGAGAGCTCTCTTCAATCCCAAAAATGCCGATGCCGTGCGTTTCCGATCGCTGTGATATTTGCTTCAAGGTAATATTTATTATTCTTCGTATCAACTTGTTTGGATTCC
GAGAAAATTTCCGTGACGAGAAAACGGTTATACATATTCATCTCCGTTTTCCGACGCCCAAATCGATTAAATTAATTACTCACGCCATTTTTCTGAGTTCACGTTTTAAT
CAAACAGAAGAAACGCTGGGGAACGGCGGAGGAGGAGGAGAAAATATATTAATGGAATACGAGTTTTTTCCTGGGAAAAATGGCAGAGGCACAGAGTTCAAGGAACTGGA
AATGCCAAAGGAAGAAGAAGAAGAAGAATTTTCGTTTGCAGAAGAAGCAGTGGATCATGGAGAAGCTTCGTCTTGTATTACTACAACCTACAACACTGCCATTGTTAATA
ATAATGGCAGCAGTAGCAGTGGTTCCAGTGCAGTTGATTTGTCTCTCAAACTTTCATTT
mRNA sequenceShow/hide mRNA sequence
ATGGCTACTCCAATGCTTCTCATGGCCGCCCAAAAATCCAACGCCAACGCATCACAAGAAACCAAGCCAGAGCCCACCAAAACCAGACCCTCAAGATCGTCAAAGTCCGC
CCCCAAAGCCCCCGCCCACAAGAAGCCGCCGCAGCGCGGCCTCGGCGTCGCCCAGCTCGAGCGCCTCCGCCTCCAGGAACGCTGGAAGAAAATGACCGAAATGCCCCCGC
CGCCCGTCCACGCCCACCACTTCGCCAACATTCCCGGCCTCCACTTCTCCGCCGTTGATGACTGTACTGGCGAAGGCGGTGGTCTCGTCTTTCAGGGGATGGGGAATGTT
GGAGGGTTTGTCGCCGGCGCCGGAGGGTTCACGGTGGTGGAGCCGTACACGCACGGCGGCGGAGCCCTGGATCCGAGGGTTCTGATCGGAAGTTACGGCGAGGAGGATTT
GAGAGAGCTCTCTTCAATCCCAAAAATGCCGATGCCGTGCGTTTCCGATCGCTGTGATATTTGCTTCAAGGTAATATTTATTATTCTTCGTATCAACTTGTTTGGATTCC
GAGAAAATTTCCGTGACGAGAAAACGGTTATACATATTCATCTCCGTTTTCCGACGCCCAAATCGATTAAATTAATTACTCACGCCATTTTTCTGAGTTCACGTTTTAAT
CAAACAGAAGAAACGCTGGGGAACGGCGGAGGAGGAGGAGAAAATATATTAATGGAATACGAGTTTTTTCCTGGGAAAAATGGCAGAGGCACAGAGTTCAAGGAACTGGA
AATGCCAAAGGAAGAAGAAGAAGAAGAATTTTCGTTTGCAGAAGAAGCAGTGGATCATGGAGAAGCTTCGTCTTGTATTACTACAACCTACAACACTGCCATTGTTAATA
ATAATGGCAGCAGTAGCAGTGGTTCCAGTGCAGTTGATTTGTCTCTCAAACTTTCATTT
Protein sequenceShow/hide protein sequence
MATPMLLMAAQKSNANASQETKPEPTKTRPSRSSKSAPKAPAHKKPPQRGLGVAQLERLRLQERWKKMTEMPPPPVHAHHFANIPGLHFSAVDDCTGEGGGLVFQGMGNV
GGFVAGAGGFTVVEPYTHGGGALDPRVLIGSYGEEDLRELSSIPKMPMPCVSDRCDICFKVIFIILRINLFGFRENFRDEKTVIHIHLRFPTPKSIKLITHAIFLSSRFN
QTEETLGNGGGGGENILMEYEFFPGKNGRGTEFKELEMPKEEEEEEFSFAEEAVDHGEASSCITTTYNTAIVNNNGSSSSGSSAVDLSLKLSF