CuGenDBv2

MS017303 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS017303
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF674)
Genome location: scaffold33:1473800..1474423
RNA-Seq Expression: MS017303
Synteny: MS017303
Gene Ontology terms: NA
InterPro domains: IPR007750 - Protein of unknown function DUF674
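The genome location string can be split into a scaffold name and a coordinate pair. The sketch below is illustrative only: the names Location and parse_location are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'scaffold33:1473800..1474423'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("scaffold33:1473800..1474423")
print(loc.seqid, loc.start, loc.end, loc.length)  # scaffold33 1473800 1474423 624
```

Under the inclusive-coordinate assumption, the span is 624 bp.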


Homology
GenBank top hits (e value, % identity, alignment)
KAE8647976.1 hypothetical protein Csa_000363 [Cucumis sativus] | e value: 1.1e-62 | % identity: 66.5
Query:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS
        EV+LKLLIDS  +RV+FGEADKN + FLF LLSLPLG VIRLL+K  M G L NLY SVE LN+TYLQ NQSKDSLLKPKVSF   +S +LLPNI+S + 
Subjt:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS

Query:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII
          K Y C ++C   +A++P A CPNCR+ M + C FV+P   + QA   VGE GG+VKGVVTYMVMDDLSV+PMSTIS+ITLL NKFNIK+VGALEEK++
Subjt:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII

Query:  TLD
        TLD
Subjt:  TLD

XP_004147723.1 uncharacterized protein LOC101207526 [Cucumis sativus] | e value: 2.1e-63 | % identity: 66.02
Query:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS
        EV+LKLLIDS  +RV+FGEADKN + FLF LLSLPLG VIRLL+K  M G L NLY SVE LN+TYLQ NQSKDSLLKPKVSF   +S +LLPNI+S + 
Subjt:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS

Query:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII
          K Y C ++C   +A++P A CPNCR+ M + C FV+P   + QA   VGE GG+VKGVVTYMVMDDLSV+PMSTIS+ITLL NKFNIK+VGALEEK++
Subjt:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII

Query:  TLDVNE
        TLDV++
Subjt:  TLDVNE

XP_008461735.1 PREDICTED: uncharacterized protein LOC103500268 [Cucumis melo] | e value: 5.4e-67 | % identity: 68.45
Query:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS
        EVRLKLLIDS  +RV+FGEADKN + FLF LLSLPLG VIRLL+KQGMVG L NLY SVE LN+TYLQ NQSKD+LLKPKVSF   +S +LLPNI+S + 
Subjt:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS

Query:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII
          KFY C +RC   +A++P A CP+CR  M + C  V+P   + QA   VGE GG+VKGVVTYMVMDDLSV+PMSTIS+ITLL NKFNIK+VGALEEK+I
Subjt:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII

Query:  TLDVNE
        TLDVN+
Subjt:  TLDVNE

XP_022138964.1 uncharacterized protein LOC111010013 [Momordica charantia] | e value: 1.1e-70 | % identity: 69.77
Query:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID
        MA   VRLKLLIDS  QRV+FGEADKN + FLF LLSLPLG VIRLL+KQGMVGCLGNLY SVETLN+TYLQ NQSKD LLKPKVSFCG SS MLLPNID
Subjt:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID

Query:  -SSSATKFYWCSS----RCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQ--AATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKD
         S++AT FY C+S     CR  +++ P A CP C   M QV  FV P   S    AA    EGG+VKGVVTYMVMDDLSV+PMSTIS+I LL NKFN+K+
Subjt:  -SSSATKFYWCSS----RCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQ--AATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKD

Query:  VGALEEKIITLDVNE
        VGALEEK++TLDVNE
Subjt:  VGALEEKIITLDVNE

XP_022764289.1 uncharacterized protein LOC111309518 [Durio zibethinus] | e value: 2.1e-58 | % identity: 59.72
Query:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID
        MA   V  KLLID  SQRV+F EA K+FV FLF +LSLP+G VIRLL KQGMVGCLGNLY S+E L++TY+QS  +KD+LLKP VS       +LLPNI+
Subjt:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID

Query:  SSSATK--FYWC-SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGAL
          S+T    Y C ++ CR Y+ANDPK+TCP+C  +M+Q   FV+P+      A+S GEGGYVKGVVTYM+MDDL V PMSTIS+ITLL N+FN+KDVG L
Subjt:  SSSATK--FYWC-SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGAL

Query:  EEKIITLDVNE
        EE++I + ++E
Subjt:  EEKIITLDVNE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLL7 Uncharacterized protein | e value: 1.0e-63 | % identity: 66.02
Query:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS
        EV+LKLLIDS  +RV+FGEADKN + FLF LLSLPLG VIRLL+K  M G L NLY SVE LN+TYLQ NQSKDSLLKPKVSF   +S +LLPNI+S + 
Subjt:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS

Query:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII
          K Y C ++C   +A++P A CPNCR+ M + C FV+P   + QA   VGE GG+VKGVVTYMVMDDLSV+PMSTIS+ITLL NKFNIK+VGALEEK++
Subjt:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII

Query:  TLDVNE
        TLDV++
Subjt:  TLDVNE

A0A1S3CGQ2 uncharacterized protein LOC103500268 | e value: 2.6e-67 | % identity: 68.45
Query:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS
        EVRLKLLIDS  +RV+FGEADKN + FLF LLSLPLG VIRLL+KQGMVG L NLY SVE LN+TYLQ NQSKD+LLKPKVSF   +S +LLPNI+S + 
Subjt:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS

Query:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII
          KFY C +RC   +A++P A CP+CR  M + C  V+P   + QA   VGE GG+VKGVVTYMVMDDLSV+PMSTIS+ITLL NKFNIK+VGALEEK+I
Subjt:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII

Query:  TLDVNE
        TLDVN+
Subjt:  TLDVNE

A0A5A7U8V2 DUF674 domain-containing protein | e value: 2.6e-67 | % identity: 68.45
Query:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS
        EVRLKLLIDS  +RV+FGEADKN + FLF LLSLPLG VIRLL+KQGMVG L NLY SVE LN+TYLQ NQSKD+LLKPKVSF   +S +LLPNI+S + 
Subjt:  EVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDS-SS

Query:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII
          KFY C +RC   +A++P A CP+CR  M + C  V+P   + QA   VGE GG+VKGVVTYMVMDDLSV+PMSTIS+ITLL NKFNIK+VGALEEK+I
Subjt:  ATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE-GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII

Query:  TLDVNE
        TLDVN+
Subjt:  TLDVNE

A0A6J1CBJ8 uncharacterized protein LOC111010013 | e value: 5.1e-71 | % identity: 69.77
Query:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID
        MA   VRLKLLIDS  QRV+FGEADKN + FLF LLSLPLG VIRLL+KQGMVGCLGNLY SVETLN+TYLQ NQSKD LLKPKVSFCG SS MLLPNID
Subjt:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID

Query:  -SSSATKFYWCSS----RCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQ--AATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKD
         S++AT FY C+S     CR  +++ P A CP C   M QV  FV P   S    AA    EGG+VKGVVTYMVMDDLSV+PMSTIS+I LL NKFN+K+
Subjt:  -SSSATKFYWCSS----RCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQ--AATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKD

Query:  VGALEEKIITLDVNE
        VGALEEK++TLDVNE
Subjt:  VGALEEKIITLDVNE

A0A6P6AHE1 uncharacterized protein LOC111309518 | e value: 1.0e-58 | % identity: 59.72
Query:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID
        MA   V  KLLID  SQRV+F EA K+FV FLF +LSLP+G VIRLL KQGMVGCLGNLY S+E L++TY+QS  +KD+LLKP VS       +LLPNI+
Subjt:  MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID

Query:  SSSATK--FYWC-SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGAL
          S+T    Y C ++ CR Y+ANDPK+TCP+C  +M+Q   FV+P+      A+S GEGGYVKGVVTYM+MDDL V PMSTIS+ITLL N+FN+KDVG L
Subjt:  SSSATK--FYWC-SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGAL

Query:  EEKIITLDVNE
        EE++I + ++E
Subjt:  EEKIITLDVNE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G09110.1 Protein of unknown function (DUF674) | e value: 1.9e-17 | % identity: 31.63
Query:  KREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQK-----QGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLP
        K +  L+LLID    RVI  EA K+FV  L +LL+LP+G ++RLL+K       +VGCL NLY SV  ++    +S   K  LL P+ S  G     L  
Subjt:  KREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQK-----QGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLP

Query:  NIDSSSATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE----GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKD
        NID + ATKF+ C +    +++ +       CR L   V       G S      V E    G +     ++++ DDL V  ++++  +  + N F    
Subjt:  NIDSSSATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGE----GGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKD

Query:  VGALEEKIITLDVNE
           L+E +I +   E
Subjt:  VGALEEKIITLDVNE

AT5G01120.1 Protein of unknown function (DUF674) | e value: 1.0e-18 | % identity: 31.31
Query:  VRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQ-----KQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID
        + LKLLID    +V+F EA  +FV  LF+  +LP+G ++RLL+     +   +GC  N+YASV ++   +  +   K  LL P  S        +   ID
Subjt:  VRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQ-----KQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID

Query:  SSSATKFYWC-----SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVT-YMVMDDLSVEPMSTISTITLLNNKFNIKDV
         S ATK + C     S +C    +N   + C +C   MD+V QF    G    +   V    +V+G  T +++ DDL V+  S  ST+ +L +     D 
Subjt:  SSSATKFYWC-----SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVT-YMVMDDLSVEPMSTISTITLLNNKFNIKDV

Query:  GALEEKIITLDVNE
          L E I+ +++ E
Subjt:  GALEEKIITLDVNE

AT5G01150.1 Protein of unknown function (DUF674) | e value: 8.0e-16 | % identity: 29.61
Query:  LKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQG-----MVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDSS
        L+L++D    +V+  EA ++FV  LF+LL+LP+G ++RLL+         +GC  NLY SV  +     ++   K  L+ PK S        L  NI+ +
Subjt:  LKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQG-----MVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDSS

Query:  SATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII
           K + CSS C G  +N   + C  C   M++  Q  +      +       G +V G  ++++ DDL V   ST   +  L +     DVG L E+++
Subjt:  SATKFYWCSSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKII

Query:  TLDVNE
         + V E
Subjt:  TLDVNE

AT5G43240.1 Protein of unknown function (DUF674) | e value: 1.5e-19 | % identity: 32.71
Query:  VRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQ-----KQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID
        ++LKLLID    +V+F EA K+FV  LF+  +LP+G ++RLL+     ++  +GC  N+YASV ++   +  +   K  LL P  S        L   +D
Subjt:  VRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQ-----KQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID

Query:  SSSATKFYWC-----SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVT-YMVMDDLSVEPMSTISTITLLNNKFNIKDV
         S ATK++ C       +C    +N   + C +C  LM++V Q +   GG   A   V  G +V+   T +M+ DDL VE  S   T+ +L +     D 
Subjt:  SSSATKFYWC-----SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVT-YMVMDDLSVEPMSTISTITLLNNKFNIKDV

Query:  GALEEKIITLDVNE
          L+EKI  +++ E
Subjt:  GALEEKIITLDVNE

AT5G43240.3 Protein of unknown function (DUF674) | e value: 1.5e-19 | % identity: 32.71
Query:  VRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQ-----KQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID
        ++LKLLID    +V+F EA K+FV  LF+  +LP+G ++RLL+     ++  +GC  N+YASV ++   +  +   K  LL P  S        L   +D
Subjt:  VRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQ-----KQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNID

Query:  SSSATKFYWC-----SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVT-YMVMDDLSVEPMSTISTITLLNNKFNIKDV
         S ATK++ C       +C    +N   + C +C  LM++V Q +   GG   A   V  G +V+   T +M+ DDL VE  S   T+ +L +     D 
Subjt:  SSSATKFYWC-----SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVT-YMVMDDLSVEPMSTISTITLLNNKFNIKDV

Query:  GALEEKIITLDVNE
          L+EKI  +++ E
Subjt:  GALEEKIITLDVNE
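
The alignments above use a BLAST-style middle line: a letter marks an identical residue, "+" marks a similar one, and a space marks a mismatch or gap. As a rough sketch (not the database's own calculation), the percent identity of one displayed block can be estimated by counting the letters on the middle line. The function name percent_identity is hypothetical, and the result will differ from the reported value whenever only part of the alignment is shown.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity for one alignment block from its middle line.

    Identical columns are those where the middle line repeats the query
    residue. The denominator is the number of alignment columns, gaps
    included (assumed here to follow the usual BLAST convention).
    """
    if not query:
        raise ValueError("query block is empty")
    # The middle line may have lost trailing spaces; pad it to the query width.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# A short made-up block: four identical columns out of five.
print(percent_identity("EVRLK", "EV+LK"))  # 80.0
```

To apply this to a full hit, the Query and middle-line strings of all its blocks would be concatenated first, with the "Query:" prefix and its indentation removed.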


Sequences
CDS sequence
ATGGCTAAGAGGGAAGTGAGATTGAAGCTTCTAATAGACTCGGGATCACAAAGAGTTATTTTTGGTGAAGCAGACAAGAATTTCGTCCACTTTCTTTTCACTCTGCTGTC
TCTCCCACTTGGGGCTGTGATTCGGCTGCTCCAAAAGCAAGGCATGGTGGGGTGCTTGGGAAATCTGTACGCAAGCGTGGAGACGTTGAACGAGACATATTTGCAGTCAA
ACCAGAGCAAGGACTCTCTTTTGAAGCCCAAAGTTTCATTCTGTGGTGGTTCCTCGGCCATGCTTTTGCCTAATATTGATTCCTCGTCTGCAACCAAGTTTTATTGGTGT
AGCAGTCGCTGTAGGGGTTACATTGCCAATGACCCTAAAGCAACCTGTCCGAATTGCAGAACATTGATGGACCAAGTGTGTCAATTTGTGCATCCGTCGGGAGGAAGTAA
ACAGGCCGCAACAAGTGTGGGAGAGGGAGGGTATGTGAAGGGTGTGGTGACTTACATGGTGATGGATGATTTGAGTGTGGAACCAATGTCCACCATCTCCACCATTACTC
TTCTGAATAACAAGTTTAATATCAAAGATGTGGGTGCTTTGGAGGAGAAGATCATCACTTTGGATGTCAATGAG
mRNA sequence
ATGGCTAAGAGGGAAGTGAGATTGAAGCTTCTAATAGACTCGGGATCACAAAGAGTTATTTTTGGTGAAGCAGACAAGAATTTCGTCCACTTTCTTTTCACTCTGCTGTC
TCTCCCACTTGGGGCTGTGATTCGGCTGCTCCAAAAGCAAGGCATGGTGGGGTGCTTGGGAAATCTGTACGCAAGCGTGGAGACGTTGAACGAGACATATTTGCAGTCAA
ACCAGAGCAAGGACTCTCTTTTGAAGCCCAAAGTTTCATTCTGTGGTGGTTCCTCGGCCATGCTTTTGCCTAATATTGATTCCTCGTCTGCAACCAAGTTTTATTGGTGT
AGCAGTCGCTGTAGGGGTTACATTGCCAATGACCCTAAAGCAACCTGTCCGAATTGCAGAACATTGATGGACCAAGTGTGTCAATTTGTGCATCCGTCGGGAGGAAGTAA
ACAGGCCGCAACAAGTGTGGGAGAGGGAGGGTATGTGAAGGGTGTGGTGACTTACATGGTGATGGATGATTTGAGTGTGGAACCAATGTCCACCATCTCCACCATTACTC
TTCTGAATAACAAGTTTAATATCAAAGATGTGGGTGCTTTGGAGGAGAAGATCATCACTTTGGATGTCAATGAG
Protein sequence
MAKREVRLKLLIDSGSQRVIFGEADKNFVHFLFTLLSLPLGAVIRLLQKQGMVGCLGNLYASVETLNETYLQSNQSKDSLLKPKVSFCGGSSAMLLPNIDSSSATKFYWC
SSRCRGYIANDPKATCPNCRTLMDQVCQFVHPSGGSKQAATSVGEGGYVKGVVTYMVMDDLSVEPMSTISTITLLNNKFNIKDVGALEEKIITLDVNE
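
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code, which the page does not state; the names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; line breaks and spaces are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are rendered as "X".
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First nine codons of the CDS listed above.
print(translate("ATGGCTAAGAGGGAAGTGAGATTGAAG"))  # MAKREVRLK
```

Passing the full CDS, with its wrapped lines joined, allows the result to be compared with the listed protein sequence.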