CuGenDBv2

MS013483 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013483
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: (thale cress) hypothetical protein
Genome location: scaffold402:1881097..1882834
RNA-Seq Expression: MS013483
Synteny: MS013483
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6573361.1 hypothetical protein SDJN03_27248, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.1e-60; % identity: 51.26)
Query:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP
        MES A+F GG P KLPL  P PST + + P WL F++   PFPRL NGS  T +SIG TRNP    K LA C          YSNA++D  K+L+SLV P
Subjt:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP

Query:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG
        K  N+S L  I S KNEALKLVV++KY    C  + L +      +V YEAR+A++Q LI LDEY+KALEFLEE   FP+  S +AR  LYKAVVHTMLG
Subjt:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG

Query:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG
          D AE+ WN YL+TL NGNVN+    H  N       FL +AK+ LK LLSLK      +S L  IIP K  ALK VV   +  A+R M++L  K +D 
Subjt:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG

Query:  -EEAYEAQMAYIQILIHL
         EEA EAQ+AY+ ILI+L
Subjt:  -EEAYEAQMAYIQILIHL

XP_008444305.1 PREDICTED: uncharacterized protein LOC103487673 [Cucumis melo] (e-value: 5.2e-58; % identity: 49.69)
Query:  MESIAMFHGGSPPKLPLYAPPST--SLRMAPWLWFDL-SKSPFPRLVNGSTGI-SIGPTRN----PKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP
        MESI +F   S PKLP  AP  T  S+  +PWL F+L + +PF +L N ST I +IG   +     K +A C N V   +    +  Q+P KSLLSLV P
Subjt:  MESIAMFHGGSPPKLPLYAPPST--SLRMAPWLWFDL-SKSPFPRLVNGSTGI-SIGPTRN----PKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP

Query:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG
          + NS  STI   K+EALKLV+D KY   E  M  L  +GDT  +V YEARVA++Q LI+LD+Y KAL FLEE+G FP S   + R CLYKAVV+TML 
Subjt:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG

Query:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLK-PAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKD
        K D AEK WN YL+TL N NVN     +  NNT  E++ + +AK  LK LLSLK PAK E+NS  + II  KN A+K VV G ++ A+ LMK   E  KD
Subjt:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLK-PAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKD

Query:  GEEAYEAQMAYIQILIHL
          E  EAQ+ Y+ ILI+L
Subjt:  GEEAYEAQMAYIQILIHL

XP_022140224.1 uncharacterized protein LOC111010943 [Momordica charantia] (e-value: 2.9e-125; % identity: 96.19)
Query:  MAPWLWFDLSKSPFPRLVNGSTGISIGPTRNPKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPPKADNNSQLSTIVSIKNEALKLVVDKKYDVLECLM
        MAPWLWFDLSKSPFPRLVNGSTGISIGPTRNPKFLAHC NI HSR+TQYSNAKQDPWKSLLSLVP KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLM
Subjt:  MAPWLWFDLSKSPFPRLVNGSTGISIGPTRNPKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPPKADNNSQLSTIVSIKNEALKLVVDKKYDVLECLM

Query:  RCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLGKHDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYP
        RCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDG+FPKSASSDARPCLYKAVVHTMLGKHDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYP
Subjt:  RCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLGKHDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYP

Query:  ELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAK
        ELLFLTDAKAPLKSLLSL    EEKNSSLAKIIPAK
Subjt:  ELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAK

XP_022955326.1 uncharacterized protein LOC111457322 isoform X1 [Cucurbita moschata] (e-value: 6.2e-59; % identity: 51.57)
Query:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP
        MES A+  GG P KLPL  P PST + + P WL F++   PFPRL NGS  T +SIG TRNP    K LA C + V      YSNA++D  K+LLSLV P
Subjt:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP

Query:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG
        K  N+S L  I S KNEALKLVV+ KY    C  + L +      +V YEAR+A++Q LI LDEY+KALEFLEED  FP+  S +AR  LYKAVVHTMLG
Subjt:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG

Query:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG
          D AE+ WN YL+TL +GNVN+    H  N       FL +AK+ LK LLSLK      +S L  IIP K  ALK VV   +  A+R M++L  K +D 
Subjt:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG

Query:  -EEAYEAQMAYIQILIHL
         EEA EAQ+AY+ ILI+L
Subjt:  -EEAYEAQMAYIQILIHL

XP_022955329.1 uncharacterized protein LOC111457322 isoform X2 [Cucurbita moschata] (e-value: 6.2e-59; % identity: 51.57)
Query:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP
        MES A+  GG P KLPL  P PST + + P WL F++   PFPRL NGS  T +SIG TRNP    K LA C + V      YSNA++D  K+LLSLV P
Subjt:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP

Query:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG
        K  N+S L  I S KNEALKLVV+ KY    C  + L +      +V YEAR+A++Q LI LDEY+KALEFLEED  FP+  S +AR  LYKAVVHTMLG
Subjt:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG

Query:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG
          D AE+ WN YL+TL +GNVN+    H  N       FL +AK+ LK LLSLK      +S L  IIP K  ALK VV   +  A+R M++L  K +D 
Subjt:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG

Query:  -EEAYEAQMAYIQILIHL
         EEA EAQ+AY+ ILI+L
Subjt:  -EEAYEAQMAYIQILIHL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LUX5 Uncharacterized protein (e-value: 1.8e-51; % identity: 48.58)
Query:  MESIAMFHGGSPPKLPLYAPPSTSLRM-APWLWFDL-SKSPFPRLVNGSTGI-SIGPTRN----PKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPPK
        MESI +    S PKLP  AP  T   M + WL F+L + +PF +L N ST I SIG   +     + L  C N VH  +       QDP KSLLSLV P 
Subjt:  MESIAMFHGGSPPKLPLYAPPSTSLRM-APWLWFDL-SKSPFPRLVNGSTGI-SIGPTRN----PKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPPK

Query:  ADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLGK
           NS  STI   K+EALKLV+D KY+  E  M  L  +GDT  DV YEAR+A++Q LI+LD+Y KAL FLE++G FP+S   + R  LYKAVV+TML K
Subjt:  ADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLGK

Query:  -HDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSL-KPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG
          DAEK WN Y+ TL   NVN    T+  N+T  E++ + DAK  LK LLS  KPAK E+N+ L+ II  KN A+K VV G ++ A+ LMK   E  KD 
Subjt:  -HDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSL-KPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG

Query:  EEAYEAQMAYIQILIHL
        +E  EAQ+ +I ILI+L
Subjt:  EEAYEAQMAYIQILIHL

A0A1S3BA48 uncharacterized protein LOC103487673 (e-value: 2.5e-58; % identity: 49.69)
Query:  MESIAMFHGGSPPKLPLYAPPST--SLRMAPWLWFDL-SKSPFPRLVNGSTGI-SIGPTRN----PKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP
        MESI +F   S PKLP  AP  T  S+  +PWL F+L + +PF +L N ST I +IG   +     K +A C N V   +    +  Q+P KSLLSLV P
Subjt:  MESIAMFHGGSPPKLPLYAPPST--SLRMAPWLWFDL-SKSPFPRLVNGSTGI-SIGPTRN----PKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP

Query:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG
          + NS  STI   K+EALKLV+D KY   E  M  L  +GDT  +V YEARVA++Q LI+LD+Y KAL FLEE+G FP S   + R CLYKAVV+TML 
Subjt:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG

Query:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLK-PAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKD
        K D AEK WN YL+TL N NVN     +  NNT  E++ + +AK  LK LLSLK PAK E+NS  + II  KN A+K VV G ++ A+ LMK   E  KD
Subjt:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLK-PAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKD

Query:  GEEAYEAQMAYIQILIHL
          E  EAQ+ Y+ ILI+L
Subjt:  GEEAYEAQMAYIQILIHL

A0A6J1CF48 uncharacterized protein LOC111010943 (e-value: 1.4e-125; % identity: 96.19)
Query:  MAPWLWFDLSKSPFPRLVNGSTGISIGPTRNPKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPPKADNNSQLSTIVSIKNEALKLVVDKKYDVLECLM
        MAPWLWFDLSKSPFPRLVNGSTGISIGPTRNPKFLAHC NI HSR+TQYSNAKQDPWKSLLSLVP KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLM
Subjt:  MAPWLWFDLSKSPFPRLVNGSTGISIGPTRNPKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPPKADNNSQLSTIVSIKNEALKLVVDKKYDVLECLM

Query:  RCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLGKHDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYP
        RCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDG+FPKSASSDARPCLYKAVVHTMLGKHDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYP
Subjt:  RCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLGKHDAEKCWNAYLKTLNNGNVNDTFLTHSNNNTYP

Query:  ELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAK
        ELLFLTDAKAPLKSLLSL    EEKNSSLAKIIPAK
Subjt:  ELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAK

A0A6J1GT93 uncharacterized protein LOC111457322 isoform X1 (e-value: 3.0e-59; % identity: 51.57)
Query:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP
        MES A+  GG P KLPL  P PST + + P WL F++   PFPRL NGS  T +SIG TRNP    K LA C + V      YSNA++D  K+LLSLV P
Subjt:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP

Query:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG
        K  N+S L  I S KNEALKLVV+ KY    C  + L +      +V YEAR+A++Q LI LDEY+KALEFLEED  FP+  S +AR  LYKAVVHTMLG
Subjt:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG

Query:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG
          D AE+ WN YL+TL +GNVN+    H  N       FL +AK+ LK LLSLK      +S L  IIP K  ALK VV   +  A+R M++L  K +D 
Subjt:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG

Query:  -EEAYEAQMAYIQILIHL
         EEA EAQ+AY+ ILI+L
Subjt:  -EEAYEAQMAYIQILIHL

A0A6J1GVX9 uncharacterized protein LOC111457322 isoform X2 (e-value: 3.0e-59; % identity: 51.57)
Query:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP
        MES A+  GG P KLPL  P PST + + P WL F++   PFPRL NGS  T +SIG TRNP    K LA C + V      YSNA++D  K+LLSLV P
Subjt:  MESIAMFHGGSPPKLPLYAP-PSTSLRMAP-WLWFDLSKSPFPRLVNGS--TGISIGPTRNP----KFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPP

Query:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG
        K  N+S L  I S KNEALKLVV+ KY    C  + L +      +V YEAR+A++Q LI LDEY+KALEFLEED  FP+  S +AR  LYKAVVHTMLG
Subjt:  KADNNSQLSTIVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLG

Query:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG
          D AE+ WN YL+TL +GNVN+    H  N       FL +AK+ LK LLSLK      +S L  IIP K  ALK VV   +  A+R M++L  K +D 
Subjt:  KHD-AEKCWNAYLKTLNNGNVNDTFLTHSNNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDG

Query:  -EEAYEAQMAYIQILIHL
         EEA EAQ+AY+ ILI+L
Subjt:  -EEAYEAQMAYIQILIHL

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G34540.2 unknown protein (e-value: 4.1e-08; % identity: 31.53)
Query:  IVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEF--LEEDGKFPKSASSDARPCLYKAVVHTMLGKH-DAEKC
        I SIK EA++ + + K +    L+R  + R     +  +  ++A V+ LI L+ Y +A E+  L ++     +  SD R  LYKA+++TML K  +A++C
Subjt:  IVSIKNEALKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEF--LEEDGKFPKSASSDARPCLYKAVVHTMLGKH-DAEKC

Query:  WNAYLKTLNNG
        W  + K++  G
Subjt:  WNAYLKTLNNG


Sequences
CDS sequence
ATGGAATCCATTGCTATGTTTCATGGTGGCTCACCACCCAAACTTCCCTTGTATGCACCTCCCTCTACCTCCCTGCGGATGGCTCCATGGCTCTGGTTCGACCTTAGCAA
GTCACCCTTCCCTCGGCTCGTCAATGGCTCGACCGGTATCTCCATCGGGCCGACTAGGAACCCCAAGTTTTTAGCTCATTGCATAAATATTGTACATTCAAGGAAGACAC
AGTACTCAAATGCTAAACAAGACCCATGGAAGTCTCTGTTGAGCTTAGTGCCACCAAAAGCAGATAACAACTCACAGTTATCGACCATTGTTTCCATTAAGAATGAGGCA
TTGAAGTTGGTAGTGGATAAGAAATACGATGTATTAGAGTGCCTTATGAGATGCTTAAGCAATAGGGGTGATACAAATGATGACGTGGAATACGAGGCTCGGGTGGCATA
TGTCCAATTTCTTATATATCTCGATGAATACAACAAAGCTCTAGAATTTCTAGAGGAGGATGGCAAATTTCCAAAATCTGCATCATCTGATGCAAGACCTTGCCTTTACA
AGGCTGTGGTACATACCATGTTGGGCAAACATGATGCTGAAAAATGTTGGAATGCATACCTAAAAACCCTTAACAATGGCAATGTAAATGACACATTTTTAACTCATTCC
AACAATAATACATATCCAGAGCTGTTGTTCTTGACCGATGCTAAAGCCCCATTGAAGTCACTATTGAGCTTAAAACCAGCAAAAGAGGAAAAAAACTCATCGTTAGCGAA
GATTATTCCCGCCAAGAATAAGGCATTGAAGGGAGTGGTGGGTGGGGGGTTCAAAAGAGCAGAACGCCTAATGAAAGATTTATGGGAGAAAGAGAAAGATGGGGAGGAGG
CGTACGAGGCTCAGATGGCATATATCCAAATTCTTATACATCTT
mRNA sequence
ATGGAATCCATTGCTATGTTTCATGGTGGCTCACCACCCAAACTTCCCTTGTATGCACCTCCCTCTACCTCCCTGCGGATGGCTCCATGGCTCTGGTTCGACCTTAGCAA
GTCACCCTTCCCTCGGCTCGTCAATGGCTCGACCGGTATCTCCATCGGGCCGACTAGGAACCCCAAGTTTTTAGCTCATTGCATAAATATTGTACATTCAAGGAAGACAC
AGTACTCAAATGCTAAACAAGACCCATGGAAGTCTCTGTTGAGCTTAGTGCCACCAAAAGCAGATAACAACTCACAGTTATCGACCATTGTTTCCATTAAGAATGAGGCA
TTGAAGTTGGTAGTGGATAAGAAATACGATGTATTAGAGTGCCTTATGAGATGCTTAAGCAATAGGGGTGATACAAATGATGACGTGGAATACGAGGCTCGGGTGGCATA
TGTCCAATTTCTTATATATCTCGATGAATACAACAAAGCTCTAGAATTTCTAGAGGAGGATGGCAAATTTCCAAAATCTGCATCATCTGATGCAAGACCTTGCCTTTACA
AGGCTGTGGTACATACCATGTTGGGCAAACATGATGCTGAAAAATGTTGGAATGCATACCTAAAAACCCTTAACAATGGCAATGTAAATGACACATTTTTAACTCATTCC
AACAATAATACATATCCAGAGCTGTTGTTCTTGACCGATGCTAAAGCCCCATTGAAGTCACTATTGAGCTTAAAACCAGCAAAAGAGGAAAAAAACTCATCGTTAGCGAA
GATTATTCCCGCCAAGAATAAGGCATTGAAGGGAGTGGTGGGTGGGGGGTTCAAAAGAGCAGAACGCCTAATGAAAGATTTATGGGAGAAAGAGAAAGATGGGGAGGAGG
CGTACGAGGCTCAGATGGCATATATCCAAATTCTTATACATCTT
Protein sequence
MESIAMFHGGSPPKLPLYAPPSTSLRMAPWLWFDLSKSPFPRLVNGSTGISIGPTRNPKFLAHCINIVHSRKTQYSNAKQDPWKSLLSLVPPKADNNSQLSTIVSIKNEA
LKLVVDKKYDVLECLMRCLSNRGDTNDDVEYEARVAYVQFLIYLDEYNKALEFLEEDGKFPKSASSDARPCLYKAVVHTMLGKHDAEKCWNAYLKTLNNGNVNDTFLTHS
NNNTYPELLFLTDAKAPLKSLLSLKPAKEEKNSSLAKIIPAKNKALKGVVGGGFKRAERLMKDLWEKEKDGEEAYEAQMAYIQILIHL
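
The CDS and protein sequences listed above are related by translation, and the relationship can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0. The `translate` helper and the constant names are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption for this record.
# Codons are enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0.

    Whitespace and line breaks are removed first, unknown codons become 'X',
    and a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGGAATCCATTGCTATGTTTCATGGTGGC"))  # MESIAMFHGG
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, after joining its lines, is one way to confirm that the two listings agree.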