CuGenDBv2

Moc06g41650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g41650
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:32689850..32694047
RNA-Seq Expression: Moc06g41650
Synteny: Moc06g41650
Gene Ontology terms: NA
InterPro domains: NA
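
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as follows. `Location` and `parse_location` are hypothetical names introduced here for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr6:32689850..32694047'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr6:32689850..32694047")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption, `chr6:32689850..32694047` spans 4198 bp.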


Homology
GenBank top hits | e-value | % identity
KAA0050949.1 uncharacterized protein E6C27_scaffold761G00770 [Cucumis melo var. makuwa] | 2.5e-51 | 31.25
Query:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK
        D E+LK SIL   ML IG W+     EGD LV KF+F  +QL  E+ RNG +QKIEIEWSNI+G++A L+E+  G L++EL  PPKFY+E++ + R H +
Subjt:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK

Query:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQ---------------FFVNRTQMIPMIGSSNNLQMASSSS
        W DGSDFTE QA   RR+ I FPP VL + +E+L + DK L+ LSQ  FPT   +YF S+                  N+    P   S+N +   +S  
Subjt:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQ---------------FFVNRTQMIPMIGSSNNLQMASSSS

Query:  S--------------------------SSMPQYSQVPNDHLSNFNMVSNDHLLNNRLTCPNEVSNGHMLSGPTYPIGASNNYCDPTNDHLLDGSTYFNGV
        S                          +SM +Y +V     + +N    DH L+N     N +++   L G      A+ N     N    D    FNG 
Subjt:  S--------------------------SSMPQYSQVPNDHLSNFNMVSNDHLLNNRLTCPNEVSNGHMLSGPTYPIGASNNYCDPTNDHLLDGSTYFNGV

Query:  SNNYSNLPTDPRGVWPLLLL------------------------SFPISLVQLQPSSFALAPGRHAVP----------QNEGDLALRFDYGAENISWEIV
         +  +     P  V P+  +                          P++   L  S+  L   R  +P            + +L ++ ++ ++ I +E++
Subjt:  SNNYSNLPTDPRGVWPLLLL------------------------SFPISLVQLQPSSFALAPGRHAVP----------QNEGDLALRFDYGAENISWEIV

Query:  NEGPSTKHKIEIDWSNIIGIQASTEDHRQGTLQLELLYPPKFYKEV----NVNLETHGT-SKWIEGSDFTEDHQASKCRRHLAAFP----PGVLNEQLEK
          G  ++ KIE+DWSNI+GI+AS  D   G L++EL  PP  Y+ +    N N+E+H   +KW+ G DFT+  QA+ C R   +FP    P    +Q   
Subjt:  NEGPSTKHKIEIDWSNIIGIQASTEDHRQGTLQLELLYPPKFYKEV----NVNLETHGT-SKWIEGSDFTEDHQASKCRRHLAAFP----PGVLNEQLEK

Query:  LIKYDERLYELCHRRFPTFELAYFLSQN
         I Y++  + +         L + L Q+
Subjt:  LIKYDERLYELCHRRFPTFELAYFLSQN

KAE8653342.1 hypothetical protein Csa_007445 [Cucumis sativus] | 1.0e-36 | 51.22
Query:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK
        D E+LK SIL   ML IG W+     EGD LV KF+F  +QL  E+ RNG +QKIEIEWSNI+G++A L+E+  G L++EL  PPKFY+E++ + R H +
Subjt:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK

Query:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVN
        W DGSDFTE QA   RR+ I FPP VL + +E+L + DK L+ LS+  FPT   +YF S+   N
Subjt:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVN

KAG7024021.1 hypothetical protein SDJN02_15050, partial [Cucurbita argyrosperma subsp. argyrosperma] | 7.4e-64 | 36.11
Query:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI
        MK+  ++     K K IS E D E+LK S L   ML IG W+     EGD LV KF+F  + L  E+ RNG ++K+EIEWSNI+G++A ++EN  G L++
Subjt:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI

Query:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMIPMIGSSNNLQ
        EL  PPKFY+E++ + R H +W DG DFT+ QA   RR+ I FPP VL + +++L + D+RL+ LS+  FPTF   YFPS+   NR   I          
Subjt:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMIPMIGSSNNLQ

Query:  MASSSSSSSMPQYSQVP-NDHLSNFNMVSNDHLLNNRLTCPNEVSNGHMLSGPTYPIGASNNYCDPTNDHLLDGSTYFNGVSNNYSNLPTDPRGVWPLLL
                   Q+ ++   D+  + N   N+   + R+   N+V+         Y I AS NY +       +G    N     + N    P  V     
Subjt:  MASSSSSSSMPQYSQVP-NDHLSNFNMVSNDHLLNNRLTCPNEVSNGHMLSGPTYPIGASNNYCDPTNDHLLDGSTYFNGVSNNYSNLPTDPRGVWPLLL

Query:  LSFPISLVQLQPSSFALAPGRHAVPQNEGDLALRFDYGAENISWEIVNEGPSTKHKIEIDWSNIIGIQASTEDHRQGTLQLELLYPPKFYKEVNVNLETH
                               +P N   L ++  +  + + +EI + G   + KIEIDWSNIIGI+A +E             PPKFYKE+ +  E  
Subjt:  LSFPISLVQLQPSSFALAPGRHAVPQNEGDLALRFDYGAENISWEIVNEGPSTKHKIEIDWSNIIGIQASTEDHRQGTLQLELLYPPKFYKEVNVNLETH

Query:  GTSKWIEGSDFTEDHQASKCRRHLAAFPPGVLNEQLEKLIKYDERLYELCHRRFPTFELAYFLSQNFL
          S+W++GSD T D QAS CRRH   FPP +L++  E+LI++DERL+ L  R +PTF + YF S+ FL
Subjt:  GTSKWIEGSDFTEDHQASKCRRHLAAFPPGVLNEQLEKLIKYDERLYELCHRRFPTFELAYFLSQNFL

XP_022968797.1 uncharacterized protein LOC111467928 isoform X1 [Cucurbita maxima] | 1.2e-37 | 47.87
Query:  LHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQEL-KIDRRSHNKWIDGSDFTEDQAS
        L+IGKW+  P    ++LVVK  F+ KQL  E+S NG R KIEI+WSNI+G++A +K+N  G L++EL  PPKFY+EL + D  +H++W+DGSDFT  QAS
Subjt:  LHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQEL-KIDRRSHNKWIDGSDFTEDQAS

Query:  RCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMIPMIGSSNNLQMASSSSSSSMPQYSQVPNDHLS
         CRR+ I FPP +L + FERLI  D+RL+ LSQ  +PTF + YF SQ F+      P     N+    S +  SS   Y+   ND  S
Subjt:  RCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMIPMIGSSNNLQMASSSSSSSMPQYSQVPNDHLS

XP_031736050.1 uncharacterized protein LOC105436206 [Cucumis sativus] | 1.0e-36 | 51.22
Query:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK
        D E+LK SIL   ML IG W+     EGD LV KF+F  +QL  E+ RNG +QKIEIEWSNI+G++A L+E+  G L++EL  PPKFY+E++ + R H +
Subjt:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK

Query:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVN
        W DGSDFTE QA   RR+ I FPP VL + +E+L + DK L+ LS+  FPT   +YF S+   N
Subjt:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVN

TrEMBL top hits | e-value | % identity
A0A0A0M1F5 Uncharacterized protein | 4.9e-37 | 51.22
Query:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK
        D E+LK SIL   ML IG W+     EGD LV KF+F  +QL  E+ RNG +QKIEIEWSNI+G++A L+E+  G L++EL  PPKFY+E++ + R H +
Subjt:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK

Query:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVN
        W DGSDFTE QA   RR+ I FPP VL + +E+L + DK L+ LS+  FPT   +YF S+   N
Subjt:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVN

A0A5A7UBN7 Uncharacterized protein | 1.2e-51 | 31.25
Query:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK
        D E+LK SIL   ML IG W+     EGD LV KF+F  +QL  E+ RNG +QKIEIEWSNI+G++A L+E+  G L++EL  PPKFY+E++ + R H +
Subjt:  DKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNK

Query:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQ---------------FFVNRTQMIPMIGSSNNLQMASSSS
        W DGSDFTE QA   RR+ I FPP VL + +E+L + DK L+ LSQ  FPT   +YF S+                  N+    P   S+N +   +S  
Subjt:  WIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQ---------------FFVNRTQMIPMIGSSNNLQMASSSS

Query:  S--------------------------SSMPQYSQVPNDHLSNFNMVSNDHLLNNRLTCPNEVSNGHMLSGPTYPIGASNNYCDPTNDHLLDGSTYFNGV
        S                          +SM +Y +V     + +N    DH L+N     N +++   L G      A+ N     N    D    FNG 
Subjt:  S--------------------------SSMPQYSQVPNDHLSNFNMVSNDHLLNNRLTCPNEVSNGHMLSGPTYPIGASNNYCDPTNDHLLDGSTYFNGV

Query:  SNNYSNLPTDPRGVWPLLLL------------------------SFPISLVQLQPSSFALAPGRHAVP----------QNEGDLALRFDYGAENISWEIV
         +  +     P  V P+  +                          P++   L  S+  L   R  +P            + +L ++ ++ ++ I +E++
Subjt:  SNNYSNLPTDPRGVWPLLLL------------------------SFPISLVQLQPSSFALAPGRHAVP----------QNEGDLALRFDYGAENISWEIV

Query:  NEGPSTKHKIEIDWSNIIGIQASTEDHRQGTLQLELLYPPKFYKEV----NVNLETHGT-SKWIEGSDFTEDHQASKCRRHLAAFP----PGVLNEQLEK
          G  ++ KIE+DWSNI+GI+AS  D   G L++EL  PP  Y+ +    N N+E+H   +KW+ G DFT+  QA+ C R   +FP    P    +Q   
Subjt:  NEGPSTKHKIEIDWSNIIGIQASTEDHRQGTLQLELLYPPKFYKEV----NVNLETHGT-SKWIEGSDFTEDHQASKCRRHLAAFP----PGVLNEQLEK

Query:  LIKYDERLYELCHRRFPTFELAYFLSQN
         I Y++  + +         L + L Q+
Subjt:  LIKYDERLYELCHRRFPTFELAYFLSQN

A0A6J1HC39 uncharacterized protein LOC111462091 | 1.4e-36 | 46.24
Query:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI
        MK+  ++     K K IS E D E+LK S L   ML IG W+     EGD LV KF+F  + L  E+ RNG ++K+EIEWSNI+G++A ++EN  G L++
Subjt:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI

Query:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNR
        EL  PPKFY+E++ + R H +W DG DFT+ QA   RR+ I FPP VL + +++L + D+RL+ LS+  FPTF   YFPS+   N+
Subjt:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNR

A0A6J1HUI1 uncharacterized protein LOC111467928 isoform X1 | 5.8e-38 | 47.87
Query:  LHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQEL-KIDRRSHNKWIDGSDFTEDQAS
        L+IGKW+  P    ++LVVK  F+ KQL  E+S NG R KIEI+WSNI+G++A +K+N  G L++EL  PPKFY+EL + D  +H++W+DGSDFT  QAS
Subjt:  LHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQEL-KIDRRSHNKWIDGSDFTEDQAS

Query:  RCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMIPMIGSSNNLQMASSSSSSSMPQYSQVPNDHLS
         CRR+ I FPP +L + FERLI  D+RL+ LSQ  +PTF + YF SQ F+      P     N+    S +  SS   Y+   ND  S
Subjt:  RCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMIPMIGSSNNLQMASSSSSSSMPQYSQVPNDHLS

A0A6J1I0K3 uncharacterized protein LOC111467893 | 1.4e-36 | 45.79
Query:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI
        MK+  ++     K K I+ E D E+LK S L   ML IG W+     EGD LV KF+F  + L  E+ RNG ++K+EIEWSNI+G++A ++EN  G L++
Subjt:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI

Query:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMI
        EL  PPKFY+E++ + R H +W DG DFTE QA   RR+ I FPP VL + +++L + D+RL+ LS+  FPTF   YFPS+   N+   I
Subjt:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMI

SwissProt top hits | e-value | % identity
No hits found
Arabidopsis top hits | e-value | % identity
AT1G54300.1 unknown protein | 6.1e-16 | 33.78
Query:  DELVVKFFFIMKQLAM-------EVSRNGYRQKIEIEWSNIVGMKATL-KENGSGTLKIELLLPPKFYQELKIDRRSHNKWID-GSDFTEDQASRCRRYL
        D++V KF+F  K+L         E +    ++KIEI+W+++   + ++   + +G LKIEL   P F+ E       H +W     DFT D AS  RR+ 
Subjt:  DELVVKFFFIMKQLAM-------EVSRNGYRQKIEIEWSNIVGMKATL-KENGSGTLKIELLLPPKFYQELKIDRRSHNKWID-GSDFTEDQASRCRRYL

Query:  INFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQ
        ++FPPGVL+++ E+L+  D     L +  FP  E  YF S F  N ++
Subjt:  INFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQ

AT2G24100.1 unknown protein | 9.7e-30 | 38.81
Query:  EKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNKWI
        EKLK S     +L IG+W+++   EGD LV K +F   +L  EV   G + KIEI+WS+I+ +KA L E+  GTL I L   P F++E     R H  W 
Subjt:  EKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQELKIDRRSHNKWI

Query:  DGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMI--PMIGSSNNLQMASSSSSSSMPQYSQVPNDHL
          SDFT+ QAS  R++ +  PPG++ + FE+L+ CD RL+ L  SR P   LA   + FF +R  +   P +  S+N+  AS   + S  ++  + +D L
Subjt:  DGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMI--PMIGSSNNLQMASSSSSSSMPQYSQVPNDHL

Query:  S
        S
Subjt:  S

AT3G05770.1 unknown protein | 1.0e-15 | 32.95
Query:  EKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAME-------VSRNGYRQKIEIEWSNIVGMKATL-KENGSGTLKIELLLPPKFYQELKID
        EKLK        + IG   F      D++V KF+F  K+L  E        +    + KIEI+W+++   + ++   + +G LKIEL   P F+ E    
Subjt:  EKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAME-------VSRNGYRQKIEIEWSNIVGMKATL-KENGSGTLKIELLLPPKFYQELKID

Query:  RRSHNKWID-GSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRT
           H +W     DFT DQAS  RR+ ++FPPGVL+++ E+L+  D     L +  FP  E  YF   F  N +
Subjt:  RRSHNKWID-GSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRT

AT4G30780.1 unknown protein | 2.2e-26 | 35.36
Query:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI
        +K +SK +  +            EKLK S     +L IG+W+++   EGD LV K +F   +L  EV   G + KIEI+WS+I+ +KA   E+G GTL +
Subjt:  MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKI

Query:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQ
         L   P F++E     R H  W   SDFT+ QAS  R++ +    G++ + FE+L+ CD RL+ LS+      +  YF ++
Subjt:  ELLLPPKFYQELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQ
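
In the alignments above, the unlabeled line between Query and Subjt appears to follow the usual BLAST pairwise convention: a letter marks an identical residue, `+` a conservative substitution, and a blank neither. The following is a minimal sketch of counting these, assuming the line was captured with its whitespace intact. `midline_stats` is a hypothetical helper and not part of any CuGenDB or BLAST tool.

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives in one BLAST-style match line.

    ASSUMPTION: `midline` covers exactly the aligned columns, with
    leading and trailing blanks preserved.
    """
    identities = sum(ch.isalpha() for ch in midline)
    # "Positives" conventionally include identities plus '+' columns.
    positives = identities + midline.count("+")
    return {
        "identities": identities,
        "positives": positives,
        "columns": len(midline),
    }


# First ten columns of the first GenBank alignment's match line.
print(midline_stats("D E+LK SIL"))
```

Dividing `identities` by `columns` gives a percent identity for that fragment only; the values in the hit tables refer to whole alignments.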


Sequences
CDS sequence
ATGAAAGAACAAAGCAAGAAACTTCCAAGGTCAAAAAAGGCTAAAGATATTTCGAAAGAGGTCGATAAAGAAAAGTTGAAGGTCTCTATTTTAATACCTGAAATGCTTCA
CATTGGCAAATGGAAGTTTGAGCCAACATTAGAAGGTGATGAATTGGTGGTAAAATTCTTTTTTATAATGAAGCAATTAGCTATGGAGGTTTCGAGGAATGGGTATAGGC
AAAAGATTGAAATAGAATGGTCAAATATTGTGGGGATGAAAGCTACTTTAAAAGAAAATGGATCCGGAACCCTAAAAATTGAGCTATTACTTCCACCCAAGTTCTACCAA
GAACTCAAAATTGATCGACGGTCACACAATAAATGGATTGATGGATCAGATTTTACAGAAGACCAAGCTTCTAGATGCAGGAGGTATTTGATTAATTTTCCACCCGGAGT
GCTGAAGGAGTCGTTTGAGAGACTAATAAATTGTGACAAGCGATTGTATGGCCTGAGCCAAAGCCGATTTCCGACCTTTGAATTGGCCTACTTCCCTTCACAATTTTTTG
TGAATAGAACACAAATGATTCCAATGATTGGAAGCTCCAACAATTTGCAAATGGCTTCTTCTTCTTCTTCATCTTCAATGCCACAGTATTCACAGGTTCCAAATGACCAT
TTGTCAAATTTCAATATGGTTTCAAATGACCACTTGTTGAATAATAGACTCACTTGTCCTAATGAAGTTTCAAATGGCCACATGTTGAGTGGACCAACATATCCCATTGG
AGCTTCAAATAATTACTGCGACCCAACAAATGACCACTTGTTGGATGGATCAACATATTTCAATGGAGTCTCGAATAACTACTCCAATTTGCCAACAGACCCACGTGGAG
TTTGGCCTCTTCTTCTTCTGAGCTTCCCAATATCTCTAGTGCAGCTCCAGCCCTCTTCCTTCGCATTGGCTCCTGGCAGGCATGCAGTGCCCCAAAATGAAGGTGATTTG
GCGTTGAGATTTGATTATGGGGCCGAGAATATATCGTGGGAGATTGTAAATGAAGGGCCTTCGACAAAGCACAAGATTGAAATTGATTGGTCCAACATCATTGGGATTCA
AGCTTCTACTGAAGATCACAGACAAGGAACCCTCCAACTTGAGCTATTATATCCGCCAAAATTCTACAAAGAAGTCAATGTTAATCTAGAGACACACGGTACTAGTAAAT
GGATTGAGGGATCAGATTTTACTGAAGATCACCAAGCTTCTAAATGCAGGAGGCATTTGGCTGCGTTTCCACCTGGAGTGTTGAACGAGCAGTTGGAGAAACTAATAAAG
TATGATGAGCGATTGTATGAACTGTGCCATCGTCGATTTCCAACGTTTGAATTGGCTTACTTCCTTTCACAAAATTTTCTCTAA
mRNA sequence
ATGAAAGAACAAAGCAAGAAACTTCCAAGGTCAAAAAAGGCTAAAGATATTTCGAAAGAGGTCGATAAAGAAAAGTTGAAGGTCTCTATTTTAATACCTGAAATGCTTCA
CATTGGCAAATGGAAGTTTGAGCCAACATTAGAAGGTGATGAATTGGTGGTAAAATTCTTTTTTATAATGAAGCAATTAGCTATGGAGGTTTCGAGGAATGGGTATAGGC
AAAAGATTGAAATAGAATGGTCAAATATTGTGGGGATGAAAGCTACTTTAAAAGAAAATGGATCCGGAACCCTAAAAATTGAGCTATTACTTCCACCCAAGTTCTACCAA
GAACTCAAAATTGATCGACGGTCACACAATAAATGGATTGATGGATCAGATTTTACAGAAGACCAAGCTTCTAGATGCAGGAGGTATTTGATTAATTTTCCACCCGGAGT
GCTGAAGGAGTCGTTTGAGAGACTAATAAATTGTGACAAGCGATTGTATGGCCTGAGCCAAAGCCGATTTCCGACCTTTGAATTGGCCTACTTCCCTTCACAATTTTTTG
TGAATAGAACACAAATGATTCCAATGATTGGAAGCTCCAACAATTTGCAAATGGCTTCTTCTTCTTCTTCATCTTCAATGCCACAGTATTCACAGGTTCCAAATGACCAT
TTGTCAAATTTCAATATGGTTTCAAATGACCACTTGTTGAATAATAGACTCACTTGTCCTAATGAAGTTTCAAATGGCCACATGTTGAGTGGACCAACATATCCCATTGG
AGCTTCAAATAATTACTGCGACCCAACAAATGACCACTTGTTGGATGGATCAACATATTTCAATGGAGTCTCGAATAACTACTCCAATTTGCCAACAGACCCACGTGGAG
TTTGGCCTCTTCTTCTTCTGAGCTTCCCAATATCTCTAGTGCAGCTCCAGCCCTCTTCCTTCGCATTGGCTCCTGGCAGGCATGCAGTGCCCCAAAATGAAGGTGATTTG
GCGTTGAGATTTGATTATGGGGCCGAGAATATATCGTGGGAGATTGTAAATGAAGGGCCTTCGACAAAGCACAAGATTGAAATTGATTGGTCCAACATCATTGGGATTCA
AGCTTCTACTGAAGATCACAGACAAGGAACCCTCCAACTTGAGCTATTATATCCGCCAAAATTCTACAAAGAAGTCAATGTTAATCTAGAGACACACGGTACTAGTAAAT
GGATTGAGGGATCAGATTTTACTGAAGATCACCAAGCTTCTAAATGCAGGAGGCATTTGGCTGCGTTTCCACCTGGAGTGTTGAACGAGCAGTTGGAGAAACTAATAAAG
TATGATGAGCGATTGTATGAACTGTGCCATCGTCGATTTCCAACGTTTGAATTGGCTTACTTCCTTTCACAAAATTTTCTCTAA
Protein sequence
MKEQSKKLPRSKKAKDISKEVDKEKLKVSILIPEMLHIGKWKFEPTLEGDELVVKFFFIMKQLAMEVSRNGYRQKIEIEWSNIVGMKATLKENGSGTLKIELLLPPKFYQ
ELKIDRRSHNKWIDGSDFTEDQASRCRRYLINFPPGVLKESFERLINCDKRLYGLSQSRFPTFELAYFPSQFFVNRTQMIPMIGSSNNLQMASSSSSSSMPQYSQVPNDH
LSNFNMVSNDHLLNNRLTCPNEVSNGHMLSGPTYPIGASNNYCDPTNDHLLDGSTYFNGVSNNYSNLPTDPRGVWPLLLLSFPISLVQLQPSSFALAPGRHAVPQNEGDL
ALRFDYGAENISWEIVNEGPSTKHKIEIDWSNIIGIQASTEDHRQGTLQLELLYPPKFYKEVNVNLETHGTSKWIEGSDFTEDHQASKCRRHLAAFPPGVLNEQLEKLIK
YDERLYELCHRRFPTFELAYFLSQNFL
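
The protein sequence above can be reproduced from the CDS by codon-wise translation. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, and it assumes the CDS starts in frame and uses the standard genetic code (NCBI translation table 1).

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS above; it begins on a codon boundary.
tail = ("TATGATGAGCGATTGTATGAACTGTGCCATCGTCGATTTCCAACGTTTGAATTGGCTTAC"
        "TTCCTTTCACAAAATTTTCTCTAA")
print(translate(tail))
```

Applied to the final line of the CDS, this returns `YDERLYELCHRRFPTFELAYFLSQNFL`, which matches the final line of the protein sequence.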