; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g15040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g15040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF1635)
Genome locationchr3:10140141..10144701
RNA-Seq ExpressionMoc03g15040
SyntenyMoc03g15040
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR012862 - Protein of unknown function DUF1635


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.4e-10474.8Show/hide
Query:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG
        LPLHPF QEF  RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAELL V+QL  CFEAKRIAKKPGR+YMC+RKGAGGIVKGPTSIKGWV KWF+ASG
Subjt:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG

Query:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSK
        EWLAKDESGR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRG+K+GTLVTD+L LES LLDYN  VRPIE SRPNS LAMVC F   VKRKSK
Subjt:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSK

Query:  GRAHALKTVVGTEPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR
        GRAHAL+    ++P TP V         GP+SE P PVIEL+ SG    EKRPR
Subjt:  GRAHALKTVVGTEPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR

XP_022158099.1 uncharacterized protein LOC111024665 [Momordica charantia]5.1e-9496.43Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPAVMSRRSVIFALLLF
        SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPAVMSRR    ++L F
Subjt:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPAVMSRRSVIFALLLF

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]2.3e-8682.7Show/hide
Query:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG
        LPLHPF QEF  RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAELL V+QL  CFEAKRIAKKPGR+YMC+RKGAGGIVKGPTSIKGWV KWF+ASG
Subjt:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG

Query:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSEL
        EWLAKDESGR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRG+K+GTLVTD+L LES LLDYN  VRPIE+SRPNSEL
Subjt:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]1.6e-8782.89Show/hide
Query:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG
        LPLHPF QEF  RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAELL V+QL  CFEAKRIAKKPGR+YMC+RKGA GIVKGPTSIKGWV KWF+ASG
Subjt:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG

Query:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAM
        EWLAKDESGR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+HFPRG+K+GTLVTDKL LES LLDYN  VRPIE+SRPNSEL M
Subjt:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.2e-12468.42Show/hide
Query:  LARRLESELEEIENFRFSDDGEESDTSTSGQGLEYPSRMPEHYLGPLRRG--------------------------------------LPLHPFAQEFHN
        LARRLES+LEEIEN R SDDGE+SD STSGQGLEYPSR+PEHYLG LRRG                                      LPLHPF QEF  
Subjt:  LARRLESELEEIENFRFSDDGEESDTSTSGQGLEYPSRMPEHYLGPLRRG--------------------------------------LPLHPFAQEFHN

Query:  RTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRPF
        RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAEL  V+QL  CFEAKRIAKKPGR+YMC+RKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESGR F
Subjt:  RTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRPF

Query:  FDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGT
        FDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRG+K+GTLVTD+L LES LLDYN  VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL+    +
Subjt:  FDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGT

Query:  EPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR
        +PATP V         GP+SE P  VIEL+ SG    EKRPR
Subjt:  EPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138266.9e-10574.8Show/hide
Query:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG
        LPLHPF QEF  RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAELL V+QL  CFEAKRIAKKPGR+YMC+RKGAGGIVKGPTSIKGWV KWF+ASG
Subjt:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG

Query:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSK
        EWLAKDESGR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRG+K+GTLVTD+L LES LLDYN  VRPIE SRPNS LAMVC F   VKRKSK
Subjt:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSK

Query:  GRAHALKTVVGTEPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR
        GRAHAL+    ++P TP V         GP+SE P PVIEL+ SG    EKRPR
Subjt:  GRAHALKTVVGTEPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR

A0A6J1DWD2 uncharacterized protein LOC1110246801.1e-8682.7Show/hide
Query:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG
        LPLHPF QEF  RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAELL V+QL  CFEAKRIAKKPGR+YMC+RKGAGGIVKGPTSIKGWV KWF+ASG
Subjt:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG

Query:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSEL
        EWLAKDESGR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRG+K+GTLVTD+L LES LLDYN  VRPIE+SRPNSEL
Subjt:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSEL

A0A6J1DWF1 uncharacterized protein LOC1110251087.7e-8882.89Show/hide
Query:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG
        LPLHPF QEF  RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAELL V+QL  CFEAKRIAKKPGR+YMC+RKGA GIVKGPTSIKGWV KWF+ASG
Subjt:  LPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASG

Query:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAM
        EWLAKDESGR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+HFPRG+K+GTLVTDKL LES LLDYN  VRPIE+SRPNSEL M
Subjt:  EWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255026.0e-12568.42Show/hide
Query:  LARRLESELEEIENFRFSDDGEESDTSTSGQGLEYPSRMPEHYLGPLRRG--------------------------------------LPLHPFAQEFHN
        LARRLES+LEEIEN R SDDGE+SD STSGQGLEYPSR+PEHYLG LRRG                                      LPLHPF QEF  
Subjt:  LARRLESELEEIENFRFSDDGEESDTSTSGQGLEYPSRMPEHYLGPLRRG--------------------------------------LPLHPFAQEFHN

Query:  RTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRPF
        RTGLAP QVAPNGWGVIFALAILFWL+ARD +EAEL  V+QL  CFEAKRIAKKPGR+YMC+RKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESGR F
Subjt:  RTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAELLSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRPF

Query:  FDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGT
        FDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRG+K+GTLVTD+L LES LLDYN  VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL+    +
Subjt:  FDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTDKLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGT

Query:  EPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR
        +PATP V         GP+SE P  VIEL+ SG    EKRPR
Subjt:  EPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPR

A0A6J1DYE5 uncharacterized protein LOC1110246652.5e-9496.43Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPAVMSRRSVIFALLLF
        SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPAVMSRR    ++L F
Subjt:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPAVMSRRSVIFALLLF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G28140.1 Protein of unknown function (DUF1635)9.8e-1937.06Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        +EEL+  LL+T +ELE  +M A++E+I   + +  L +LL  A KE+DEA+++  ++L    L  NF            +   +     P ++ F  +  
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP-LQPLNIPPVLI
             ++ +P  + IE       LPEKG+LL++V++AGPLLQTLL+AG LP+WR+PPP L+   IPPV+I
Subjt:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP-LQPLNIPPVLI

AT2G28690.1 Protein of unknown function (DUF1635)1.0e-3649.22Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        M+EL++KL Y++ ELE+VK +AN+E   ++E +KNLL+LL++A +ERDEAKDQL KLL               +K NSSITES+S   SP VDSFF+ VS
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGN-------------------------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPL-QPLNIPPV
        S +  N                          +DP   +++ I+KG+ LPEKG+LLQ+VME+GPLLQTLLVAGPLPRWRNPPPL Q   +PP+
Subjt:  SPDSGN-------------------------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPL-QPLNIPPV

AT3G44940.1 Protein of unknown function (DUF1635)6.1e-2137.63Show/hide
Query:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQL-HKLLNKFMLPSNFQAES---------PVVKANSSITESSSLSGSPV
        EEL++ L+YT +ELE  K+ A++E+    E L +L ++L    KERDEA ++  H LLN  +L    Q            P+  A+S I +       P 
Subjt:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQL-HKLLNKFMLPSNFQAES---------PVVKANSSITESSSLSGSPV

Query:  VDSFFDAVSSPDSGNNMDPSAL---------------VIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI
        ++S     SS    + M PS +               ++ +++  + LPEKG+LLQ+V++AGPLLQTLL+AGPLP+WR+ PPPL+   IPPV I
Subjt:  VDSFFDAVSSPDSGNNMDPSAL---------------VIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI

AT5G22930.1 Protein of unknown function (DUF1635)8.9e-2035.33Show/hide
Query:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL-NKFMLPSNFQAESPVVKANS---------SITESSSLSGSPV
        EE+++ LLYT +EL+  KM A +E+    E L +L ++L    KERDEA ++  +L+ +   L        P+  A+S          +  + S S S  
Subjt:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL-NKFMLPSNFQAESPVVKANS---------SITESSSLSGSPV

Query:  VDSFFDAVS-----SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI
         +S            P     +  + ++++ +++ + LPEKG+LLQ+V++AGPLLQTLL+AGPLP+WR+ PPPL+   IPPV +
Subjt:  VDSFFDAVS-----SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI

AT5G59760.1 Protein of unknown function (DUF1635)2.1e-2135.68Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL--------NKFMLPSNFQAESPVVKANSSITESSSLSGSPVV
        ++E+++ L  T  ELE++KMEAN++   ++E +  LLNLL+   +ERDEA+ QL + +        ++ +  SN  + S  V ++SS   S+ L+  P  
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL--------NKFMLPSNFQAESPVVKANSSITESSSLSGSPVV

Query:  DSFFDAVSSPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP---LQPLNIPPVLINGHDM
         +  +  ++ +    +DP    ++ +V G+  PE G+LL++V+EAGPLL+TLL+AGPLP+W NPPP    Q   +P +   G D+
Subjt:  DSFFDAVSSPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP---LQPLNIPPVLINGHDM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGCTGAAAGAGAAGCTCTTATACACAGCGATTGAGCTGGAATCAGTGAAAATGGAAGCAAATCAAGAGATGATAAACAACAAAGAGAACTTAAAGAATCTGCT
GAATCTTCTTCAGATGGCATACAAAGAACGAGATGAAGCAAAGGACCAGCTGCATAAGCTGCTCAACAAATTCATGTTGCCATCCAATTTCCAAGCAGAGAGCCCTGTTG
TCAAAGCAAATTCAAGCATTACAGAATCTAGCAGCCTCTCCGGCTCCCCCGTCGTCGATTCCTTCTTTGACGCGGTTTCGTCGCCCGATTCGGGCAACAATATGGATCCA
AGCGCATTGGTGATTGAGAGCATTGTGAAGGGGAGGAGGCTGCCGGAGAAGGGGAGGCTGCTGCAGTCTGTGATGGAGGCAGGGCCTTTACTGCAGACGCTTCTCGTCGC
CGGGCCGCTCCCTCGGTGGCGCAATCCTCCGCCGCTGCAGCCCTTAAACATCCCACCAGTTCTCATCAATGGACACGACATGCCGGATGCTGAACAGAAACCAGCAGTGA
TGTCTCGAAGGTCAGTCATTTTCGCTTTACTTCTTTTTCCAAATATGGTGGTTTTCTTGTCTTCCTCATCCAGTAGTGATAGCATAGGCAGTGCGGGTCAGACCATAAGT
AGTTCGCCCCCCAAACCAAGCGATTCTGGGGAGGGCTTAGCTCGTAGGTTGGAGTCCGAGCTGGAAGAAATAGAGAACTTTAGGTTCTCAGACGACGGGGAGGAGAGTGA
CACTTCCACCTCGGGCCAGGGTTTGGAATACCCTTCTAGAATGCCTGAACATTATCTTGGACCCCTCCGTAGGGGACTTCCTCTCCATCCTTTTGCTCAAGAGTTCCACA
ACCGAACTGGTTTGGCGCCGACACAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCCTTAGCCATTCTTTTTTGGTTGCAAGCTCGGGACGAGGACGAGGCCGAGCTA
CTAAGCGTTAACCAACTTTTTGGGTGCTTTGAGGCCAAGAGGATAGCTAAAAAACCAGGTCGGTATTATATGTGCTCGAGGAAGGGCGCGGGTGGCATAGTCAAGGGGCC
GACCTCCATCAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTCAGGTCGTCCCTTCTTTGACGTGCCTGTTAGGTTTGGGAACC
TAGTATCGATCAAGCCGATTCCCGAGCTCGCTCAAGCCACCTTCGACACCCTCAAACACTACAAGGACCACTTCCCAAGGGGCCAGAAGATCGGAACCTTGGTCACTGAC
AAGCTGTTCCTAGAGTCGGAGCTGCTGGACTACAACTCTTTAGTTCGTCCAATTGAAGCTTCGAGGCCAAACTCCGAGCTCGCAATGGTGTGCGGATTCACTGGCAGCGT
GAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAGACCGTGGTGGGTACGGAACCGGCGACGCCTATCGTGCCGCGGAATGAGGCTCATGGTAACTCTGGGCCATCTT
CTGAAGTCCCCACCCCCGTGATCGAGCTAGATTTATCTGGGGATCGATATGGGGAGAAGCGCCCAAGGATTGGGTCCATCCAAGAGGGTGACGGAGTATCAATCTCCATC
ACATCTGGCTTCAAAATTGAAGGACTGTCCAAGATCTCGACCGGGACCGATCTAGCCAGGTCGGTCTCATATGCTGATGCCAGTTTGGCTAAGGCATCAGCATTGGAGTT
TTCAGATCTTGGGACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAGCTGAAAGAGAAGCTCTTATACACAGCGATTGAGCTGGAATCAGTGAAAATGGAAGCAAATCAAGAGATGATAAACAACAAAGAGAACTTAAAGAATCTGCT
GAATCTTCTTCAGATGGCATACAAAGAACGAGATGAAGCAAAGGACCAGCTGCATAAGCTGCTCAACAAATTCATGTTGCCATCCAATTTCCAAGCAGAGAGCCCTGTTG
TCAAAGCAAATTCAAGCATTACAGAATCTAGCAGCCTCTCCGGCTCCCCCGTCGTCGATTCCTTCTTTGACGCGGTTTCGTCGCCCGATTCGGGCAACAATATGGATCCA
AGCGCATTGGTGATTGAGAGCATTGTGAAGGGGAGGAGGCTGCCGGAGAAGGGGAGGCTGCTGCAGTCTGTGATGGAGGCAGGGCCTTTACTGCAGACGCTTCTCGTCGC
CGGGCCGCTCCCTCGGTGGCGCAATCCTCCGCCGCTGCAGCCCTTAAACATCCCACCAGTTCTCATCAATGGACACGACATGCCGGATGCTGAACAGAAACCAGCAGTGA
TGTCTCGAAGGTCAGTCATTTTCGCTTTACTTCTTTTTCCAAATATGGTGGTTTTCTTGTCTTCCTCATCCAGTAGTGATAGCATAGGCAGTGCGGGTCAGACCATAAGT
AGTTCGCCCCCCAAACCAAGCGATTCTGGGGAGGGCTTAGCTCGTAGGTTGGAGTCCGAGCTGGAAGAAATAGAGAACTTTAGGTTCTCAGACGACGGGGAGGAGAGTGA
CACTTCCACCTCGGGCCAGGGTTTGGAATACCCTTCTAGAATGCCTGAACATTATCTTGGACCCCTCCGTAGGGGACTTCCTCTCCATCCTTTTGCTCAAGAGTTCCACA
ACCGAACTGGTTTGGCGCCGACACAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCCTTAGCCATTCTTTTTTGGTTGCAAGCTCGGGACGAGGACGAGGCCGAGCTA
CTAAGCGTTAACCAACTTTTTGGGTGCTTTGAGGCCAAGAGGATAGCTAAAAAACCAGGTCGGTATTATATGTGCTCGAGGAAGGGCGCGGGTGGCATAGTCAAGGGGCC
GACCTCCATCAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTCAGGTCGTCCCTTCTTTGACGTGCCTGTTAGGTTTGGGAACC
TAGTATCGATCAAGCCGATTCCCGAGCTCGCTCAAGCCACCTTCGACACCCTCAAACACTACAAGGACCACTTCCCAAGGGGCCAGAAGATCGGAACCTTGGTCACTGAC
AAGCTGTTCCTAGAGTCGGAGCTGCTGGACTACAACTCTTTAGTTCGTCCAATTGAAGCTTCGAGGCCAAACTCCGAGCTCGCAATGGTGTGCGGATTCACTGGCAGCGT
GAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAGACCGTGGTGGGTACGGAACCGGCGACGCCTATCGTGCCGCGGAATGAGGCTCATGGTAACTCTGGGCCATCTT
CTGAAGTCCCCACCCCCGTGATCGAGCTAGATTTATCTGGGGATCGATATGGGGAGAAGCGCCCAAGGATTGGGTCCATCCAAGAGGGTGACGGAGTATCAATCTCCATC
ACATCTGGCTTCAAAATTGAAGGACTGTCCAAGATCTCGACCGGGACCGATCTAGCCAGGTCGGTCTCATATGCTGATGCCAGTTTGGCTAAGGCATCAGCATTGGAGTT
TTCAGATCTTGGGACTTGA
Protein sequenceShow/hide protein sequence
MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVSSPDSGNNMDP
SALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPAVMSRRSVIFALLLFPNMVVFLSSSSSSDSIGSAGQTIS
SSPPKPSDSGEGLARRLESELEEIENFRFSDDGEESDTSTSGQGLEYPSRMPEHYLGPLRRGLPLHPFAQEFHNRTGLAPTQVAPNGWGVIFALAILFWLQARDEDEAEL
LSVNQLFGCFEAKRIAKKPGRYYMCSRKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGQKIGTLVTD
KLFLESELLDYNSLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPATPIVPRNEAHGNSGPSSEVPTPVIELDLSGDRYGEKRPRIGSIQEGDGVSISI
TSGFKIEGLSKISTGTDLARSVSYADASLAKASALEFSDLGT