; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g15780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g15780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:10150379..10158478
RNA-Seq ExpressionMoc01g15780
SyntenyMoc01g15780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]2.1e-6867.27Show/hide
Query:  KWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTS
        KWF+ASGEWLA DES              V+I+P+PEL QA+FDTLK+YK++FPRGRK+GTLVTDKLL+ESGLLDYNP VRPIE+SRPNSELAMVCGF S
Subjt:  KWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTS

Query:  SVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPL-REVREGSPLKRRKKKKKTTSSSEVGPR
        +VKRKSKG+AHAL+  QSS P TPAV        VGP+S  P  VIEL+S+   SREKR R ++EA+DVSPL  EVRE  PLKRR+KKKKTTS  EVG R
Subjt:  SVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPL-REVREGSPLKRRKKKKKTTSSSEVGPR

Query:  GPLPSNHADLVDDPEARMGG
        G LP++ AD VDDPEARMGG
Subjt:  GPLPSNHADLVDDPEARMGG

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]3.5e-11153.66Show/hide
Query:  VSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPSTPAVDQN
        +SIKPIPEL QATFDTLKFYKDNFPRGRKIGTLVTDKLL+ESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALK VQSSDP TPAVDQN
Subjt:  VSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPSTPAVDQN

Query:  AAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSNHADLVDDPEARMGGDIRREDAVQN
        AAQDQ GPSSAAPT VIELDSTGERSREKRSRSESEALDVSPLREVR                                                     
Subjt:  AAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSNHADLVDDPEARMGGDIRREDAVQN

Query:  GTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREALTAKERENSSAALEATTTLKGELLKAQSEVDIL
                                                                                                            
Subjt:  GTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREALTAKERENSSAALEATTTLKGELLKAQSEVDIL

Query:  RAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLPT----ELSLKQ-----------------
            EAK ELLKREDE HKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALE KDAAIGRLNAELKAEKERL      E + +Q                 
Subjt:  RAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLPT----ELSLKQ-----------------

Query:  -------------------------------PSGNTQILMGPASLVDKYVRDLDSDYSDLDEDEIPSQEPTEVGTTQEGVPSQQNGSQEVNLLGSQGELS
                                       P+G +    GPASLVDKYVRDLDSDYSDLDEDE+PSQEPTEVGTTQEGVPSQQ+GSQEVNLLGSQGELS
Subjt:  -------------------------------PSGNTQILMGPASLVDKYVRDLDSDYSDLDEDEIPSQEPTEVGTTQEGVPSQQNGSQEVNLLGSQGELS

Query:  SHLGS
        SHLGS
Subjt:  SHLGS

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]9.5e-8565.79Show/hide
Query:  EYPSRMPEHYLGP---LRRRLALAQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KW
        EY  R+P H        R  LA AQVAPN WGVIFALAILFWLRARD +EAELL VDQLL CFEAKRIAKKPG                         KW
Subjt:  EYPSRMPEHYLGP---LRRRLALAQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KW

Query:  FFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSV
        F+ASGEWLA DESGR+FFDVPTRFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LL+ESGLLDYNP VRPIE SRPNS LAMVC F S V
Subjt:  FFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSV

Query:  KRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD
        KRKSKGRAHAL+  QSS P TPAV        VGP+S  P  VIEL+S+G  SREKR R ++EA+D
Subjt:  KRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.6e-10361.03Show/hide
Query:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRRLAL-----------------------------------------------
        LARRLES+LEEIEN R SDDGEDSD STSGQGLEYPSR+PEHYLG LRR  A+                                               
Subjt:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRRLAL-----------------------------------------------

Query:  ------AQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KWFFASGEWLANDESGRAF
              AQVAPN WGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPG                         KWF+ASGEWLA DESGR+F
Subjt:  ------AQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KWFFASGEWLANDESGRAF

Query:  FDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSS
        FDVPTRFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LL+ESGLLDYNP VRPIE+SRPNSELAMVCGF S VKRKSKGRAHAL+  QSS
Subjt:  FDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSS

Query:  DPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD
         P+TPAV        VGP+S  P LVIEL+S+G  SREKR R ++EA+D
Subjt:  DPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.5e-15163.5Show/hide
Query:  GKWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFT
        GKWFFASGEWLA DESGRAFFDVPTRFGNLVSIK IPEL QATFDTLK YKD+FPR RKI TLVTDKLL+ESGLLDYNPLVR IEASRPNSELAMVCGFT
Subjt:  GKWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFT

Query:  SSVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPR
         SVKRKSKGRAHALKTV  ++P TP V +  AQ   GPSSA PT VIELD +G RS EKRSR ESEALDVSPL EVR  SPL+RR+KKKKT+SSSE G R
Subjt:  SSVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPR

Query:  GPLPSNHADLVDDPEARMGG--DIRREDAVQNGTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREAL
        G LP++HADLVDDPEARM G  ++R    ++  +  ++ +   ++   CL   L + S  V +   V   T        +  +IH AVM+KAELDGREAL
Subjt:  GPLPSNHADLVDDPEARMGG--DIRREDAVQNGTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREAL

Query:  TAKERENSSAALEATTTLKGELLKAQSEVDILRAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEK
         AKERENS AALEA TTLKGELLKAQ EVDILRAEV+AKV+LLK+E E HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+IGRL  ELK  K
Subjt:  TAKERENSSAALEATTTLKGELLKAQSEVDILRAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEK

Query:  ERLPT----ELSLKQ------------PSGNTQILMG--------------------------------PASLVDKYVRDLDSDYSDLDEDEIPSQEPTE
        ERL      E S +Q             +G   ++ G                                P SLVDKYVR+LDSDYSD++E++ PSQEP E
Subjt:  ERLPT----ELSLKQ------------PSGNTQILMG--------------------------------PASLVDKYVRDLDSDYSDLDEDEIPSQEPTE

Query:  VGTTQEGVPSQQNGS
        VGTTQE VPSQQ GS
Subjt:  VGTTQEGVPSQQNGS

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.0e-6867.27Show/hide
Query:  KWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTS
        KWF+ASGEWLA DES              V+I+P+PEL QA+FDTLK+YK++FPRGRK+GTLVTDKLL+ESGLLDYNP VRPIE+SRPNSELAMVCGF S
Subjt:  KWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTS

Query:  SVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPL-REVREGSPLKRRKKKKKTTSSSEVGPR
        +VKRKSKG+AHAL+  QSS P TPAV        VGP+S  P  VIEL+S+   SREKR R ++EA+DVSPL  EVRE  PLKRR+KKKKTTS  EVG R
Subjt:  SVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPL-REVREGSPLKRRKKKKKTTSSSEVGPR

Query:  GPLPSNHADLVDDPEARMGG
        G LP++ AD VDDPEARMGG
Subjt:  GPLPSNHADLVDDPEARMGG

A0A6J1CLV1 uncharacterized protein LOC1110124671.7e-11153.66Show/hide
Query:  VSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPSTPAVDQN
        +SIKPIPEL QATFDTLKFYKDNFPRGRKIGTLVTDKLL+ESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALK VQSSDP TPAVDQN
Subjt:  VSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPSTPAVDQN

Query:  AAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSNHADLVDDPEARMGGDIRREDAVQN
        AAQDQ GPSSAAPT VIELDSTGERSREKRSRSESEALDVSPLREVR                                                     
Subjt:  AAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSNHADLVDDPEARMGGDIRREDAVQN

Query:  GTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREALTAKERENSSAALEATTTLKGELLKAQSEVDIL
                                                                                                            
Subjt:  GTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREALTAKERENSSAALEATTTLKGELLKAQSEVDIL

Query:  RAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLPT----ELSLKQ-----------------
            EAK ELLKREDE HKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALE KDAAIGRLNAELKAEKERL      E + +Q                 
Subjt:  RAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLPT----ELSLKQ-----------------

Query:  -------------------------------PSGNTQILMGPASLVDKYVRDLDSDYSDLDEDEIPSQEPTEVGTTQEGVPSQQNGSQEVNLLGSQGELS
                                       P+G +    GPASLVDKYVRDLDSDYSDLDEDE+PSQEPTEVGTTQEGVPSQQ+GSQEVNLLGSQGELS
Subjt:  -------------------------------PSGNTQILMGPASLVDKYVRDLDSDYSDLDEDEIPSQEPTEVGTTQEGVPSQQNGSQEVNLLGSQGELS

Query:  SHLGS
        SHLGS
Subjt:  SHLGS

A0A6J1CR42 uncharacterized protein LOC1110138264.6e-8565.79Show/hide
Query:  EYPSRMPEHYLGP---LRRRLALAQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KW
        EY  R+P H        R  LA AQVAPN WGVIFALAILFWLRARD +EAELL VDQLL CFEAKRIAKKPG                         KW
Subjt:  EYPSRMPEHYLGP---LRRRLALAQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KW

Query:  FFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSV
        F+ASGEWLA DESGR+FFDVPTRFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LL+ESGLLDYNP VRPIE SRPNS LAMVC F S V
Subjt:  FFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSV

Query:  KRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD
        KRKSKGRAHAL+  QSS P TPAV        VGP+S  P  VIEL+S+G  SREKR R ++EA+D
Subjt:  KRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD

A0A6J1DXS5 uncharacterized protein LOC1110255027.5e-10461.03Show/hide
Query:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRRLAL-----------------------------------------------
        LARRLES+LEEIEN R SDDGEDSD STSGQGLEYPSR+PEHYLG LRR  A+                                               
Subjt:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRRLAL-----------------------------------------------

Query:  ------AQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KWFFASGEWLANDESGRAF
              AQVAPN WGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPG                         KWF+ASGEWLA DESGR+F
Subjt:  ------AQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPG-------------------------KWFFASGEWLANDESGRAF

Query:  FDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSS
        FDVPTRFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LL+ESGLLDYNP VRPIE+SRPNSELAMVCGF S VKRKSKGRAHAL+  QSS
Subjt:  FDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSS

Query:  DPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD
         P+TPAV        VGP+S  P LVIEL+S+G  SREKR R ++EA+D
Subjt:  DPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256657.4e-15263.5Show/hide
Query:  GKWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFT
        GKWFFASGEWLA DESGRAFFDVPTRFGNLVSIK IPEL QATFDTLK YKD+FPR RKI TLVTDKLL+ESGLLDYNPLVR IEASRPNSELAMVCGFT
Subjt:  GKWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFPRGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFT

Query:  SSVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPR
         SVKRKSKGRAHALKTV  ++P TP V +  AQ   GPSSA PT VIELD +G RS EKRSR ESEALDVSPL EVR  SPL+RR+KKKKT+SSSE G R
Subjt:  SSVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPR

Query:  GPLPSNHADLVDDPEARMGG--DIRREDAVQNGTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREAL
        G LP++HADLVDDPEARM G  ++R    ++  +  ++ +   ++   CL   L + S  V +   V   T        +  +IH AVM+KAELDGREAL
Subjt:  GPLPSNHADLVDDPEARMGG--DIRREDAVQNGTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRRRDVHCTIHSAVMIKAELDGREAL

Query:  TAKERENSSAALEATTTLKGELLKAQSEVDILRAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEK
         AKERENS AALEA TTLKGELLKAQ EVDILRAEV+AKV+LLK+E E HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+IGRL  ELK  K
Subjt:  TAKERENSSAALEATTTLKGELLKAQSEVDILRAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEK

Query:  ERLPT----ELSLKQ------------PSGNTQILMG--------------------------------PASLVDKYVRDLDSDYSDLDEDEIPSQEPTE
        ERL      E S +Q             +G   ++ G                                P SLVDKYVR+LDSDYSD++E++ PSQEP E
Subjt:  ERLPT----ELSLKQ------------PSGNTQILMG--------------------------------PASLVDKYVRDLDSDYSDLDEDEIPSQEPTE

Query:  VGTTQEGVPSQQNGS
        VGTTQE VPSQQ GS
Subjt:  VGTTQEGVPSQQNGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACGACAAGAATTTGCACAACGGTTCTTCAGGAATCAAGCTCGAACCCGGTCTCCGGTTCTGACTTGAACACTAGAGTGGACCTGCACAAGAGGGTGACGGATCC
GACAGCACACACGACCGGCGGTTACATGTCTTTTCTCATGTCGGACCTATCGGGTTCCGAGCAGGTCGATCCCCAGTTCGATCTCGACCTGGCAGAGAAGTTTATTCGAA
TATATTTTGGACACGTGGCGACTTTTCATTTGCAGAGGGAATATGACCGTTGCGGAAGACGTTTCGACCTGCCAGGTTGTCGGAGCACTCAAGTGTTTTGCCGTTGCGTA
TCTCGAGAAGATCCTAGCCGCTCGTTGATTACACGTGTACGGCGCAGAGGTCAGTCACTTTTCCTTGCTCTTACTATTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCC
CTCCAGTAGTGATAGCCTGGGTAGTGTAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCTAAGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGCTGGAAG
AAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGCGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTTGGACCCCTT
CGTAGGAGACTGGCTCTTGCTCAAGTGGCCCCCAATAGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGCCGAGCTGCT
AAGTGTTGACCAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAACCAGGTAAGTGGTTCTTTGCCTCTGGAGAGTGGCTGGCAAATGACGAATCAGGTCGTG
CCTTTTTTGATGTTCCCACTAGGTTTGGGAACCTAGTATCGATCAAACCGATTCCCGAGCTCGGTCAAGCCACTTTTGACACCCTCAAATTCTACAAGGACAACTTCCCA
AGGGGTCGGAAGATCGGGACCTTGGTCACCGACAAACTGCTGGTAGAATCAGGGCTATTGGACTACAATCCTTTAGTTCGCCCGATTGAAGCTTCGAGGCCAAACTCCGA
GCTTGCCATGGTGTGTGGATTCACGAGCAGCGTGAAACGGAAGTCTAAGGGCCGTGCTCACGCCCTTAAGACAGTTCAGAGCTCTGATCCATCTACTCCTGCTGTGGATC
AGAATGCAGCTCAGGACCAGGTGGGTCCATCTTCTGCAGCTCCAACTCTGGTGATTGAGTTGGATTCTACTGGGGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCT
GAAGCCTTGGACGTGTCACCTCTTCGTGAGGTGAGAGAGGGCTCTCCTCTGAAGAGGAGGAAGAAAAAGAAGAAGACAACCTCCTCCTCGGAGGTTGGACCTCGTGGCCC
CCTGCCCTCAAACCACGCCGATCTGGTAGATGACCCGGAAGCTCGGATGGGGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGGAACCGTCGAGCTCCGGGGTGAAA
GACCAGGTGTCACGCATCTCGGCTGCCTGCTTGGATCACTGTCTCAGGAGAGCATCCAAGTTTGTGAGCGACCCAGGGTCCGTGCTACAACGGACTATCGACCACGCCGT
CGAGACGTTCACTGCACCATCCATTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGACAGCGAAGGAGAGGGAGAACTCCTCTGCTGCCTTGGAGGC
TACCACTACGCTCAAGGGCGAGCTGCTGAAGGCTCAGAGCGAGGTGGATATACTGAGGGCCGAGGTTGAAGCCAAGGTCGAGCTGCTGAAGAGGGAGGATGAGAGCCATA
AGGCCCACCTCCGAGCTGCCCACGCCATCACAAAAGGGTTGGAGAAGGAAAAGTTTCAACTCCTTAAGGAGAAGGACGACATGCTCCAGGCCCTTGAAGGGAAGGACGCT
GCAATTGGGCGTCTCAATGCTGAGCTGAAGGCGGAGAAGGAGCGCTTACCAACGGAGCTCTCCTTGAAGCAGCCTTCAGGCAACACCCAGATTTTGATGGGTCCTGCATC
CCTGGTGGACAAATATGTCAGAGATCTAGACTCTGACTACTCCGACCTGGATGAAGACGAGATCCCAAGTCAGGAACCTACTGAGGTCGGCACCACTCAAGAAGGAGTCC
CTTCTCAGCAGAACGGATCTCAGGAGGTCAACCTTCTAGGCTCTCAAGGCGAGCTATCTTCTCATCTCGGGAGCGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACGACAAGAATTTGCACAACGGTTCTTCAGGAATCAAGCTCGAACCCGGTCTCCGGTTCTGACTTGAACACTAGAGTGGACCTGCACAAGAGGGTGACGGATCC
GACAGCACACACGACCGGCGGTTACATGTCTTTTCTCATGTCGGACCTATCGGGTTCCGAGCAGGTCGATCCCCAGTTCGATCTCGACCTGGCAGAGAAGTTTATTCGAA
TATATTTTGGACACGTGGCGACTTTTCATTTGCAGAGGGAATATGACCGTTGCGGAAGACGTTTCGACCTGCCAGGTTGTCGGAGCACTCAAGTGTTTTGCCGTTGCGTA
TCTCGAGAAGATCCTAGCCGCTCGTTGATTACACGTGTACGGCGCAGAGGTCAGTCACTTTTCCTTGCTCTTACTATTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCC
CTCCAGTAGTGATAGCCTGGGTAGTGTAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCTAAGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGCTGGAAG
AAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGCGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTTGGACCCCTT
CGTAGGAGACTGGCTCTTGCTCAAGTGGCCCCCAATAGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGCCGAGCTGCT
AAGTGTTGACCAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAACCAGGTAAGTGGTTCTTTGCCTCTGGAGAGTGGCTGGCAAATGACGAATCAGGTCGTG
CCTTTTTTGATGTTCCCACTAGGTTTGGGAACCTAGTATCGATCAAACCGATTCCCGAGCTCGGTCAAGCCACTTTTGACACCCTCAAATTCTACAAGGACAACTTCCCA
AGGGGTCGGAAGATCGGGACCTTGGTCACCGACAAACTGCTGGTAGAATCAGGGCTATTGGACTACAATCCTTTAGTTCGCCCGATTGAAGCTTCGAGGCCAAACTCCGA
GCTTGCCATGGTGTGTGGATTCACGAGCAGCGTGAAACGGAAGTCTAAGGGCCGTGCTCACGCCCTTAAGACAGTTCAGAGCTCTGATCCATCTACTCCTGCTGTGGATC
AGAATGCAGCTCAGGACCAGGTGGGTCCATCTTCTGCAGCTCCAACTCTGGTGATTGAGTTGGATTCTACTGGGGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCT
GAAGCCTTGGACGTGTCACCTCTTCGTGAGGTGAGAGAGGGCTCTCCTCTGAAGAGGAGGAAGAAAAAGAAGAAGACAACCTCCTCCTCGGAGGTTGGACCTCGTGGCCC
CCTGCCCTCAAACCACGCCGATCTGGTAGATGACCCGGAAGCTCGGATGGGGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGGAACCGTCGAGCTCCGGGGTGAAA
GACCAGGTGTCACGCATCTCGGCTGCCTGCTTGGATCACTGTCTCAGGAGAGCATCCAAGTTTGTGAGCGACCCAGGGTCCGTGCTACAACGGACTATCGACCACGCCGT
CGAGACGTTCACTGCACCATCCATTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGACAGCGAAGGAGAGGGAGAACTCCTCTGCTGCCTTGGAGGC
TACCACTACGCTCAAGGGCGAGCTGCTGAAGGCTCAGAGCGAGGTGGATATACTGAGGGCCGAGGTTGAAGCCAAGGTCGAGCTGCTGAAGAGGGAGGATGAGAGCCATA
AGGCCCACCTCCGAGCTGCCCACGCCATCACAAAAGGGTTGGAGAAGGAAAAGTTTCAACTCCTTAAGGAGAAGGACGACATGCTCCAGGCCCTTGAAGGGAAGGACGCT
GCAATTGGGCGTCTCAATGCTGAGCTGAAGGCGGAGAAGGAGCGCTTACCAACGGAGCTCTCCTTGAAGCAGCCTTCAGGCAACACCCAGATTTTGATGGGTCCTGCATC
CCTGGTGGACAAATATGTCAGAGATCTAGACTCTGACTACTCCGACCTGGATGAAGACGAGATCCCAAGTCAGGAACCTACTGAGGTCGGCACCACTCAAGAAGGAGTCC
CTTCTCAGCAGAACGGATCTCAGGAGGTCAACCTTCTAGGCTCTCAAGGCGAGCTATCTTCTCATCTCGGGAGCGGCTGA
Protein sequenceShow/hide protein sequence
METTRICTTVLQESSSNPVSGSDLNTRVDLHKRVTDPTAHTTGGYMSFLMSDLSGSEQVDPQFDLDLAEKFIRIYFGHVATFHLQREYDRCGRRFDLPGCRSTQVFCRCV
SREDPSRSLITRVRRRGQSLFLALTILSNMVVFLSSPSSSDSLGSVGRTISSSPPKLSDSGEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPL
RRRLALAQVAPNRWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGKWFFASGEWLANDESGRAFFDVPTRFGNLVSIKPIPELGQATFDTLKFYKDNFP
RGRKIGTLVTDKLLVESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPSTPAVDQNAAQDQVGPSSAAPTLVIELDSTGERSREKRSRSES
EALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSNHADLVDDPEARMGGDIRREDAVQNGTVELRGERPGVTHLGCLLGSLSQESIQVCERPRVRATTDYRPRR
RDVHCTIHSAVMIKAELDGREALTAKERENSSAALEATTTLKGELLKAQSEVDILRAEVEAKVELLKREDESHKAHLRAAHAITKGLEKEKFQLLKEKDDMLQALEGKDA
AIGRLNAELKAEKERLPTELSLKQPSGNTQILMGPASLVDKYVRDLDSDYSDLDEDEIPSQEPTEVGTTQEGVPSQQNGSQEVNLLGSQGELSSHLGSG