; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g26890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g26890
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:20287818..20297588
RNA-Seq ExpressionMoc06g26890
SyntenyMoc06g26890
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144033.1 uncharacterized protein LOC111013825 [Momordica charantia]1.3e-5137.47Show/hide
Query:  LTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQ
        +T  G +TREEFD +K +FD QVEALKA  ++K + FDDG++GESPFT D+LEAPIP KFKTP M PYDGSKDPK+YVE+ E LM+FQAA D IKC AFQ
Subjt:  LTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQ

Query:  IALTG--------------EAKKKGNL---------------------------------TLSLRTKDYLP-----------------------------
        IALTG              + KK G L                                 TL+++ ++  P                             
Subjt:  IALTG--------------EAKKKGNL---------------------------------TLSLRTKDYLP-----------------------------

Query:  ----------------------------------------------------RND--LIIGDPTWTPTGGGL----------------------------
                                                            R+D   +I      P+GG                              
Subjt:  ----------------------------------------------------RND--LIIGDPTWTPTGGGL----------------------------

Query:  ---------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVI
                             + DHV VRRVLVDGGASANILSL TYL LGWTR QLKK+PTPL GF+GE     GCI+L V IGQ DTQ TQMAEFVVI
Subjt:  ---------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVI

Query:  DGRSAHNAIFG
         GRSA+ AIFG
Subjt:  DGRSAHNAIFG

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.7e-4132.22Show/hide
Query:  TRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQI
        T  G +TREEFD ++ + + QVEALKA  ++KE P +DG++GESPFT DVLEA        P +K YDGSKDPK+YVE+FEGLMDFQAASDAIKCRAFQI
Subjt:  TRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQI

Query:  ALTGEAK------------------------------------KKGNLTLS---------------LRTKDYLPR-------------------------
        ALTG A+                                    K+   T +               LRTK   P                          
Subjt:  ALTGEAK------------------------------------KKGNLTLS---------------LRTKDYLPR-------------------------

Query:  ------------------------------------------------------------------------------------NDLI--------IGDP
                                                                                             DLI        +G P
Subjt:  ------------------------------------------------------------------------------------NDLI--------IGDP

Query:  TWT------------------------------PTGG-------------------------------------------------GLMKDHVQVRRVLV
          +                              P+GG                                                   + DHV VRRVLV
Subjt:  TWT------------------------------PTGG-------------------------------------------------GLMKDHVQVRRVLV

Query:  DGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG
        D G SANI+SL TYLALGWTR QLKK+ TPL GF+ E V  EGCIDLPVT+G   TQVTQMAEFVVIDGRSA+NAIFG
Subjt:  DGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.1e-4729.06Show/hide
Query:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR
        MVQP NSTNT +RR + A++G QR++ A +VE Q           R AR      PP HP+PSKA                                   
Subjt:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR

Query:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLD
              E  Y  +T                                        G +TREEFD +K +FD QVEALKA  ++KE+ FDDG++GE  F+ D
Subjt:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLD

Query:  VLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK------------------------------------------
        +LEA IP KFKTP MKPYDGSKDPK+YVE+FE LMDFQAA+DAIKC AFQIALTG A+                                          
Subjt:  VLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK------------------------------------------

Query:  -KKG------------------------------------NLTLSLRTK---------------------------------------------------
         K+G                                     LT+ LR +                                                   
Subjt:  -KKG------------------------------------NLTLSLRTK---------------------------------------------------

Query:  ----------DY--------------------------------------LPRNDLIIGDPT---------------------W----------------
                  DY                                      L R + + GDP                      W                
Subjt:  ----------DY--------------------------------------LPRNDLIIGDPT---------------------W----------------

Query:  ----TPTGGGLMK----------------------------------------------------------------------DHVQVRRVLVDGGASAN
             P    + K                                                                      D V VRR+LVDGGASAN
Subjt:  ----TPTGGGLMK----------------------------------------------------------------------DHVQVRRVLVDGGASAN

Query:  ILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG
        ILSL+TYLALGWTR QLKK+PTPL GF+GE ++ EGCIDLPV+I Q DTQVTQMAEFVVIDGRSA+NAIFG
Subjt:  ILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]2.5e-5538.22Show/hide
Query:  LTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGE
        + REEFDLMK RFDEQVEALKA  ++KE+PFDD ++GESPFT D++EAPIP KFKTP MKPYDGSKDPK+YVE+FEGLMDFQAA+DAIKC AFQIALTG 
Subjt:  LTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGE

Query:  A------------------------------------------KKKGNLTLS---------------------------LRTKDYLPR------------
        A                                          ++K + TL+                           LRTK   P             
Subjt:  A------------------------------------------KKKGNLTLS---------------------------LRTKDYLPR------------

Query:  --------------------------------------------NDLI--------IGDPTWT------------------------------PTGGGL-
                                                     DLI        +G P                                 P+GG   
Subjt:  --------------------------------------------NDLI--------IGDPTWT------------------------------PTGGGL-

Query:  ------------------------------------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVT
                                                        + DHV VRRVLVDGGASANILSL TYLAL  TR QLKK+PTPL GF+ E V+
Subjt:  ------------------------------------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVT

Query:  SEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIF
         EGCIDLPVTIGQ  TQVTQMAEFVVIDGR A+NAIF
Subjt:  SEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIF

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]7.8e-5749.64Show/hide
Query:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR
        MVQPV+STNT +RR + A++G QR++ A +VE Q+  G       R AR    +  P HP+P KANRGRGG S++T+  AAP    E F  LQ+E++ MR
Subjt:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR

Query:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRS--------------------GGLTREEFDLMKQRFDEQVEALKANY
         ++ TMEEMY EM +A   GS S +    D   D  D      +    +G + S                    G +TREEFD +K +FD QVE LKA  
Subjt:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRS--------------------GGLTREEFDLMKQRFDEQVEALKANY

Query:  KRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK
        + K + FDDG++GESPFT D+LEA IP KFKTP MKPYDGSKDPK+YVE+FEGLM FQAA+DAIK RAFQIALT  A+
Subjt:  KRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK

TrEMBL top hitse value%identityAlignment
A0A6J1CS66 uncharacterized protein LOC1110138256.2e-5237.47Show/hide
Query:  LTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQ
        +T  G +TREEFD +K +FD QVEALKA  ++K + FDDG++GESPFT D+LEAPIP KFKTP M PYDGSKDPK+YVE+ E LM+FQAA D IKC AFQ
Subjt:  LTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQ

Query:  IALTG--------------EAKKKGNL---------------------------------TLSLRTKDYLP-----------------------------
        IALTG              + KK G L                                 TL+++ ++  P                             
Subjt:  IALTG--------------EAKKKGNL---------------------------------TLSLRTKDYLP-----------------------------

Query:  ----------------------------------------------------RND--LIIGDPTWTPTGGGL----------------------------
                                                            R+D   +I      P+GG                              
Subjt:  ----------------------------------------------------RND--LIIGDPTWTPTGGGL----------------------------

Query:  ---------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVI
                             + DHV VRRVLVDGGASANILSL TYL LGWTR QLKK+PTPL GF+GE     GCI+L V IGQ DTQ TQMAEFVVI
Subjt:  ---------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVI

Query:  DGRSAHNAIFG
         GRSA+ AIFG
Subjt:  DGRSAHNAIFG

A0A6J1D9E1 uncharacterized protein LOC1110188231.3e-4132.22Show/hide
Query:  TRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQI
        T  G +TREEFD ++ + + QVEALKA  ++KE P +DG++GESPFT DVLEA        P +K YDGSKDPK+YVE+FEGLMDFQAASDAIKCRAFQI
Subjt:  TRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQI

Query:  ALTGEAK------------------------------------KKGNLTLS---------------LRTKDYLPR-------------------------
        ALTG A+                                    K+   T +               LRTK   P                          
Subjt:  ALTGEAK------------------------------------KKGNLTLS---------------LRTKDYLPR-------------------------

Query:  ------------------------------------------------------------------------------------NDLI--------IGDP
                                                                                             DLI        +G P
Subjt:  ------------------------------------------------------------------------------------NDLI--------IGDP

Query:  TWT------------------------------PTGG-------------------------------------------------GLMKDHVQVRRVLV
          +                              P+GG                                                   + DHV VRRVLV
Subjt:  TWT------------------------------PTGG-------------------------------------------------GLMKDHVQVRRVLV

Query:  DGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG
        D G SANI+SL TYLALGWTR QLKK+ TPL GF+ E V  EGCIDLPVT+G   TQVTQMAEFVVIDGRSA+NAIFG
Subjt:  DGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG

A0A6J1DHB3 uncharacterized protein LOC1110204795.5e-4829.06Show/hide
Query:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR
        MVQP NSTNT +RR + A++G QR++ A +VE Q           R AR      PP HP+PSKA                                   
Subjt:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR

Query:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLD
              E  Y  +T                                        G +TREEFD +K +FD QVEALKA  ++KE+ FDDG++GE  F+ D
Subjt:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLD

Query:  VLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK------------------------------------------
        +LEA IP KFKTP MKPYDGSKDPK+YVE+FE LMDFQAA+DAIKC AFQIALTG A+                                          
Subjt:  VLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK------------------------------------------

Query:  -KKG------------------------------------NLTLSLRTK---------------------------------------------------
         K+G                                     LT+ LR +                                                   
Subjt:  -KKG------------------------------------NLTLSLRTK---------------------------------------------------

Query:  ----------DY--------------------------------------LPRNDLIIGDPT---------------------W----------------
                  DY                                      L R + + GDP                      W                
Subjt:  ----------DY--------------------------------------LPRNDLIIGDPT---------------------W----------------

Query:  ----TPTGGGLMK----------------------------------------------------------------------DHVQVRRVLVDGGASAN
             P    + K                                                                      D V VRR+LVDGGASAN
Subjt:  ----TPTGGGLMK----------------------------------------------------------------------DHVQVRRVLVDGGASAN

Query:  ILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG
        ILSL+TYLALGWTR QLKK+PTPL GF+GE ++ EGCIDLPV+I Q DTQVTQMAEFVVIDGRSA+NAIFG
Subjt:  ILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG

A0A6J1DPC9 uncharacterized protein LOC1110222801.2e-5538.22Show/hide
Query:  LTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGE
        + REEFDLMK RFDEQVEALKA  ++KE+PFDD ++GESPFT D++EAPIP KFKTP MKPYDGSKDPK+YVE+FEGLMDFQAA+DAIKC AFQIALTG 
Subjt:  LTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGE

Query:  A------------------------------------------KKKGNLTLS---------------------------LRTKDYLPR------------
        A                                          ++K + TL+                           LRTK   P             
Subjt:  A------------------------------------------KKKGNLTLS---------------------------LRTKDYLPR------------

Query:  --------------------------------------------NDLI--------IGDPTWT------------------------------PTGGGL-
                                                     DLI        +G P                                 P+GG   
Subjt:  --------------------------------------------NDLI--------IGDPTWT------------------------------PTGGGL-

Query:  ------------------------------------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVT
                                                        + DHV VRRVLVDGGASANILSL TYLAL  TR QLKK+PTPL GF+ E V+
Subjt:  ------------------------------------------------MKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLGGFAGEYVT

Query:  SEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIF
         EGCIDLPVTIGQ  TQVTQMAEFVVIDGR A+NAIF
Subjt:  SEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIF

A0A6J1DZJ1 uncharacterized protein LOC1110257383.8e-5749.64Show/hide
Query:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR
        MVQPV+STNT +RR + A++G QR++ A +VE Q+  G       R AR    +  P HP+P KANRGRGG S++T+  AAP    E F  LQ+E++ MR
Subjt:  MVQPVNSTNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMR

Query:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRS--------------------GGLTREEFDLMKQRFDEQVEALKANY
         ++ TMEEMY EM +A   GS S +    D   D  D      +    +G + S                    G +TREEFD +K +FD QVE LKA  
Subjt:  NRVRTMEEMYTEMTRANRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRS--------------------GGLTREEFDLMKQRFDEQVEALKANY

Query:  KRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK
        + K + FDDG++GESPFT D+LEA IP KFKTP MKPYDGSKDPK+YVE+FEGLM FQAA+DAIK RAFQIALT  A+
Subjt:  KRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKEYVEIFEGLMDFQAASDAIKCRAFQIALTGEAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCGGCATTGCGCGCGGCCAGTTCTAGCGAAAACAGTGCGGCGGCGCGTCTTGCAGCAGCTGAGGCTTGGGGTCGGCAGCAATAGGACGTTCGGCAGTGACCCACT
CCTTTTTGGACCCGAATCAACGTTACCCATCGCTTTTGACATTGGAACAGCAAGCATAAGGACATTCGAGCTCAGTTTTGGATGCCCGAACCTCCTCGGCGTCGACCCAC
TCCTTACCCAAGAGGTTTTATGGAAAATATCTTGTGTCATGATTATTGGATGTGCCTTGAAAGTTGCTGCTCGTGGTTGTGTCGTGATGGGCGTTGTGGTGGTTGCTGTT
ATTAGGGAAGTAGTACAGAAATCCTGGTTGGAGGCTAGAAATGATATGTGGAGTTATGAAAGTTACGAAGTGAATGGGGTCGATGCAGAGCTCAAGGCCTTGGGTATAAA
TGGTCAAGGGCCGATAGATGGTGAAGTCATCAGGGCCTCGGGTATAAATGGTCGAGGGCCGATGGAGCAGTGTCGGGGCTCTAGCATGACATGTTGTTCCTTGATAAGTA
TGCCGTTTGTGCGTGACCCTTCAGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTCTTTCCCCTCCAGTTTGCAGTGGATGGTTTATAGTTAGTAAG
GTTTCCTTGAAGGGTTTGGTTAATGTGAAGTCGGTTGAAGAACTTCATCAAGCTGCATTGCCAAAGAACTTCATCAAGTTTCAAGCGAATGTTGATGCTGAATTCTGTCG
AGCTGTCGAAGATCAGCTTCTGACAAACTTGAACGAGCTCCTCCTCGGAACCTTTCGAGGAGGGAAGCTCATTACTTCTAATCGTATTACCTCTAAAATATCCGAGCTCG
ACCCACTAGAATCTGACATATCCGACCTCAAAACCGACCAAGGTCTCCCCAGTCTTTCAGGCCATCCCAATAACCAAAAAAGTGGTCCAAGAATGGTGCAGCCAGTGAAC
TCTACCAATACAATAGAACGGAGAGAGGTGAACGCTGACAATGGCACTCAGCGTGACCTCGACGCTAGAATAGTCGAGGATCAAGTCCGAGCCGGGCAAGAGGAAGGTCT
GTCGCGGAGATTTGCTCGCCATGCGAATCAGGAACGACCTCCCGTTCACCCTAGACCTTCAAAGGCCAACCGAGGACGAGGTGGGACCTCAAAAAAGACCTCCCAAAGGG
CTGCCCCAACCATAGACCTCGAAGTCTTTATTACCCTCCAACGGGAGTTAGATGACATGCGCAATCGAGTGCGCACTATGGAGGAGATGTATACCGAGATGACACGGGCT
AACCGAATAGGATCTCCTTCCAGAAACCCAGGCGGGGACGACATGCATGAGGATGGGGAAGATCAAGATCCACTCCTCCACACTGACGATCAAGATGAAGGCCTCACGAG
GTCTGGGGGCCTCACCAGGGAAGAGTTCGACCTGATGAAGCAAAGGTTCGACGAACAGGTCGAAGCACTTAAGGCTAATTACAAAAGAAAGGAAAACCCGTTCGATGATG
GCGAGATAGGCGAATCGCCATTCACTTTGGATGTCTTGGAGGCCCCTATTCCTAAAAAATTTAAAACGCCCGCGATGAAGCCTTATGACGGGTCTAAAGACCCGAAGGAA
TATGTTGAGATCTTCGAAGGCCTCATGGACTTCCAAGCGGCATCAGATGCCATTAAGTGTCGAGCTTTCCAGATTGCTCTTACAGGGGAGGCCAAGAAAAAGGGAAATCT
GACTCTAAGTCTAAGGACAAAGGATTATCTTCCCAGGAACGATCTGATTATAGGAGATCCGACTTGGACTCCCACTGGAGGGGGCCTTATGAAAGACCACGTCCAGGTTA
GAAGAGTGCTGGTGGATGGGGGCGCCTCTGCCAACATTCTGTCTCTCACTACTTACCTAGCGTTAGGGTGGACCAGGGTGCAGTTAAAGAAGAATCCGACCCCTTTAGGG
GGATTTGCGGGTGAATATGTCACCTCGGAGGGCTGCATTGATCTTCCCGTCACCATTGGCCAAGGTGACACTCAAGTCACCCAAATGGCCGAGTTTGTGGTGATAGATGG
TAGATCGGCCCACAATGCTATCTTCGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTCGGCATTGCGCGCGGCCAGTTCTAGCGAAAACAGTGCGGCGGCGCGTCTTGCAGCAGCTGAGGCTTGGGGTCGGCAGCAATAGGACGTTCGGCAGTGACCCACT
CCTTTTTGGACCCGAATCAACGTTACCCATCGCTTTTGACATTGGAACAGCAAGCATAAGGACATTCGAGCTCAGTTTTGGATGCCCGAACCTCCTCGGCGTCGACCCAC
TCCTTACCCAAGAGGTTTTATGGAAAATATCTTGTGTCATGATTATTGGATGTGCCTTGAAAGTTGCTGCTCGTGGTTGTGTCGTGATGGGCGTTGTGGTGGTTGCTGTT
ATTAGGGAAGTAGTACAGAAATCCTGGTTGGAGGCTAGAAATGATATGTGGAGTTATGAAAGTTACGAAGTGAATGGGGTCGATGCAGAGCTCAAGGCCTTGGGTATAAA
TGGTCAAGGGCCGATAGATGGTGAAGTCATCAGGGCCTCGGGTATAAATGGTCGAGGGCCGATGGAGCAGTGTCGGGGCTCTAGCATGACATGTTGTTCCTTGATAAGTA
TGCCGTTTGTGCGTGACCCTTCAGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTCTTTCCCCTCCAGTTTGCAGTGGATGGTTTATAGTTAGTAAG
GTTTCCTTGAAGGGTTTGGTTAATGTGAAGTCGGTTGAAGAACTTCATCAAGCTGCATTGCCAAAGAACTTCATCAAGTTTCAAGCGAATGTTGATGCTGAATTCTGTCG
AGCTGTCGAAGATCAGCTTCTGACAAACTTGAACGAGCTCCTCCTCGGAACCTTTCGAGGAGGGAAGCTCATTACTTCTAATCGTATTACCTCTAAAATATCCGAGCTCG
ACCCACTAGAATCTGACATATCCGACCTCAAAACCGACCAAGGTCTCCCCAGTCTTTCAGGCCATCCCAATAACCAAAAAAGTGGTCCAAGAATGGTGCAGCCAGTGAAC
TCTACCAATACAATAGAACGGAGAGAGGTGAACGCTGACAATGGCACTCAGCGTGACCTCGACGCTAGAATAGTCGAGGATCAAGTCCGAGCCGGGCAAGAGGAAGGTCT
GTCGCGGAGATTTGCTCGCCATGCGAATCAGGAACGACCTCCCGTTCACCCTAGACCTTCAAAGGCCAACCGAGGACGAGGTGGGACCTCAAAAAAGACCTCCCAAAGGG
CTGCCCCAACCATAGACCTCGAAGTCTTTATTACCCTCCAACGGGAGTTAGATGACATGCGCAATCGAGTGCGCACTATGGAGGAGATGTATACCGAGATGACACGGGCT
AACCGAATAGGATCTCCTTCCAGAAACCCAGGCGGGGACGACATGCATGAGGATGGGGAAGATCAAGATCCACTCCTCCACACTGACGATCAAGATGAAGGCCTCACGAG
GTCTGGGGGCCTCACCAGGGAAGAGTTCGACCTGATGAAGCAAAGGTTCGACGAACAGGTCGAAGCACTTAAGGCTAATTACAAAAGAAAGGAAAACCCGTTCGATGATG
GCGAGATAGGCGAATCGCCATTCACTTTGGATGTCTTGGAGGCCCCTATTCCTAAAAAATTTAAAACGCCCGCGATGAAGCCTTATGACGGGTCTAAAGACCCGAAGGAA
TATGTTGAGATCTTCGAAGGCCTCATGGACTTCCAAGCGGCATCAGATGCCATTAAGTGTCGAGCTTTCCAGATTGCTCTTACAGGGGAGGCCAAGAAAAAGGGAAATCT
GACTCTAAGTCTAAGGACAAAGGATTATCTTCCCAGGAACGATCTGATTATAGGAGATCCGACTTGGACTCCCACTGGAGGGGGCCTTATGAAAGACCACGTCCAGGTTA
GAAGAGTGCTGGTGGATGGGGGCGCCTCTGCCAACATTCTGTCTCTCACTACTTACCTAGCGTTAGGGTGGACCAGGGTGCAGTTAAAGAAGAATCCGACCCCTTTAGGG
GGATTTGCGGGTGAATATGTCACCTCGGAGGGCTGCATTGATCTTCCCGTCACCATTGGCCAAGGTGACACTCAAGTCACCCAAATGGCCGAGTTTGTGGTGATAGATGG
TAGATCGGCCCACAATGCTATCTTCGGCTGA
Protein sequenceShow/hide protein sequence
MTRHCARPVLAKTVRRRVLQQLRLGVGSNRTFGSDPLLFGPESTLPIAFDIGTASIRTFELSFGCPNLLGVDPLLTQEVLWKISCVMIIGCALKVAARGCVVMGVVVVAV
IREVVQKSWLEARNDMWSYESYEVNGVDAELKALGINGQGPIDGEVIRASGINGRGPMEQCRGSSMTCCSLISMPFVRDPSDKGCLLSTVVVLIPLLSPPVCSGWFIVSK
VSLKGLVNVKSVEELHQAALPKNFIKFQANVDAEFCRAVEDQLLTNLNELLLGTFRGGKLITSNRITSKISELDPLESDISDLKTDQGLPSLSGHPNNQKSGPRMVQPVN
STNTIERREVNADNGTQRDLDARIVEDQVRAGQEEGLSRRFARHANQERPPVHPRPSKANRGRGGTSKKTSQRAAPTIDLEVFITLQRELDDMRNRVRTMEEMYTEMTRA
NRIGSPSRNPGGDDMHEDGEDQDPLLHTDDQDEGLTRSGGLTREEFDLMKQRFDEQVEALKANYKRKENPFDDGEIGESPFTLDVLEAPIPKKFKTPAMKPYDGSKDPKE
YVEIFEGLMDFQAASDAIKCRAFQIALTGEAKKKGNLTLSLRTKDYLPRNDLIIGDPTWTPTGGGLMKDHVQVRRVLVDGGASANILSLTTYLALGWTRVQLKKNPTPLG
GFAGEYVTSEGCIDLPVTIGQGDTQVTQMAEFVVIDGRSAHNAIFG