; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g19910 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g19910
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein FAR-RED ELONGATED HYPOCOTYL 3-like
Genome locationchr3:13425994..13427259
RNA-Seq ExpressionMoc03g19910
SyntenyMoc03g19910
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153251.1 uncharacterized protein LOC111020787 [Momordica charantia]1.8e-7658.97Show/hide
Query:  MKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDV--------------------
        MK NFEFKVKKSTKTL+TVGCT+QGCKWGLR KSI+ GDSFIISKFNDVHKCKR+VLNHDHRQA+SWVVGQL+KSN+EDV                    
Subjt:  MKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDV--------------------

Query:  ----------------SRQYRPKDIINDMR---KNYGV----------NIRYEKAW--------RAKNVCLNLLMGSPKHSYTLLRKYGEALKDKFKDDA
                        S   RP  +I+      K  GV          N  Y  A+        ++    LN+   S       ++     LKD+FKDDA
Subjt:  ----------------SRQYRPKDIINDMR---KNYGV----------NIRYEKAW--------RAKNVCLNLLMGSPKHSYTLLRKYGEALKDKFKDDA

Query:  MQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF
        MQEMFILA KAC+K EFRYYFSQLAGF +VQRYLE IGFEKWT AFQPGLRYDQMTSNIAESMNAVLVHAR LPVT LLEHARALLQ WF
Subjt:  MQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF

XP_022154930.1 uncharacterized protein LOC111022077 [Momordica charantia]4.4e-9148.46Show/hide
Query:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
        +S NA    V+E      QNI   +  D V +I V GIFRSK++LRFKL VLAMK NFEF+VKKSTKTLY VGC + GCKWGL    IR  DSF ISK+ 
Subjt:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN

Query:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------
        DVH C ++VLNHDHRQA+SWVVGQLLK+NLEDVSR YRPKDI+ DMRK YGVNIRYEKAWRAK V LN+L+GSPK SY  LR+Y EA             
Subjt:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH
                              LKDKFK+D +Q MFILA KA +K  FRYYFSQLAGFP VQRYLEGIGFEKWT AFQP LRYDQMTSN AES+NAVL H
Subjt:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH

Query:  ARYLPVTTLLEHARALLQCWF
        AR LPVT LLE A AL+Q WF
Subjt:  ARYLPVTTLLEHARALLQCWF

XP_022156802.1 uncharacterized protein LOC111023635 [Momordica charantia]1.2e-10155.64Show/hide
Query:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
        +SGNAP Q VEE V   MQNII  N SD++ EIAVKGIFRSKE+LRFKLSVLAMK NF+FKVKKSTKTL+TVGCT+QGCKWGLR KSI+ GDSFIISKFN
Subjt:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN

Query:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEAL------------
        D HKCKR+VLNHDHRQA+SWVVGQL+KSN+EDVSRQYRPKDIINDMR+NYGVNIRYEKAWRAKNV LNLLMG PKHSYTLLRKYGEAL            
Subjt:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEAL------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------KDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGL
                               KDKFKDDAMQE+FILA KACRK EFRYYFSQLAGFP+VQRYLEGIGFEKWT AFQPGL
Subjt:  -----------------------KDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGL

XP_022156943.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Momordica charantia]2.1e-14667.7Show/hide
Query:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
        MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
Subjt:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN

Query:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------
        DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA             
Subjt:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH
                              LKDKFKDDAMQEMFILA KACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH
Subjt:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH

Query:  ARYLPVTTLLEHARALLQCWF
        ARYLPVTTLLEHARALLQCWF
Subjt:  ARYLPVTTLLEHARALLQCWF

XP_022157237.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Momordica charantia]7.2e-7842.36Show/hide
Query:  IANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVV
        I N+  +V +IAV  +FRSK++LRFKL+V A+  NFE+KVKKST  L +V CT+ GCKW LRV+ I+  ++F+IS F++ H C+R  L HDHRQA SWVV
Subjt:  IANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVV

Query:  GQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEALK---------------------------------
        GQL+KSN E+VSR+YRPKDI+NDM+KNYGVN+RYEKA RAK V L LLMGSP+ SY+ L KYGEALK                                 
Subjt:  GQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEALK---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --DKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF
          +KF+++ MQ +F  A KA +  +FRYY+ QLAGFP VQ+YLE IGF+KW  A+QPG+RY+QMTSN+AESMNAVLVHAR LP+T + E+ RALLQ WF
Subjt:  --DKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF

TrEMBL top hitse value%identityAlignment
A0A6J1DK28 uncharacterized protein LOC1110207878.6e-7758.97Show/hide
Query:  MKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDV--------------------
        MK NFEFKVKKSTKTL+TVGCT+QGCKWGLR KSI+ GDSFIISKFNDVHKCKR+VLNHDHRQA+SWVVGQL+KSN+EDV                    
Subjt:  MKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDV--------------------

Query:  ----------------SRQYRPKDIINDMR---KNYGV----------NIRYEKAW--------RAKNVCLNLLMGSPKHSYTLLRKYGEALKDKFKDDA
                        S   RP  +I+      K  GV          N  Y  A+        ++    LN+   S       ++     LKD+FKDDA
Subjt:  ----------------SRQYRPKDIINDMR---KNYGV----------NIRYEKAW--------RAKNVCLNLLMGSPKHSYTLLRKYGEALKDKFKDDA

Query:  MQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF
        MQEMFILA KAC+K EFRYYFSQLAGF +VQRYLE IGFEKWT AFQPGLRYDQMTSNIAESMNAVLVHAR LPVT LLEHARALLQ WF
Subjt:  MQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF

A0A6J1DL12 uncharacterized protein LOC1110220772.1e-9148.46Show/hide
Query:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
        +S NA    V+E      QNI   +  D V +I V GIFRSK++LRFKL VLAMK NFEF+VKKSTKTLY VGC + GCKWGL    IR  DSF ISK+ 
Subjt:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN

Query:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------
        DVH C ++VLNHDHRQA+SWVVGQLLK+NLEDVSR YRPKDI+ DMRK YGVNIRYEKAWRAK V LN+L+GSPK SY  LR+Y EA             
Subjt:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH
                              LKDKFK+D +Q MFILA KA +K  FRYYFSQLAGFP VQRYLEGIGFEKWT AFQP LRYDQMTSN AES+NAVL H
Subjt:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH

Query:  ARYLPVTTLLEHARALLQCWF
        AR LPVT LLE A AL+Q WF
Subjt:  ARYLPVTTLLEHARALLQCWF

A0A6J1DS25 protein FAR-RED ELONGATED HYPOCOTYL 3-like1.0e-14667.7Show/hide
Query:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
        MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
Subjt:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN

Query:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------
        DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA             
Subjt:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEA-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH
                              LKDKFKDDAMQEMFILA KACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH
Subjt:  ----------------------LKDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVH

Query:  ARYLPVTTLLEHARALLQCWF
        ARYLPVTTLLEHARALLQCWF
Subjt:  ARYLPVTTLLEHARALLQCWF

A0A6J1DSY0 uncharacterized protein LOC1110236355.9e-10255.64Show/hide
Query:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN
        +SGNAP Q VEE V   MQNII  N SD++ EIAVKGIFRSKE+LRFKLSVLAMK NF+FKVKKSTKTL+TVGCT+QGCKWGLR KSI+ GDSFIISKFN
Subjt:  MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFN

Query:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEAL------------
        D HKCKR+VLNHDHRQA+SWVVGQL+KSN+EDVSRQYRPKDIINDMR+NYGVNIRYEKAWRAKNV LNLLMG PKHSYTLLRKYGEAL            
Subjt:  DVHKCKRDVLNHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEAL------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------KDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGL
                               KDKFKDDAMQE+FILA KACRK EFRYYFSQLAGFP+VQRYLEGIGFEKWT AFQPGL
Subjt:  -----------------------KDKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGL

A0A6J1DU12 protein FAR-RED ELONGATED HYPOCOTYL 3-like3.5e-7842.36Show/hide
Query:  IANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVV
        I N+  +V +IAV  +FRSK++LRFKL+V A+  NFE+KVKKST  L +V CT+ GCKW LRV+ I+  ++F+IS F++ H C+R  L HDHRQA SWVV
Subjt:  IANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVLNHDHRQAQSWVV

Query:  GQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEALK---------------------------------
        GQL+KSN E+VSR+YRPKDI+NDM+KNYGVN+RYEKA RAK V L LLMGSP+ SY+ L KYGEALK                                 
Subjt:  GQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEALK---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --DKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF
          +KF+++ MQ +F  A KA +  +FRYY+ QLAGFP VQ+YLE IGF+KW  A+QPG+RY+QMTSN+AESMNAVLVHAR LP+T + E+ RALLQ WF
Subjt:  --DKFKDDAMQEMFILAVKACRKLEFRYYFSQLAGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGGAATGCACCTTGTCAAATAGTTGAAGAAGCTGTTTCACATCTGATGCAGAATATCATAATTGCCAATAAGTCCGACTACGTATGTGAAATTGCTGTTAAGGG
TATTTTCCGTTCGAAGGAAAAACTACGCTTCAAGCTGTCTGTGTTAGCTATGAAGCCGAATTTCGAATTTAAGGTTAAAAAATCGACGAAGACATTGTACACCGTTGGAT
GTACCAAGCAAGGTTGCAAATGGGGTTTGCGTGTGAAGAGTATCCGAGAAGGTGACTCATTCATCATTTCAAAGTTTAATGATGTTCACAAGTGCAAACGTGACGTATTG
AACCATGACCACAGACAAGCTCAGAGTTGGGTGGTTGGTCAGTTATTGAAGTCCAATCTGGAGGATGTCAGCCGTCAGTACAGACCTAAGGACATTATTAATGACATGCG
AAAGAACTATGGGGTGAACATTAGATATGAAAAGGCGTGGCGTGCCAAAAATGTATGCTTGAATCTGCTTATGGGGTCACCTAAGCATTCATATACTTTGTTACGCAAAT
ATGGTGAAGCATTGAAGGACAAGTTCAAGGACGATGCCATGCAAGAAATGTTTATATTAGCAGTAAAGGCCTGCAGGAAATTAGAGTTCAGATACTATTTTTCCCAACTA
GCAGGGTTTCCAAAGGTCCAGCGATACTTGGAAGGAATTGGTTTTGAAAAATGGACTTGTGCATTTCAACCGGGTTTGAGGTATGACCAAATGACATCTAATATTGCGGA
GTCTATGAATGCAGTCCTTGTCCATGCACGTTATTTGCCAGTCACTACACTTCTGGAACATGCTCGTGCACTCTTACAATGCTGGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGGAATGCACCTTGTCAAATAGTTGAAGAAGCTGTTTCACATCTGATGCAGAATATCATAATTGCCAATAAGTCCGACTACGTATGTGAAATTGCTGTTAAGGG
TATTTTCCGTTCGAAGGAAAAACTACGCTTCAAGCTGTCTGTGTTAGCTATGAAGCCGAATTTCGAATTTAAGGTTAAAAAATCGACGAAGACATTGTACACCGTTGGAT
GTACCAAGCAAGGTTGCAAATGGGGTTTGCGTGTGAAGAGTATCCGAGAAGGTGACTCATTCATCATTTCAAAGTTTAATGATGTTCACAAGTGCAAACGTGACGTATTG
AACCATGACCACAGACAAGCTCAGAGTTGGGTGGTTGGTCAGTTATTGAAGTCCAATCTGGAGGATGTCAGCCGTCAGTACAGACCTAAGGACATTATTAATGACATGCG
AAAGAACTATGGGGTGAACATTAGATATGAAAAGGCGTGGCGTGCCAAAAATGTATGCTTGAATCTGCTTATGGGGTCACCTAAGCATTCATATACTTTGTTACGCAAAT
ATGGTGAAGCATTGAAGGACAAGTTCAAGGACGATGCCATGCAAGAAATGTTTATATTAGCAGTAAAGGCCTGCAGGAAATTAGAGTTCAGATACTATTTTTCCCAACTA
GCAGGGTTTCCAAAGGTCCAGCGATACTTGGAAGGAATTGGTTTTGAAAAATGGACTTGTGCATTTCAACCGGGTTTGAGGTATGACCAAATGACATCTAATATTGCGGA
GTCTATGAATGCAGTCCTTGTCCATGCACGTTATTTGCCAGTCACTACACTTCTGGAACATGCTCGTGCACTCTTACAATGCTGGTTTTAG
Protein sequenceShow/hide protein sequence
MSGNAPCQIVEEAVSHLMQNIIIANKSDYVCEIAVKGIFRSKEKLRFKLSVLAMKPNFEFKVKKSTKTLYTVGCTKQGCKWGLRVKSIREGDSFIISKFNDVHKCKRDVL
NHDHRQAQSWVVGQLLKSNLEDVSRQYRPKDIINDMRKNYGVNIRYEKAWRAKNVCLNLLMGSPKHSYTLLRKYGEALKDKFKDDAMQEMFILAVKACRKLEFRYYFSQL
AGFPKVQRYLEGIGFEKWTCAFQPGLRYDQMTSNIAESMNAVLVHARYLPVTTLLEHARALLQCWF