CuGenDBv2

MS001810 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001810
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: ABC transporter F family member 4-like
Genome location: scaffold30:189043..189966
RNA-Seq Expression: MS001810
Synteny: MS001810
Gene Ontology terms: NA
InterPro domains: NA
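
The genome location field above follows a `seqid:start..end` pattern. A minimal sketch of splitting it into parts and computing the span, assuming the coordinates are 1-based and inclusive (the page does not state this; `parse_location` and `span_length` are hypothetical helper names):

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end = parse_location("scaffold30:189043..189966")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 924 bp on scaffold30.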


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057746.1 ABC transporter F family member 4-like [Cucumis melo var. makuwa] (e value: 2.5e-37; % identity: 40.07)
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+              HE  ST      + +++ ++    + ++   P+
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA

Query:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA
           A+  K   + +      P P                      ++ +  I+E   +++NKK GED     E+   SIICPGSPSFR YFVEE  DDK 
Subjt:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA

Query:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
         VE+KD  A    DVSH  SP+HD +++TT+  +       +I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  LLARKA+A
Subjt:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

XP_008464387.1 PREDICTED: uncharacterized protein LOC103502290 [Cucumis melo] (e value: 2.5e-37; % identity: 40.07)
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+              HE  ST      + +++ ++    + ++   P+
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA

Query:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA
           A+  K   + +      P P                      ++ +  I+E   +++NKK GED     E+   SIICPGSPSFR YFVEE  DDK 
Subjt:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA

Query:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
         VE+KD  A    DVSH  SP+HD +++TT+  +       +I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  LLARKA+A
Subjt:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

XP_011649820.1 probable DNA-directed RNA polymerase I subunit RPA43 isoform X1 [Cucumis sativus] (e value: 1.2e-36; % identity: 41.56)
Query:  MGCGNSRLIPDGESIPARIRPL-MRLRFPDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKD
        MGCGNS+L P GE +P RIRPL +R +  +LR+RKNG++L  G LSKKVLLKD E +   +MHV ++            GSTK     +  H N  +   
Subjt:  MGCGNSRLIPDGESIPARIRPL-MRLRFPDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKD

Query:  DEDYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVY
        DE  S   IP S+NNA                                   +K+D            +++NKK GED     E+   S ICPGSPSFR+Y
Subjt:  DEDYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVY

Query:  FVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRSTQ
        FVEE  DDK  VE+KD  A    DVSH  SP+ D +++TT+           I KGK+   +    + SKKR    GV+NLLNVKSCYHL CSGNDR+  
Subjt:  FVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRSTQ

Query:  LLARKAQA
        LLARKA+A
Subjt:  LLARKAQA

XP_022135203.1 uncharacterized protein LOC111007223 isoform X1 [Momordica charantia] (e value: 4.4e-127; % identity: 94.25)
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE
        MGCGNSRLIPDGESIPARIRPLMRLRF DLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTK  PAPAPAHINKQDNKDDE
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ
        DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKD       DVSHNNSPTHDRIDTTTTTTTCANFGQ
Subjt:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ

Query:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
             VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
Subjt:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

XP_022135204.1 uncharacterized protein LOC111007223 isoform X2 [Momordica charantia] (e value: 4.6e-124; % identity: 92.34)
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE
        MGCGNSRLIPDGESIPARIRPLMRLRF DLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTK  PAPAPAHINKQDNKDDE
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ
        DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKAS           +DVSHNNSPTHDRIDTTTTTTTCANFGQ
Subjt:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ

Query:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
             VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
Subjt:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LNT2 Uncharacterized protein (e value: 6.0e-37; % identity: 41.56)
Query:  MGCGNSRLIPDGESIPARIRPL-MRLRFPDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKD
        MGCGNS+L P GE +P RIRPL +R +  +LR+RKNG++L  G LSKKVLLKD E +   +MHV ++            GSTK     +  H N  +   
Subjt:  MGCGNSRLIPDGESIPARIRPL-MRLRFPDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKD

Query:  DEDYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVY
        DE  S   IP S+NNA                                   +K+D            +++NKK GED     E+   S ICPGSPSFR+Y
Subjt:  DEDYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVY

Query:  FVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRSTQ
        FVEE  DDK  VE+KD  A    DVSH  SP+ D +++TT+           I KGK+   +    + SKKR    GV+NLLNVKSCYHL CSGNDR+  
Subjt:  FVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRSTQ

Query:  LLARKAQA
        LLARKA+A
Subjt:  LLARKAQA

A0A1S3CLT6 uncharacterized protein LOC103502290 (e value: 1.2e-37; % identity: 40.07)
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+              HE  ST      + +++ ++    + ++   P+
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA

Query:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA
           A+  K   + +      P P                      ++ +  I+E   +++NKK GED     E+   SIICPGSPSFR YFVEE  DDK 
Subjt:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA

Query:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
         VE+KD  A    DVSH  SP+HD +++TT+  +       +I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  LLARKA+A
Subjt:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

A0A5D3BHA4 ABC transporter F family member 4-like (e value: 1.2e-37; % identity: 40.07)
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+              HE  ST      + +++ ++    + ++   P+
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFPDLRRRKNGSNLETGTLSKKVLLKD--------------HEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPA

Query:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA
           A+  K   + +      P P                      ++ +  I+E   +++NKK GED     E+   SIICPGSPSFR YFVEE  DDK 
Subjt:  PAPAHINKQDNKDDEDYSPIPIP----------------------LSSNNAIKEDLHEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVEEEDDDKA

Query:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
         VE+KD  A    DVSH  SP+HD +++TT+  +       +I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  LLARKA+A
Subjt:  SVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

A0A6J1C0S3 uncharacterized protein LOC111007223 isoform X2 (e value: 2.2e-124; % identity: 92.34)
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE
        MGCGNSRLIPDGESIPARIRPLMRLRF DLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTK  PAPAPAHINKQDNKDDE
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ
        DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKAS           +DVSHNNSPTHDRIDTTTTTTTCANFGQ
Subjt:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ

Query:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
             VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
Subjt:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

A0A6J1C458 uncharacterized protein LOC111007223 isoform X1 (e value: 2.1e-127; % identity: 94.25)
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE
        MGCGNSRLIPDGESIPARIRPLMRLRF DLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTK  PAPAPAHINKQDNKDDE
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ
        DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKD       DVSHNNSPTHDRIDTTTTTTTCANFGQ
Subjt:  DYSPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQ

Query:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
             VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
Subjt:  -----VIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G50830.1 unknown protein (e value: 2.4e-06; % identity: 28.02)
Query:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFPDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF
        MGCG SRL         +G    +PA IRPL+R R  ++++R +   L+ + TLSKK LL+        D E+N  S+            +V +KK V +
Subjt:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFPDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF

Query:  SSYSSPSGSTKPAPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYF
            S     +         +NKQ+  +D  +  +       N   +K+   E+H+        KK G+D          I +    +I PGSPSFRVY 
Subjt:  SSYSSPSGSTKPAPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYF

Query:  VEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKE
        V+   DD       DD  KD ED   +       ++T + TT     G ++  + KE
Subjt:  VEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKE

AT5G50830.2 unknown protein (e value: 4.1e-06; % identity: 27.45)
Query:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFPDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF
        MGCG SRL         +G    +PA IRPL+R R  ++++R +   L+ + TLSKK LL+        D E+N  S+            +V +KK V +
Subjt:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFPDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF

Query:  SSYSSPSGSTKPAPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYF
            S     +         +NKQ+  +D  +  +       N   +K+   E+H+        KK G+D          I +    +I PGSPSFRVY 
Subjt:  SSYSSPSGSTKPAPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYF

Query:  VEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKS-CY-HLRCSGNDRSTQL
        V+   DD       DD  KD ED   +       ++T + TT        I+ K K+  R     I+  ++      L NV + CY    C GN  S  +
Subjt:  VEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKS-CY-HLRCSGNDRSTQL

Query:  LARKAQ
          + +Q
Subjt:  LARKAQ
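
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of deriving a percent identity from such a midline, assuming the usual BLAST convention of identical columns divided by alignment length (`percent_identity` is a hypothetical helper, and the input is only the first eight columns of the first GenBank hit, so it does not reproduce the values listed on this page):

```python
def percent_identity(midline, aligned_length=None):
    """Percent of alignment columns whose midline character is a letter.

    aligned_length should be the length of the aligned query line when
    trailing spaces may have been stripped from the midline.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    if aligned_length == 0:
        raise ValueError("empty alignment")
    # Letters mark identities; '+' and spaces do not count.
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / aligned_length

query_fragment = "MGCGNSRL"
midline_fragment = "MGCGNS+L"
print(percent_identity(midline_fragment, len(query_fragment)))  # 87.5
```

To apply this to a full hit, the midline segments of every block would need to be concatenated first, keeping their leading spaces so the columns stay aligned.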


Sequences
CDS sequence
ATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAGTCGATTCCTGCCCGGATTCGCCCACTTATGCGCCTTAGATTTCCGGATTTGAGGAGGCGTAAGAACGGAAG
CAATCTGGAGACTGGAACGCTCTCGAAGAAAGTGCTTCTCAAAGATCACGAAGACAACTCTACCTCTATGCATGTTATTGATAAAAAAAGCGTATCTTTTTCATCTTATT
CTTCACCTAGTGGCAGCACCAAACCTGCACCTGCACCTGCACCTGCACACATCAACAAACAGGACAACAAGGACGATGAAGATTATTCACCAATTCCCATTCCCCTCTCT
AGCAACAATGCAATCAAAGAGGACCTCCATGAAGACCACAACAAGAAATCAGGAGAAGATCACATCGAAGACACTGCTCGGTCCATCATCTGTCCTGGATCCCCCAGTTT
CAGAGTTTATTTCGTTGAAGAAGAAGATGATGACAAAGCAAGCGTTGAAATCAAAGACGATGCTGCTAAGGATAGGGAAGATGTCTCGCACAACAACTCCCCAACCCATG
ACAGAATCGACACCACCACCACCACCACCACTTGTGCAAATTTTGGCCAGGTGATCATAACGAAAGGAAAAGAAGGATCCAGGAGTTGTACAAAGGCCATCAGTAGTAAG
AAAAGACAAGGTGGTGTAAGGAATCTGTTGAATGTCAAATCTTGCTACCATTTACGTTGTTCTGGCAACGACAGATCCACTCAGCTTCTTGCTAGAAAAGCTCAAGCT
mRNA sequence
ATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAGTCGATTCCTGCCCGGATTCGCCCACTTATGCGCCTTAGATTTCCGGATTTGAGGAGGCGTAAGAACGGAAG
CAATCTGGAGACTGGAACGCTCTCGAAGAAAGTGCTTCTCAAAGATCACGAAGACAACTCTACCTCTATGCATGTTATTGATAAAAAAAGCGTATCTTTTTCATCTTATT
CTTCACCTAGTGGCAGCACCAAACCTGCACCTGCACCTGCACCTGCACACATCAACAAACAGGACAACAAGGACGATGAAGATTATTCACCAATTCCCATTCCCCTCTCT
AGCAACAATGCAATCAAAGAGGACCTCCATGAAGACCACAACAAGAAATCAGGAGAAGATCACATCGAAGACACTGCTCGGTCCATCATCTGTCCTGGATCCCCCAGTTT
CAGAGTTTATTTCGTTGAAGAAGAAGATGATGACAAAGCAAGCGTTGAAATCAAAGACGATGCTGCTAAGGATAGGGAAGATGTCTCGCACAACAACTCCCCAACCCATG
ACAGAATCGACACCACCACCACCACCACCACTTGTGCAAATTTTGGCCAGGTGATCATAACGAAAGGAAAAGAAGGATCCAGGAGTTGTACAAAGGCCATCAGTAGTAAG
AAAAGACAAGGTGGTGTAAGGAATCTGTTGAATGTCAAATCTTGCTACCATTTACGTTGTTCTGGCAACGACAGATCCACTCAGCTTCTTGCTAGAAAAGCTCAAGCT
Protein sequence
MGCGNSRLIPDGESIPARIRPLMRLRFPDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAPAHINKQDNKDDEDYSPIPIPLS
SNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDAAKDREDVSHNNSPTHDRIDTTTTTTTCANFGQVIITKGKEGSRSCTKAISSK
KRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQA
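
The protein sequence corresponds to a codon-by-codon translation of the CDS. A minimal sketch that checks this on the first and last few codons, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and the two inputs are short fragments copied from the start and end of the CDS above:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; a trailing partial codon is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAG"
cds_end = "GCTAGAAAAGCTCAAGCT"
print(translate(cds_start))  # MGCGNSRLIPDGE
print(translate(cds_end))    # ARKAQA
```

Both fragments agree with the start (`MGCGNSRLIPDGE`) and end (`ARKAQA`) of the protein sequence listed above.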