; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g00250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g00250
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionABC transporter F family member 4-like
Genome locationchr6:182227..183135
RNA-Seq ExpressionMoc06g00250
SyntenyMoc06g00250
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057746.1 ABC transporter F family member 4-like [Cucumis melo var. makuwa]9.1e-4042.02Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD+ +
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KV I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

XP_008464387.1 PREDICTED: uncharacterized protein LOC103502290 [Cucumis melo]9.1e-4042.02Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD+ +
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KV I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

XP_011649820.1 probable DNA-directed RNA polymerase I subunit RPA43 isoform X1 [Cucumis sativus]4.5e-3941.94Show/hide
Query:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE
        MGCGNS+L P GE +P RIRPL +R +  +LR+RKNG++L  G LSKKVLLKD E +   +MHV ++            GSTK   +  H N  +   DE
Subjt:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFV
          S   IP S+NNA                                   +K+D            +++NKK GED     E+   S ICPGSPSFR+YFV
Subjt:  DYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFV

Query:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS
        EE  DDK  VE+KD     DVSH  SP+ D +++TT+    A   + Q++K I    K+G +  T   + SKKR    GV+NLLNVKSCYHL CSGNDR+
Subjt:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS

Query:  TQLLARKAQA
          LLARKA+A
Subjt:  TQLLARKAQA

XP_022135203.1 uncharacterized protein LOC111007223 isoform X1 [Momordica charantia]1.2e-137100Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

XP_022135204.1 uncharacterized protein LOC111007223 isoform X2 [Momordica charantia]8.3e-13498.43Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKAS    DDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

TrEMBL top hitse value%identityAlignment
A0A0A0LNT2 Uncharacterized protein2.2e-3941.94Show/hide
Query:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE
        MGCGNS+L P GE +P RIRPL +R +  +LR+RKNG++L  G LSKKVLLKD E +   +MHV ++            GSTK   +  H N  +   DE
Subjt:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFV
          S   IP S+NNA                                   +K+D            +++NKK GED     E+   S ICPGSPSFR+YFV
Subjt:  DYSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFV

Query:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS
        EE  DDK  VE+KD     DVSH  SP+ D +++TT+    A   + Q++K I    K+G +  T   + SKKR    GV+NLLNVKSCYHL CSGNDR+
Subjt:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS

Query:  TQLLARKAQA
          LLARKA+A
Subjt:  TQLLARKAQA

A0A1S3CLT6 uncharacterized protein LOC1035022904.4e-4042.02Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD+ +
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KV I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

A0A5D3BHA4 ABC transporter F family member 4-like4.4e-4042.02Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD+ +
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDL----------HEDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KV I KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

A0A6J1C0S3 uncharacterized protein LOC111007223 isoform X24.0e-13498.43Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKAS    DDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

A0A6J1C458 uncharacterized protein LOC111007223 isoform X16.0e-138100Show/hide
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50830.1 unknown protein1.2e-0526.82Show/hide
Query:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF
        MGCG SRL         +G    +PA IRPL+R R  ++++R +   L+ + TLSKK LL+        D E+N  S+            +V +KK V +
Subjt:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF

Query:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE
            S     K       +NKQ+  +D  +  +       N   +K+   E+H+        KK G+D          I +    +I PGSPSFRVY V+
Subjt:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKS-CY-HLRCSGNDRSTQLLARK
           DD       DD   +       ++T + TT      + ++   I+ K K+  R     I+  ++      L NV + CY    C GN  S  +  + 
Subjt:  EEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKS-CY-HLRCSGNDRSTQLLARK

Query:  AQ
        +Q
Subjt:  AQ

AT5G50830.2 unknown protein2.2e-0729.58Show/hide
Query:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF
        MGCG SRL         +G    +PA IRPL+R R  ++++R +   L+ + TLSKK LL+        D E+N  S+            +V +KK V +
Subjt:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF

Query:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE
            S     K       +NKQ+  +D  +  +       N   +K+   E+H+        KK G+D          I +    +I PGSPSFRVY V+
Subjt:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE

Query:  ---EEDDDKASVE
           ++DD++  VE
Subjt:  ---EEDDDKASVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAGTCGATTCCTGCCCGGATTCGCCCACTTATGCGCCTTAGATTTTCGGATTTGAGGAGGCGTAAGAAC
GGAAGCAATCTGGAGACTGGAACGCTCTCGAAGAAAGTGCTTCTCAAAGATCACGAAGACAACTCTACCTCTATGCATGTTATTGATAAAAAAAGCGTATCTTTT
TCATCTTATTCTTCACCTAGTGGCAGCACCAAACCTGCACCTGCACCTGCACACATCAACAAACAGGACAACAAGGACGATGAAGATTATTCACCAATTCCCATT
CCCCTCTCTAGCAACAATGCAATCAAAGAGGACCTCCATGAAGACCACAACAAGAAATCAGGAGAAGATCACATCGAAGACACTGCTCGGTCCATCATCTGTCCT
GGATCCCCCAGTTTCAGAGTTTATTTCGTTGAAGAAGAAGATGATGACAAAGCAAGCGTTGAAATCAAAGACGATGTCTCGCACAACAACTCCCCAACCCATGAC
AGAATCGACACCACCACCACCACCACCACTTGTGCAAATTTTGGCCAGGCCCAAGACAGCAAGGTGATCATAACGAAAGGAAAAGAAGGATCCAGGAGTTGTACA
AAGGCCATTAGTAGTAAGAAAAGACAAGGTGGTGTAAGGAATCTGTTGAATGTCAAATCTTGCTACCATTTACGTTGTTCTGGCAACGACAGATCCACTCAGCTT
CTTGCTAGAAAAGCTCAAGCTCAAGCTTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAGTCGATTCCTGCCCGGATTCGCCCACTTATGCGCCTTAGATTTTCGGATTTGAGGAGGCGTAAGAAC
GGAAGCAATCTGGAGACTGGAACGCTCTCGAAGAAAGTGCTTCTCAAAGATCACGAAGACAACTCTACCTCTATGCATGTTATTGATAAAAAAAGCGTATCTTTT
TCATCTTATTCTTCACCTAGTGGCAGCACCAAACCTGCACCTGCACCTGCACACATCAACAAACAGGACAACAAGGACGATGAAGATTATTCACCAATTCCCATT
CCCCTCTCTAGCAACAATGCAATCAAAGAGGACCTCCATGAAGACCACAACAAGAAATCAGGAGAAGATCACATCGAAGACACTGCTCGGTCCATCATCTGTCCT
GGATCCCCCAGTTTCAGAGTTTATTTCGTTGAAGAAGAAGATGATGACAAAGCAAGCGTTGAAATCAAAGACGATGTCTCGCACAACAACTCCCCAACCCATGAC
AGAATCGACACCACCACCACCACCACCACTTGTGCAAATTTTGGCCAGGCCCAAGACAGCAAGGTGATCATAACGAAAGGAAAAGAAGGATCCAGGAGTTGTACA
AAGGCCATTAGTAGTAAGAAAAGACAAGGTGGTGTAAGGAATCTGTTGAATGTCAAATCTTGCTACCATTTACGTTGTTCTGGCAACGACAGATCCACTCAGCTT
CTTGCTAGAAAAGCTCAAGCTCAAGCTTATTAA
Protein sequenceShow/hide protein sequence
MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPI
PLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT
KAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY