CuGenDBv2

MC06g0026 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g0026
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: ABC transporter F family member 4-like
Genome location: MC06:177222..183966
RNA-Seq Expression: MC06g0026
Synteny: MC06g0026
Gene Ontology terms: NA
InterPro domains: NA
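
As a small illustration of how the genome location string in this record can be read programmatically, the sketch below splits it into chromosome, start and end, and derives a span length. It assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'CHROM:START..END' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("MC06:177222..183966")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the gene spans 6745 bp on MC06.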


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057746.1 ABC transporter F family member 4-like [Cucumis melo var. makuwa] | e value: 6.88e-49 | % identity: 42.35
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD E 
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KVI  KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

XP_008464387.1 PREDICTED: uncharacterized protein LOC103502290 [Cucumis melo] | e value: 8.10e-49 | % identity: 42.35
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD E 
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KVI  KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

XP_011649820.1 probable DNA-directed RNA polymerase I subunit RPA43 isoform X1 [Cucumis sativus] | e value: 5.59e-48 | % identity: 41.94
Query:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE
        MGCGNS+L P GE +P RIRPL +R +  +LR+RKNG++L  G LSKKVLLKD E +   +MHV ++            GSTK   +  H N  +   DE
Subjt:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFV
          S   IP S+NNA                                   +K+D            +++NKK GED     E+   S ICPGSPSFR+YFV
Subjt:  DYSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFV

Query:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS
        EE  DDK  VE+KD     DVSH  SP+ D +++TT+    A   + Q++K I    K+G +  T   + SKKR    GV+NLLNVKSCYHL CSGNDR+
Subjt:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS

Query:  TQLLARKAQA
          LLARKA+A
Subjt:  TQLLARKAQA

XP_022135203.1 uncharacterized protein LOC111007223 isoform X1 [Momordica charantia] | e value: 2.36e-177 | % identity: 100
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

XP_022135204.1 uncharacterized protein LOC111007223 isoform X2 [Momordica charantia] | e value: 1.54e-172 | % identity: 98.43
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKAS    DDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LNT2 Uncharacterized protein | e value: 2.71e-48 | % identity: 41.94
Query:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE
        MGCGNS+L P GE +P RIRPL +R +  +LR+RKNG++L  G LSKKVLLKD E +   +MHV ++            GSTK   +  H N  +   DE
Subjt:  MGCGNSRLIPDGESIPARIRPL-MRLRFSDLRRRKNGSNLETGTLSKKVLLKDHE-DNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDE

Query:  DYSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFV
          S   IP S+NNA                                   +K+D            +++NKK GED     E+   S ICPGSPSFR+YFV
Subjt:  DYSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFV

Query:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS
        EE  DDK  VE+KD     DVSH  SP+ D +++TT+    A   + Q++K I    K+G +  T   + SKKR    GV+NLLNVKSCYHL CSGNDR+
Subjt:  EEEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCT-KAISSKKR--QGGVRNLLNVKSCYHLRCSGNDRS

Query:  TQLLARKAQA
          LLARKA+A
Subjt:  TQLLARKAQA

A0A1S3CLT6 uncharacterized protein LOC103502290 | e value: 3.92e-49 | % identity: 42.35
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD E 
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KVI  KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

A0A5D3BHA4 ABC transporter F family member 4-like | e value: 3.33e-49 | % identity: 42.35
Query:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED
        MGCGNS+L P+GE +P RIRPL+ R +F +LR+RKNG++L  G LSKKVLLK+ E    +   +D +           GSTK   +  H      KD E 
Subjt:  MGCGNSRLIPDGESIPARIRPLM-RLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDED

Query:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE
         S   IP S+N+A                                   +K+D            +++NKK GED     E+   SIICPGSPSFR YFVE
Subjt:  YSPIPIPLSSNNA-----------------------------------IKEDLH----------EDHNKKSGED---HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL
        E  DDK  VE+KD     DVSH  SP+HD +++TT+    A  G+ Q++KVI  KGK+G+ +  + IS ++    GV+NLLNVKSCYHL CSGNDR+  L
Subjt:  EEDDDKASVEIKD-----DVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG-GVRNLLNVKSCYHLRCSGNDRSTQL

Query:  LARKAQA
        LARKA+A
Subjt:  LARKAQA

A0A6J1C0S3 uncharacterized protein LOC111007223 isoform X2 | e value: 7.48e-173 | % identity: 98.43
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKAS    DDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

A0A6J1C458 uncharacterized protein LOC111007223 isoform X1 | e value: 1.14e-177 | % identity: 100
Query:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
        MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY
Subjt:  MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDY

Query:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
        SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT
Subjt:  SPIPIPLSSNNAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIIT

Query:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
        KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
Subjt:  KGKEGSRSCTKAISSKKRQGGVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G50830.1 unknown protein | e value: 1.2e-05 | % identity: 26.82
Query:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF
        MGCG SRL         +G    +PA IRPL+R R  ++++R +   L+ + TLSKK LL+        D E+N  S+            +V +KK V +
Subjt:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF

Query:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE
            S     K       +NKQ+  +D  +  +       N   +K+   E+H+        KK G+D          I +    +I PGSPSFRVY V+
Subjt:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE

Query:  EEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKS-CY-HLRCSGNDRSTQLLARK
           DD       DD   +       ++T + TT      + ++   I+ K K+  R     I+  ++      L NV + CY    C GN  S  +  + 
Subjt:  EEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQGGVRNLLNVKS-CY-HLRCSGNDRSTQLLARK

Query:  AQ
        +Q
Subjt:  AQ

AT5G50830.2 unknown protein | e value: 2.2e-07 | % identity: 29.58
Query:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF
        MGCG SRL         +G    +PA IRPL+R R  ++++R +   L+ + TLSKK LL+        D E+N  S+            +V +KK V +
Subjt:  MGCGNSRL-------IPDG--ESIPARIRPLMRLRFSDLRRRKNGSNLE-TGTLSKKVLLK--------DHEDNSTSM------------HVIDKKSVSF

Query:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE
            S     K       +NKQ+  +D  +  +       N   +K+   E+H+        KK G+D          I +    +I PGSPSFRVY V+
Subjt:  SSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSNN--AIKEDLHEDHN--------KKSGED---------HIEDTARSIICPGSPSFRVYFVE

Query:  ---EEDDDKASVE
           ++DD++  VE
Subjt:  ---EEDDDKASVE


Sequences
CDS sequence
ATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAGTCGATTCCTGCCCGGATTCGCCCACTTATGCGCCTTAGATTTTCGGATTTGAGGAGGCGTAAGAACGGAAG
CAATCTGGAGACTGGAACGCTCTCGAAGAAAGTGCTTCTCAAAGATCACGAAGACAACTCTACCTCTATGCATGTTATTGATAAAAAAAGCGTATCTTTTTCATCTTATT
CTTCACCTAGTGGCAGCACCAAACCTGCACCTGCACCTGCACACATCAACAAACAGGACAACAAGGACGATGAAGATTATTCACCAATTCCCATTCCCCTCTCTAGCAAC
AATGCAATCAAAGAGGACCTCCATGAAGACCACAACAAGAAATCAGGAGAAGATCACATCGAAGACACTGCTCGGTCCATCATCTGTCCTGGATCCCCCAGTTTCAGAGT
TTATTTCGTTGAAGAAGAAGATGATGACAAAGCAAGCGTTGAAATCAAAGACGATGTCTCGCACAACAACTCCCCAACCCATGACAGAATCGACACCACCACCACCACCA
CCACTTGTGCAAATTTTGGCCAGGCCCAAGACAGCAAGGTGATCATAACGAAAGGAAAAGAAGGATCCAGGAGTTGTACAAAGGCCATTAGTAGTAAGAAAAGACAAGGT
GGTGTAAGGAATCTGTTGAATGTCAAATCTTGCTACCATTTACGTTGTTCTGGCAACGACAGATCCACTCAGCTTCTTGCTAGAAAAGCTCAAGCTCAAGCTTATTAA
mRNA sequence
GAGAAAAAAAAGGGTAAAGATAGAATAGCACCTGAAAGGTAGATTGTTGCGTGGAGTTTATTATTGGCAAATTGGGGCTTATGAAAAGTAAGTTGAATGTCCAGAAGGAA
ATTAAAGAGCAATGGATGTAAAAAGAAAAGATAACAAATAAAGTAACTGGAAGGAGTTATGGGAGCCTGTTTTTTACTATTGTTAGGGGAGACAGTGTGTTCCAATGCGG
CATCGTGTTATGGTGGTGTCAAATTCAGCTGGTGAAATTCAGTGTTCGTCTTCAAAATCCTTGGTTCTGCTTCTGCCTTGTTTAACGCACCCCCAAGAAACAAATCAATC
AATTTTCTGTGTCGGCCAATTCAACAATCTCGAGGCTTACAACAATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAGTCGATTCCTGCCCGGATTCGCCCACTT
ATGCGCCTTAGATTTTCGGATTTGAGGAGGCGTAAGAACGGAAGCAATCTGGAGACTGGAACGCTCTCGAAGAAAGTGCTTCTCAAAGATCACGAAGACAACTCTACCTC
TATGCATGTTATTGATAAAAAAAGCGTATCTTTTTCATCTTATTCTTCACCTAGTGGCAGCACCAAACCTGCACCTGCACCTGCACACATCAACAAACAGGACAACAAGG
ACGATGAAGATTATTCACCAATTCCCATTCCCCTCTCTAGCAACAATGCAATCAAAGAGGACCTCCATGAAGACCACAACAAGAAATCAGGAGAAGATCACATCGAAGAC
ACTGCTCGGTCCATCATCTGTCCTGGATCCCCCAGTTTCAGAGTTTATTTCGTTGAAGAAGAAGATGATGACAAAGCAAGCGTTGAAATCAAAGACGATGTCTCGCACAA
CAACTCCCCAACCCATGACAGAATCGACACCACCACCACCACCACCACTTGTGCAAATTTTGGCCAGGCCCAAGACAGCAAGGTGATCATAACGAAAGGAAAAGAAGGAT
CCAGGAGTTGTACAAAGGCCATTAGTAGTAAGAAAAGACAAGGTGGTGTAAGGAATCTGTTGAATGTCAAATCTTGCTACCATTTACGTTGTTCTGGCAACGACAGATCC
ACTCAGCTTCTTGCTAGAAAAGCTCAAGCTCAAGCTTATTAATCATTCTCAAGGGGCTTGGGGAACAGAGGTCTGCTGCTCCTCGTTTATTAGGACATTACTGCCTTTTA
CTTCGCTAAACATTCCCTCTGTCTCTCCCTCCATCTCATCAAACGTACCATAGGTAGAATCATTAATTACTTTATCAATGTAATATCAATACATTCATTCAAGCCGCTGC
TTACTCGGCAACCATCTCTAATACATTGTTCTAATGTCCAATATGGCGCCATTTTTTTTTTAATTCTAATCCCCAACCTTCTTTGATAACAAAATCAGAAACACGTGGAT
TTACACCACTATATTCATATCTGAGCTTCTTTTTTTTTCACTACATCACAAAATATCAAATTCAAGTTGCATGATTCTACCACTTAAAACATTATAGATCCACATTGATC
TATATTTTCAAAACACAAACAAAAAAAACTGTGATGCAAATACATATGCAAATAGTAGTAGATTGGATGTTAGTTGTGCACTTTTTTTCATGAACTAGCAAAATTAAAAG
TTTGCCTACTCGGGGTGCTTAGTGAAAGCAGAAGAGAGGAAGTCCTCATCCTCCTTGAAGATCTCCAGATTCTTGCATTTCAGTATAAAATTAACAGCCTCATCATCAAA
ATTCACCAAAAAGGCCAACTTTATCAGCTCAGATAATGCACATATGTAATCACTATACTGCCAAAGGACACCCTCAGATATGACCTCAGAAAAGTAGGTAGCACGCTTTA
TCATCTCCTTCTGTCCACCTGATATCACCATTTTCTGGATGAACAACTTTGAACTGTCCCGTTCCCAACGCATCAACCCACCTGCAGGCGCTTCAATGACCTTTCCTGAA
GTCAGGGATAGACTATATTTTATCATCACAGGTTCTACAGTCTCAAAAACAGAAAGATTGAGGAGACATTGAACAACTTCATGCCTCTTTTCTGCTTCAATTTTCTTAGT
AGGGTCTGCTAAAAAACCAAGAATGAGTCGCACCACCCCTCTTCCAATCCAAATGTCATCTGGATTCACCTGCTTCATACTGATTCCATCAACCATGTCTGCCTCTACCA
TCTCAACAGACTCAGATAGATTCCGAACTCCTATGTTCTTATAAACTTCCAGCAACCTTGATCGAGGCAAGAAAGGTGAGCTCGGCTGAGGATACCACACAAAAATGGGA
TGAGGAGAGTTTTCTTCAAATACATCCTTCAATCGAAGATCATCGGCAACAAACACGTCACGCTTATCAAACAAAAAAACTGACTCATCTGAACCAGAAACTGCAGGCAC
TTTGGCAATGGCATCAATGACAGCTTGCTCCGTTTTTGAATTATAGTGCTTAGTGACATATTTCCAAAACTTAAAGCACTTGTCATGAGATAGTTGGTCCTGATTTCTTT
CCCAACTTTTCCAGAGTTTGCAGTAGTCGACAATCGAAGGATTGGATCTTACTTGAAAGGCTTTGGAAAAGAAAATGAGTAAATCTTGTTTATAGTACCTCTCGAGAACT
GTCAACTGCAGACCAAAAAGGTCTTCCTTGTCAAAAAGAACACAATCTTCTGGGTTGATCCACTGGCCACCATTATTTCCATCAGGTACCCAAATCCTTCTAGCAGCTTC
AGTTTCTGGCTCCCAGTTGAATGCACTCAAATAAGTATACATTCGTACAATAGTGGAGAACTCGCTATGAAAATCAAGGAAACTTGACACTAGTTCACAACCATGGTCCA
AATCAACTATCACCCCCAATTTTTTGAGCTCCTTCTTATAGGACTTGATGTCAAACTTGTAAAATTCTTCGTCCATAAAAGGCCCATCAGTCGGCTTCAGATGAGGACCC
CACTCAGGAATGAAGAGCAAACACTCTTTTGGAGACTGGTAACCAAAAGAGGTCTTCAACCATCTCTGCGAGACCTTTCTGGAGAAATTATCTGAAAAGGAATAATTCTT
CTCCATCAAAGTTCGGATGCAGTCCAGAAGTGAAAGAACGTTTTCAGAAGTAATATTGACGGGATTTTGTGGAAGATAAAGCCCATCAGCAACCATTTGAGCACCATCTT
TAAAGTCAGTAACAACTCCCATATTCTTCAACTCATTTTTATATTCATGAATATGGTTTTCATAGTAGTTTTCACTATCATCGATAAAAGGAAGGAGAGCAATTGCAGAT
ATAGATTCCCAACTTGGACCATACAATATGCAATCTTTAGGAGATCTATGATCACCGAGTCGAGTCCGTAACCACTTCAACTCATGAATGCACTTCTTAAGTTCCGAAGG
AAACTTTGTAGTTGCTTTAAGCTGTTTGTAGCTTGACAGAAACGACATCACACTTTCCTTTGTTAGAGAATTTTTAGTCGCTCGTTGCCTGAATACTTGAGAAAATGCTT
TTACTGCTTCTTCTAAATCGACCACCACTCCCAGTTTTTTCAATTCCCTCTTGTAAGATGAGATAATGCAGCTTCCATAGAAGTCACAATCAACAACAGGGAAACCAGTA
AAAACCTGCATAATGCAACCCCACGAAGGATCTGACAAGTAACATTCAGCCGGAGATTTGTAGCCTAGATTTGTCTTGAGGCATTTCACACTTTTGAATGTGTTGACAAG
ATAATCGGCTGATCTTGATTCCAATATACAGTGAAGAATCAATAGAAATGCCTCGGCTCCCAGACAAGTTAATTGAGTAGGTGGCTTCAAATTGTCCACAACAAGTTGGG
AATTATCCGATGAATCGACCACCACACCAAGCAACTTAAGCTCCTCCCGGAATGACAATATCTCATCACCATAGTAATCCTTGTCAATAAAAGGGATGTTGCTCACAAGT
GATGCTGTCTTCCACTCCTCGGTGTGCAATACTGATCCAACTGGAGAGGTATAGCCACGACATGTCTTAAGCCATGTTCCTTTTCTTATGCCAGCAACAAAGCTTTCAAC
ATGCAAATGATTTTTCAAAAATCTGATGAATTTTAACATAGAGAAGACATTTTCTCTTGTCAAACTTGATAAAGTCGCCACTGACATGAGGTGATTCCCAATAAATTCTA
AAACTTCACCATATTCAAACATGACACCAATAGTTTTCAGCTCCTCTGCATAGACTTTGAACCCATCAGCGTAGAATTTATGATCAATTAAAGGAATGTCCACCAGAACT
GATCCATTTTCCAAAATACTTGCACATGACGTGGAGAGGTCAAAAGATTGAGATGGTGGTCTATAACAAGGAGATCCATTCAGAGTAGTTCTCAGCCAGCAACCTTCCTT
TATGCATTTCAAAAACTTGCAAGGAATTGAAACTCGTCTAGTTTTCAAGTTGTGAATCCAACCCAATAGCAAAAGCCCATTCTGCGCGGTCAGTGGTGAAGAAACAACAG
AAATTTCTGTATTAGGAGGAGATATAAAAGGGATATCAGAAGCACCAACATGGGTTATGAGAAAATCCATGAGTTGTTTTCTAGTTATAGATTCACCGGCAAAATGGACT
GGACAGATGTAATCAGCTCCCAACTCAACATAGCCATTATTTTTCCAAGGATTTGAATCAAGCAATTGTGCCCATTTGCTACCATCTGCTGGGATGAGAAGCACCTGCTT
GTGTTTGATAACAACACCGTATTTGTCTACTATTGGCATAACACTGCACAGAGACTTGACCTCGTCATCTGTCAGATACCTTTTTGATGATGAGTGATACAGGAAGTGAA
CATACCTAATAATGTTCTTTGGGTTATCGCCAATAGAATAGACAAGGAGTTCTGCAAATTGATATACGCTGATGGTATCAACTTTAGCCTGGTCTCTAAGCCATCGCAAC
AACGTGTCTTTCCTAGAACATAACCGGATGGATTTATGTGTGCTTTCAGGCATAAAAAAACAGTTGGCAACGAATTTGAATTCCCCGTTCGATTTATTCAGCCATGATAC
ATTATGGTCATGACAAGCTAGGTGAACCCTTCTTCCACCATTCCGAGCAGATTCATTTAAACTGCATAGGGAAACATTCCCATCAACACCAACATATCTTACAAGTGGTA
CATTCTTCATGTTTGTGACATGAAATCTCGAACTCCAATTGTCAGCAAGAAATTGTAAAAGCTCCAAATAAACGTCATCCGACACACCCTCCACAATGTTAGTACCCTGC
AGGCACTTTGCATACCATTCATCATCAACTGGTTTCACACCAAGGAAACTAAGAACCTGATCGTATTCCTCTATATCGAATGAAGAACTTAAGATGTACTTTCCATGGGA
CGCTAGATTAAGCAAACTCACTCCTTGATTGTGTGCCTTCATTAAAATATTCCAAAAAGCCGGCATAATTCTACCCACTTCAGGAGGTTTGTGGAAGAACCTCTGCTTCA
GGAAAGAATGACTAGGAACAATATTTGTTTCAAGCAACTTTTCTCTGATTAAATTCATAACAACATTCAACTTCTCGTAGGAAGAAGAAATGATGGGCAGGAAGTTGAAC
ATATGGGCCAAAGAAGACAAGGGAGCTTCAACCGTATTTTTGACCAATGAGATGAAAGCATTGACAAAAGCGGAGGGAACACAATCAAGAATCCCCTGATTCCATTTGTT
ATCAAGCAGAATGGTTTCCCTGGATGATGATAAAACAAAATCTGCCTGAATTATAAAGGGAAAGTTGGTTATCATCTCGGTAGGAAGGAAAGAGTAGACCCCAGGGGAGT
TTAATCCTCTATTGAGACGCTCTCCTTTTGGAAATGCCAACGTGATCACCAATTCCTCCACTCCCATCCTTCGTTCTACTCTGTTTTGTTCCTTGACTGGAAACTTCTGC
TTCCACATGTAGTAGGAGCATTGGGTATCAATTTCACAGTTACTTCCCTCGGAAGAGAGATGGAGAGTGTAGGATTCAGCATGAATGTTCTTCCTTGTAACGAAGTCCGT
CTCGCTGGAAATGGCAATTGCACTAACAGTGTTGGAATTGGGATCCTCGTTGACTTCCCTGACAGTAAACTGCTTAATCTTTGAAAGGAACAGTAGAACTTCTGGGTGAA
TGGTTGAAAGTTGGCGCTTGACGGGTACAATCTTGTCGGGCTTCAATGGCAAGACTATTGTGGTTGTTGGGAGTTGAGAATGGGGACCATAAATTTCCTTGATGTTAGAA
AGAATGGGGTTGTGTTCAACCCACTCGGGAACAACGTAGCCAACACCAGAATGTGGGCAGGGATGTTCACTGAAGCGTATCTGATATCCGTTGCTGAATATGTAAGGGTG
AGAAGTAACGAGGAACACACTTTTGAATCCGATTCCTGCGGAGTTGGGGAGATAAAGGATGGGGTGAAGTGGGTGAAAATTGAAAGAGGAAGAGATCAGGAAAAGAGAGG
GGAGGGCAAGGTACCTTTCTCCCCAATATAGCCGCGTTTCCTGTTGTTCTTCTTGGTGGAGCGGCCAACGCTGCAAATGGAGTCGATGTTTTTGGAAGAAAAGCCGGTCT
CGTT
Protein sequence
MGCGNSRLIPDGESIPARIRPLMRLRFSDLRRRKNGSNLETGTLSKKVLLKDHEDNSTSMHVIDKKSVSFSSYSSPSGSTKPAPAPAHINKQDNKDDEDYSPIPIPLSSN
NAIKEDLHEDHNKKSGEDHIEDTARSIICPGSPSFRVYFVEEEDDDKASVEIKDDVSHNNSPTHDRIDTTTTTTTCANFGQAQDSKVIITKGKEGSRSCTKAISSKKRQG
GVRNLLNVKSCYHLRCSGNDRSTQLLARKAQAQAY
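
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The sketch below is one minimal way to do that in plain Python, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base; `translate` and the table names are illustrative and not taken from the database.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order
# for the first, second and third codon positions ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Assumptions: the sequence contains only A, C, G, T (any case) plus
    optional whitespace, and the reading frame starts at position 0.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 39 bases and last 15 bases of the CDS shown in this record.
print(translate("ATGGGGTGCGGAAACTCGAGGCTTATCCCGGATGGAGAG"))
print(translate("GCTCAAGCTTATTAA"))
```

The two fragments translate to the start (`MGCGNSRLIPDGE`) and end (`AQAY`) of the protein sequence listed here. Applying the same function to the full CDS should reproduce the whole protein if the record is internally consistent.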