; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19990 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19990
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022160
Genome locationchr2:14839843..14845608
RNA-Seq ExpressionMoc02g19990
SyntenyMoc02g19990
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]1.6e-5546.23Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQ---LEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPA-VNGNMRDHARN------------------
        MST S+LLP DPEIE+T ++ R+EQRL+KQ   ++K+KERE        V      +AD   R   D  A +  ++     N                  
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQ---LEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPA-VNGNMRDHARN------------------

Query:  ---GEFNHIQMADNR-------------------DVAMREYAATAFQNFDSGIDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQ
           G+F  ++  D R                   D A+R    T F    SG  QA AWLNAFP  +I T   +V+KFL K+FPPT +AD+REEIISFRQ
Subjt:  ---GEFNHIQMADNR-------------------DVAMREYAATAFQNFDSGIDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQ

Query:  YDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIAT
         + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ WCS++ R   K+ DPA VLALD  T
Subjt:  YDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIAT

Query:  SMQKR
        SMQK+
Subjt:  SMQKR

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]1.3e-6586.58Show/hide
Query:  VKNKLGFIDGSIERPTGDLLLAWIHNNHVEQQSVSLYFTKLKIAWDELYQFRPICTCTYTCGGAKSASEFLQLEYIINLLMGLNEFYASTRAELLLMDPP
        VKNKLGFIDGSIE PT D L AWI NNHVEQQSVSLYFTKLK  WDEL+QFRP+CTCT TCGGAKS SEFLQLEYIINLLMGL+EFY STRAELLLMDPP
Subjt:  VKNKLGFIDGSIERPTGDLLLAWIHNNHVEQQSVSLYFTKLKIAWDELYQFRPICTCTYTCGGAKSASEFLQLEYIINLLMGLNEFYASTRAELLLMDPP

Query:  PSVNKALSLVRQEEQQRSIGTFTMIPTASFFPLVAQHSASKPPSKATNY
        PSVNKALSLVRQ+EQQRSIGT   IPTA  F LVAQHSASKPPSKATNY
Subjt:  PSVNKALSLVRQEEQQRSIGTFTMIPTASFFPLVAQHSASKPPSKATNY

XP_022155016.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 [Momordica charantia]3.4e-7961.51Show/hide
Query:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEF--NHIQMADNRDV--------------AMREY----------
        E+R   Q  ++K ++ E   ESE EST T M +IPP++P DPP VNGNM        F  N I  A N ++                RE+          
Subjt:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEF--NHIQMADNRDV--------------AMREY----------

Query:  ----AATAFQNF-DSGI----------DQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLP
             A  F N  D  +          D+AR WLN FPPGSITTW SLVEKFLTK+FPPT HADI EEI++FRQYDREPVHEAWERFKEL+RKCPNHGLP
Subjt:  ----AATAFQNF-DSGI----------DQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLP

Query:  ACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQK
        ACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQR + APKKQD AGVLALDI TSMQK
Subjt:  ACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQK

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]1.8e-12876.22Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARN EFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGI                                                                    DQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPT HADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRYRAAPKKQDPAGVLALDIATSMQK
        CSQR RAAPKKQDPAGVLALDIATSMQK
Subjt:  CSQRYRAAPKKQDPAGVLALDIATSMQK

XP_022159074.1 uncharacterized protein LOC111025516 [Momordica charantia]3.7e-5742.73Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAA
        MST SFLLP +PEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I MAD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLV
        T  ++ +S +                                                                     QA AWLNAF   +ITTW  +V
Subjt:  TAFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPT +AD+REEIISFRQ   E V+ AWERFK+LIR CPN  + AC+QIEHFFRG D PTKMMLN AANG FT K+FN+IV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRYRAAPKKQDPAGVLALDIATSMQKR
        WCS+R R   K+ DP GV ALD  TSMQK+
Subjt:  WCSQRYRAAPKKQDPAGVLALDIATSMQKR

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129477.5e-5646.23Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQ---LEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPA-VNGNMRDHARN------------------
        MST S+LLP DPEIE+T ++ R+EQRL+KQ   ++K+KERE        V      +AD   R   D  A +  ++     N                  
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQ---LEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPA-VNGNMRDHARN------------------

Query:  ---GEFNHIQMADNR-------------------DVAMREYAATAFQNFDSGIDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQ
           G+F  ++  D R                   D A+R    T F    SG  QA AWLNAFP  +I T   +V+KFL K+FPPT +AD+REEIISFRQ
Subjt:  ---GEFNHIQMADNR-------------------DVAMREYAATAFQNFDSGIDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQ

Query:  YDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIAT
         + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ WCS++ R   K+ DPA VLALD  T
Subjt:  YDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIAT

Query:  SMQKR
        SMQK+
Subjt:  SMQKR

A0A6J1CR17 uncharacterized protein LOC1110134416.2e-6686.58Show/hide
Query:  VKNKLGFIDGSIERPTGDLLLAWIHNNHVEQQSVSLYFTKLKIAWDELYQFRPICTCTYTCGGAKSASEFLQLEYIINLLMGLNEFYASTRAELLLMDPP
        VKNKLGFIDGSIE PT D L AWI NNHVEQQSVSLYFTKLK  WDEL+QFRP+CTCT TCGGAKS SEFLQLEYIINLLMGL+EFY STRAELLLMDPP
Subjt:  VKNKLGFIDGSIERPTGDLLLAWIHNNHVEQQSVSLYFTKLKIAWDELYQFRPICTCTYTCGGAKSASEFLQLEYIINLLMGLNEFYASTRAELLLMDPP

Query:  PSVNKALSLVRQEEQQRSIGTFTMIPTASFFPLVAQHSASKPPSKATNY
        PSVNKALSLVRQ+EQQRSIGT   IPTA  F LVAQHSASKPPSKATNY
Subjt:  PSVNKALSLVRQEEQQRSIGTFTMIPTASFFPLVAQHSASKPPSKATNY

A0A6J1DQF5 LOW QUALITY PROTEIN: uncharacterized protein LOC1110221601.7e-7961.51Show/hide
Query:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEF--NHIQMADNRDV--------------AMREY----------
        E+R   Q  ++K ++ E   ESE EST T M +IPP++P DPP VNGNM        F  N I  A N ++                RE+          
Subjt:  EQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEF--NHIQMADNRDV--------------AMREY----------

Query:  ----AATAFQNF-DSGI----------DQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLP
             A  F N  D  +          D+AR WLN FPPGSITTW SLVEKFLTK+FPPT HADI EEI++FRQYDREPVHEAWERFKEL+RKCPNHGLP
Subjt:  ----AATAFQNF-DSGI----------DQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLP

Query:  ACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQK
        ACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQR + APKKQD AGVLALDI TSMQK
Subjt:  ACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQK

A0A6J1DW02 uncharacterized protein LOC1110248978.7e-12976.22Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARN EFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGI                                                                    DQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPT HADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRYRAAPKKQDPAGVLALDIATSMQK
        CSQR RAAPKKQDPAGVLALDIATSMQK
Subjt:  CSQRYRAAPKKQDPAGVLALDIATSMQK

A0A6J1E1A9 uncharacterized protein LOC1110255161.8e-5742.73Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAA
        MST SFLLP +PEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I MAD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLV
        T  ++ +S +                                                                     QA AWLNAF   +ITTW  +V
Subjt:  TAFQNFDSGI--------------------------------------------------------------------DQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPT +AD+REEIISFRQ   E V+ AWERFK+LIR CPN  + AC+QIEHFFRG D PTKMMLN AANG FT K+FN+IV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRYRAAPKKQDPAGVLALDIATSMQKR
        WCS+R R   K+ DP GV ALD  TSMQK+
Subjt:  WCSQRYRAAPKKQDPAGVLALDIATSMQKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGAGTTCGTTGTCCATCTTTTCTTACAGCGTCGTCGCAGCCATTCGAGGACATCACCTTCGTATTCGCTGTGAGTTGTTTCGTAGTCGGATTTTTGGTTTGTGGTC
GTCGAATTCTTGGTGGATTTCTATTCGTGTGGCTGCTAGGCTCCATTGCCGTGGGAAGCGAGCGAGCAAGGATCTAGGAATAGAGGGACGATCCACGGTTAGTGAGGCTC
GATTTGGCGACTACGAGCGACGGACAGACGGTAGGGTTCTTGCAAATGAAGTTGCCTCGGATGTTCCCACCGACTCTCCTTCAAATTCACCGGAATTTTCTTTCATTACA
CCTCAATCTATTTCTAAGGCTGCTACAAATTCGTATTATATCCATTATACAGATAATACTCGATTGGTTCTAGTGAATCAAGTTCTGACAGAGGAAAAGTATACATCATG
GGGCCGATCTATGATAATCGCATTATTAGTCAAAAACAAATTAGGTTTTATTGACGGATCTATTGAGCGTCCCACCGGTGATCTTCTACTGGCATGGATTCACAATAACC
ATGTGGAGCAGCAATCGGTGAGTCTATACTTCACCAAATTGAAGATTGCTTGGGATGAATTGTATCAATTTCGTCCCATTTGCACTTGTACTTACACATGTGGTGGTGCT
AAGTCTGCTTCTGAGTTTCTACAACTAGAATACATTATCAATTTGCTCATGGGACTCAATGAGTTCTACGCTTCAACTCGAGCAGAGTTATTATTGATGGATCCCCCGCC
TTCCGTCAATAAAGCTTTATCATTAGTTCGACAGGAGGAGCAGCAAAGATCGATTGGCACCTTTACTATGATTCCTACTGCTTCTTTCTTTCCACTGGTTGCTCAACACT
CTGCATCTAAACCACCTTCGAAAGCCACTAATTATGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAA
ACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACAAGCACATCAATGGCAGATAT
TCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGGTGAATTCAACCATATCCAAATGGCGGACAACAGAGACGTGGCAA
TGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTA
GTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTAGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGA
GAGATTTAAAGAATTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTCAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACA
ATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATATAGGGCAGCA
CCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTTGACATTGCGACCTCGATGCAAAAGAGATGGTTACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGAGTTCGTTGTCCATCTTTTCTTACAGCGTCGTCGCAGCCATTCGAGGACATCACCTTCGTATTCGCTGTGAGTTGTTTCGTAGTCGGATTTTTGGTTTGTGGTC
GTCGAATTCTTGGTGGATTTCTATTCGTGTGGCTGCTAGGCTCCATTGCCGTGGGAAGCGAGCGAGCAAGGATCTAGGAATAGAGGGACGATCCACGGTTAGTGAGGCTC
GATTTGGCGACTACGAGCGACGGACAGACGGTAGGGTTCTTGCAAATGAAGTTGCCTCGGATGTTCCCACCGACTCTCCTTCAAATTCACCGGAATTTTCTTTCATTACA
CCTCAATCTATTTCTAAGGCTGCTACAAATTCGTATTATATCCATTATACAGATAATACTCGATTGGTTCTAGTGAATCAAGTTCTGACAGAGGAAAAGTATACATCATG
GGGCCGATCTATGATAATCGCATTATTAGTCAAAAACAAATTAGGTTTTATTGACGGATCTATTGAGCGTCCCACCGGTGATCTTCTACTGGCATGGATTCACAATAACC
ATGTGGAGCAGCAATCGGTGAGTCTATACTTCACCAAATTGAAGATTGCTTGGGATGAATTGTATCAATTTCGTCCCATTTGCACTTGTACTTACACATGTGGTGGTGCT
AAGTCTGCTTCTGAGTTTCTACAACTAGAATACATTATCAATTTGCTCATGGGACTCAATGAGTTCTACGCTTCAACTCGAGCAGAGTTATTATTGATGGATCCCCCGCC
TTCCGTCAATAAAGCTTTATCATTAGTTCGACAGGAGGAGCAGCAAAGATCGATTGGCACCTTTACTATGATTCCTACTGCTTCTTTCTTTCCACTGGTTGCTCAACACT
CTGCATCTAAACCACCTTCGAAAGCCACTAATTATGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAA
ACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACAAGCACATCAATGGCAGATAT
TCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGGTGAATTCAACCATATCCAAATGGCGGACAACAGAGACGTGGCAA
TGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTA
GTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTAGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGA
GAGATTTAAAGAATTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTCAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACA
ATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATATAGGGCAGCA
CCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTTGACATTGCGACCTCGATGCAAAAGAGATGGTTACAATGA
Protein sequenceShow/hide protein sequence
MPSSLSIFSYSVVAAIRGHHLRIRCELFRSRIFGLWSSNSWWISIRVAARLHCRGKRASKDLGIEGRSTVSEARFGDYERRTDGRVLANEVASDVPTDSPSNSPEFSFIT
PQSISKAATNSYYIHYTDNTRLVLVNQVLTEEKYTSWGRSMIIALLVKNKLGFIDGSIERPTGDLLLAWIHNNHVEQQSVSLYFTKLKIAWDELYQFRPICTCTYTCGGA
KSASEFLQLEYIINLLMGLNEFYASTRAELLLMDPPPSVNKALSLVRQEEQQRSIGTFTMIPTASFFPLVAQHSASKPPSKATNYGKLKCMSTRSFLLPLDPEIERTLRK
TRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNGEFNHIQMADNRDVAMREYAATAFQNFDSGIDQARAWLNAFPPGSITTWGSL
VEKFLTKFFPPTSHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAA
PKKQDPAGVLALDIATSMQKRWLQ