; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g16650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g16650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr6:13101340..13103784
RNA-Seq ExpressionMoc06g16650
SyntenyMoc06g16650
Gene Ontology termsGO:0004386 - helicase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144472.1 LOW QUALITY PROTEIN: ATP-dependent helicase NAM7-like [Momordica charantia]7.6e-2187.69Show/hide
Query:  DPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELP
        DP+LTK P+VFDDLEQ+R T KI EILVALNEARGEDPLEDDGN+GAAQGQLNVDGEDEDLGELP
Subjt:  DPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELP

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]3.4e-17072.16Show/hide
Query:  QLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTK
        QLNVD EDED GELPQEVHGDEFEDEE+NDDISQYEV+VRTPVHESQQV+EEP  K +                      + +   V +     +   + 
Subjt:  QLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTK

Query:  KVVEIAPGPRTRATVARLAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRR
                PRTR  VARLAAQ+EA+AGPSKKAK  RVQR AEE LEEAN+EEPDSTEQTPSRVKRVRLEVRRPTFTTRDIL ERGFD AQE VPEYVR+R
Subjt:  KVVEIAPGPRTRATVARLAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRR

Query:  LVDNGWESLFAPTTRVSEALVKEFLHCHQPKPRGCS---ESNEILVHPSDGQVEEARRLICRPHKKWIVSTTGKLSLKPLDINEQATVWMYVVKNRLIPT
        +V+NGWE+LFAP TRVSEALVKEF     P  RG       NEILVHPSD QVEEARRLICRPHK W +ST GKLSLKPLDINEQATVWMYVVKNRLIPT
Subjt:  LVDNGWESLFAPTTRVSEALVKEFLHCHQPKPRGCS---ESNEILVHPSDGQVEEARRLICRPHKKWIVSTTGKLSLKPLDINEQATVWMYVVKNRLIPT

Query:  SHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVREEDSPITAVDPETR
        S+DSSIKRN+AM+VYIL+K VEFNFGELIRNEI+SCSEK+                AGVEA DANVVMP KPF SL +VRGYSIVREEDSPITA DPETR
Subjt:  SHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVREEDSPITAVDPETR

Query:  WVVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHESSDDE
         VVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAA+LPSSSR PTD  + ESSDDE
Subjt:  WVVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHESSDDE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]6.4e-3650.72Show/hide
Query:  EFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTKKVVEIAPG----PRTRATVAR
        E + ++E   + + E         ++ + E     SK + P L  SL     N + + + TS+E+V LTKVVKK +  K + EI PG    P TRAT+A 
Subjt:  EFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTKKVVEIAPG----PRTRATVAR

Query:  LAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRRLVDNGWESLFAPTTRVS
        LAAQ+EA+AGP KKAKR +  R +EE L+E N EE DS EQTPS+ KRVR EV+R  FT R+IL E+GFD AQE VP+Y++RRL++NGWE+LFAPT RVS
Subjt:  LAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRRLVDNGWESLFAPTTRVS

Query:  EALVKEF
        E LVKEF
Subjt:  EALVKEF

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]7.9e-2637.82Show/hide
Query:  MEGSSSSKPHNKEKEKKRVLLPPPTKSDNPVSEFLELSIPPPLSTTVAVHVEGQNMLVG----------------IQCQIAPGAIMDE------TPLATL
        MEGSS SKP +KE EKK+V+LPPP   +  V+   E           +     +N  VG                +  +     I D+        +A L
Subjt:  MEGSSSSKPHNKEKEKKRVLLPPPTKSDNPVSEFLELSIPPPLSTTVAVHVEGQNMLVG----------------IQCQIAPGAIMDE------TPLATL

Query:  QGLLSPSFPDPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRT
                 +       +  +  EQ+R TSKI +ILVALNEA GEDPLEDDGNS  AQG+LNVDGEDEDLG+LPQEVHGDE E+EEENDDISQYEVR+  
Subjt:  QGLLSPSFPDPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRT

Query:  PVHESQQ-VNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVS---LTKVVKKTQKTKKVVE---------IAPGPRTRATVARLAAQREAKAGP
         VHESQ+  NE P++  +     + +      ++        S EEV+    T   ++  K K+V+E         I P       V  + A  E    P
Subjt:  PVHESQQ-VNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVS---LTKVVKKTQKTKKVVE---------IAPGPRTRATVARLAAQREAKAGP

Query:  SKKAKRTRVQRG
         K A   R  RG
Subjt:  SKKAKRTRVQRG

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia]1.1e-5667.98Show/hide
Query:  VWMYVVKNRLIPTSHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVRE
        +W YVVKN LI TS+DSSI++ + M+VYILMK +EFNF ELIRNEI  C+EKMVG L+ P  I ELCL+AGVEAD  +VVM  K  TS+ RVRGY IVRE
Subjt:  VWMYVVKNRLIPTSHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVRE

Query:  EDSPITAVDPETRWVVTREQYDE---LRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHE
        EDSPITA DP+TR VVTREQYDE   LRH Y+LL  TQ ATC FLKK+YGD APS PDELAA+LPSSSR   D + H+
Subjt:  EDSPITAVDPETRWVVTREQYDE---LRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHE

TrEMBL top hitse value%identityAlignment
A0A6J1CTS8 LOW QUALITY PROTEIN: ATP-dependent helicase NAM7-like3.7e-2187.69Show/hide
Query:  DPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELP
        DP+LTK P+VFDDLEQ+R T KI EILVALNEARGEDPLEDDGN+GAAQGQLNVDGEDEDLGELP
Subjt:  DPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELP

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220071.6e-17072.16Show/hide
Query:  QLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTK
        QLNVD EDED GELPQEVHGDEFEDEE+NDDISQYEV+VRTPVHESQQV+EEP  K +                      + +   V +     +   + 
Subjt:  QLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTK

Query:  KVVEIAPGPRTRATVARLAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRR
                PRTR  VARLAAQ+EA+AGPSKKAK  RVQR AEE LEEAN+EEPDSTEQTPSRVKRVRLEVRRPTFTTRDIL ERGFD AQE VPEYVR+R
Subjt:  KVVEIAPGPRTRATVARLAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRR

Query:  LVDNGWESLFAPTTRVSEALVKEFLHCHQPKPRGCS---ESNEILVHPSDGQVEEARRLICRPHKKWIVSTTGKLSLKPLDINEQATVWMYVVKNRLIPT
        +V+NGWE+LFAP TRVSEALVKEF     P  RG       NEILVHPSD QVEEARRLICRPHK W +ST GKLSLKPLDINEQATVWMYVVKNRLIPT
Subjt:  LVDNGWESLFAPTTRVSEALVKEFLHCHQPKPRGCS---ESNEILVHPSDGQVEEARRLICRPHKKWIVSTTGKLSLKPLDINEQATVWMYVVKNRLIPT

Query:  SHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVREEDSPITAVDPETR
        S+DSSIKRN+AM+VYIL+K VEFNFGELIRNEI+SCSEK+                AGVEA DANVVMP KPF SL +VRGYSIVREEDSPITA DPETR
Subjt:  SHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVREEDSPITAVDPETR

Query:  WVVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHESSDDE
         VVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAA+LPSSSR PTD  + ESSDDE
Subjt:  WVVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHESSDDE

A0A6J1DW11 uncharacterized protein LOC1110236203.1e-3650.72Show/hide
Query:  EFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTKKVVEIAPG----PRTRATVAR
        E + ++E   + + E         ++ + E     SK + P L  SL     N + + + TS+E+V LTKVVKK +  K + EI PG    P TRAT+A 
Subjt:  EFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVSLTKVVKKTQKTKKVVEIAPG----PRTRATVAR

Query:  LAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRRLVDNGWESLFAPTTRVS
        LAAQ+EA+AGP KKAKR +  R +EE L+E N EE DS EQTPS+ KRVR EV+R  FT R+IL E+GFD AQE VP+Y++RRL++NGWE+LFAPT RVS
Subjt:  LAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILFERGFDVAQELVPEYVRRRLVDNGWESLFAPTTRVS

Query:  EALVKEF
        E LVKEF
Subjt:  EALVKEF

A0A6J1DW79 uncharacterized protein LOC1110249643.8e-2637.82Show/hide
Query:  MEGSSSSKPHNKEKEKKRVLLPPPTKSDNPVSEFLELSIPPPLSTTVAVHVEGQNMLVG----------------IQCQIAPGAIMDE------TPLATL
        MEGSS SKP +KE EKK+V+LPPP   +  V+   E           +     +N  VG                +  +     I D+        +A L
Subjt:  MEGSSSSKPHNKEKEKKRVLLPPPTKSDNPVSEFLELSIPPPLSTTVAVHVEGQNMLVG----------------IQCQIAPGAIMDE------TPLATL

Query:  QGLLSPSFPDPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRT
                 +       +  +  EQ+R TSKI +ILVALNEA GEDPLEDDGNS  AQG+LNVDGEDEDLG+LPQEVHGDE E+EEENDDISQYEVR+  
Subjt:  QGLLSPSFPDPMLTKTPLVFDDLEQKRITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRT

Query:  PVHESQQ-VNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVS---LTKVVKKTQKTKKVVE---------IAPGPRTRATVARLAAQREAKAGP
         VHESQ+  NE P++  +     + +      ++        S EEV+    T   ++  K K+V+E         I P       V  + A  E    P
Subjt:  PVHESQQ-VNEEPLQKSKKEHPVLWMSLVRPWRNHLPLLHKTSDEEVS---LTKVVKKTQKTKKVVE---------IAPGPRTRATVARLAAQREAKAGP

Query:  SKKAKRTRVQRG
         K A   R  RG
Subjt:  SKKAKRTRVQRG

A0A6J1E204 uncharacterized protein LOC1110257025.5e-5767.98Show/hide
Query:  VWMYVVKNRLIPTSHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVRE
        +W YVVKN LI TS+DSSI++ + M+VYILMK +EFNF ELIRNEI  C+EKMVG L+ P  I ELCL+AGVEAD  +VVM  K  TS+ RVRGY IVRE
Subjt:  VWMYVVKNRLIPTSHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVRE

Query:  EDSPITAVDPETRWVVTREQYDE---LRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHE
        EDSPITA DP+TR VVTREQYDE   LRH Y+LL  TQ ATC FLKK+YGD APS PDELAA+LPSSSR   D + H+
Subjt:  EDSPITAVDPETRWVVTREQYDE---LRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTTCATCTTCCTCCAAGCCGCACAACAAAGAGAAGGAGAAGAAGAGAGTGTTGTTGCCTCCACCAACCAAATCGGATAACCCCGTTTCCGAGTTTTTA
GAACTTTCTATCCCTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAAACATGTTAGTGGGGATTCAATGCCAAATTGCACCTGGCGCAATTATG
GATGAGACTCCACTGGCCACTTTACAAGGTCTTTTGTCTCCATCTTTTCCAGATCCTATGTTGACTAAAACGCCCCTAGTTTTTGATGATTTAGAACAGAAAAGG
ATAACATCGAAAATTGCCGAAATTTTGGTGGCGTTGAATGAAGCAAGGGGAGAGGATCCATTAGAGGATGATGGAAACAGTGGAGCAGCACAAGGACAATTGAAT
GTTGATGGAGAGGATGAAGATCTTGGAGAATTACCCCAAGAAGTGCATGGAGATGAGTTTGAGGACGAAGAAGAAAATGACGATATCTCTCAATATGAAGTGAGA
GTACGAACTCCGGTGCACGAATCTCAGCAAGTTAATGAGGAGCCCCTGCAAAAGAGCAAGAAGGAACATCCAGTCCTGTGGATGTCCCTAGTGAGGCCATGGAGG
AATCATCTTCCTCTTCTTCACAAAACTTCAGATGAGGAGGTGAGTTTGACCAAAGTGGTAAAGAAAACACAAAAGACGAAAAAAGTGGTAGAAATTGCGCCTGGG
CCTAGGACCCGAGCTACTGTAGCACGTTTGGCTGCCCAAAGAGAAGCCAAGGCTGGTCCATCTAAAAAAGCCAAGAGGACTAGGGTGCAAAGAGGGGCAGAAGAG
GCACTTGAGGAGGCCAATAAAGAGGAGCCCGATTCTACCGAGCAAACACCATCAAGAGTAAAAAGGGTGAGATTAGAGGTGAGGAGGCCCACCTTCACAACACGT
GATATCCTCTTTGAGAGAGGTTTTGATGTGGCCCAAGAGCTGGTGCCGGAATATGTTAGAAGAAGGCTTGTGGATAATGGTTGGGAGTCGTTGTTTGCCCCAACT
ACGCGTGTATCCGAGGCCTTGGTGAAAGAGTTTTTACACTGCCATCAACCCAAACCGAGGGGATGTAGTGAGAGTAATGAAATTTTGGTGCATCCATCGGACGGG
CAAGTGGAGGAGGCGCGTAGACTTATTTGTAGACCACATAAGAAATGGATCGTCTCAACCACGGGGAAGCTTTCCTTAAAGCCCCTTGACATCAATGAGCAAGCG
ACGGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATAAGGCGATGATGGTGTACATTCTCATGAAGAGCGTT
GAGTTCAACTTTGGGGAACTCATAAGAAATGAGATACGGAGTTGCTCCGAGAAAATGGTAGGTCATCTTGTTTGTCCTGGACTAATAACTGAGTTATGCTTGCAG
GCGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGCCCAATAAGCCGTTCACATCTCTAACAAGAGTTCGGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCC
ATTACTGCTGTGGATCCCGAGACTAGATGGGTGGTGACTAGGGAGCAGTATGATGAGCTTAGGCACAAGTACGAGCTTCTTTTGGTTACTCAACGTGCCACATGT
GCTTTCCTCAAGAAGATATACGGTGATGAAGCACCTTCATTCCCCGATGAGCTTGCGGCCAATTTACCATCTTCTTCCCGTTTCCCTACCGATTACACCAACCAT
GAATCTTCCGATGATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTTCATCTTCCTCCAAGCCGCACAACAAAGAGAAGGAGAAGAAGAGAGTGTTGTTGCCTCCACCAACCAAATCGGATAACCCCGTTTCCGAGTTTTTA
GAACTTTCTATCCCTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAAACATGTTAGTGGGGATTCAATGCCAAATTGCACCTGGCGCAATTATG
GATGAGACTCCACTGGCCACTTTACAAGGTCTTTTGTCTCCATCTTTTCCAGATCCTATGTTGACTAAAACGCCCCTAGTTTTTGATGATTTAGAACAGAAAAGG
ATAACATCGAAAATTGCCGAAATTTTGGTGGCGTTGAATGAAGCAAGGGGAGAGGATCCATTAGAGGATGATGGAAACAGTGGAGCAGCACAAGGACAATTGAAT
GTTGATGGAGAGGATGAAGATCTTGGAGAATTACCCCAAGAAGTGCATGGAGATGAGTTTGAGGACGAAGAAGAAAATGACGATATCTCTCAATATGAAGTGAGA
GTACGAACTCCGGTGCACGAATCTCAGCAAGTTAATGAGGAGCCCCTGCAAAAGAGCAAGAAGGAACATCCAGTCCTGTGGATGTCCCTAGTGAGGCCATGGAGG
AATCATCTTCCTCTTCTTCACAAAACTTCAGATGAGGAGGTGAGTTTGACCAAAGTGGTAAAGAAAACACAAAAGACGAAAAAAGTGGTAGAAATTGCGCCTGGG
CCTAGGACCCGAGCTACTGTAGCACGTTTGGCTGCCCAAAGAGAAGCCAAGGCTGGTCCATCTAAAAAAGCCAAGAGGACTAGGGTGCAAAGAGGGGCAGAAGAG
GCACTTGAGGAGGCCAATAAAGAGGAGCCCGATTCTACCGAGCAAACACCATCAAGAGTAAAAAGGGTGAGATTAGAGGTGAGGAGGCCCACCTTCACAACACGT
GATATCCTCTTTGAGAGAGGTTTTGATGTGGCCCAAGAGCTGGTGCCGGAATATGTTAGAAGAAGGCTTGTGGATAATGGTTGGGAGTCGTTGTTTGCCCCAACT
ACGCGTGTATCCGAGGCCTTGGTGAAAGAGTTTTTACACTGCCATCAACCCAAACCGAGGGGATGTAGTGAGAGTAATGAAATTTTGGTGCATCCATCGGACGGG
CAAGTGGAGGAGGCGCGTAGACTTATTTGTAGACCACATAAGAAATGGATCGTCTCAACCACGGGGAAGCTTTCCTTAAAGCCCCTTGACATCAATGAGCAAGCG
ACGGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATAAGGCGATGATGGTGTACATTCTCATGAAGAGCGTT
GAGTTCAACTTTGGGGAACTCATAAGAAATGAGATACGGAGTTGCTCCGAGAAAATGGTAGGTCATCTTGTTTGTCCTGGACTAATAACTGAGTTATGCTTGCAG
GCGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGCCCAATAAGCCGTTCACATCTCTAACAAGAGTTCGGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCC
ATTACTGCTGTGGATCCCGAGACTAGATGGGTGGTGACTAGGGAGCAGTATGATGAGCTTAGGCACAAGTACGAGCTTCTTTTGGTTACTCAACGTGCCACATGT
GCTTTCCTCAAGAAGATATACGGTGATGAAGCACCTTCATTCCCCGATGAGCTTGCGGCCAATTTACCATCTTCTTCCCGTTTCCCTACCGATTACACCAACCAT
GAATCTTCCGATGATGAATAG
Protein sequenceShow/hide protein sequence
MEGSSSSKPHNKEKEKKRVLLPPPTKSDNPVSEFLELSIPPPLSTTVAVHVEGQNMLVGIQCQIAPGAIMDETPLATLQGLLSPSFPDPMLTKTPLVFDDLEQKR
ITSKIAEILVALNEARGEDPLEDDGNSGAAQGQLNVDGEDEDLGELPQEVHGDEFEDEEENDDISQYEVRVRTPVHESQQVNEEPLQKSKKEHPVLWMSLVRPWR
NHLPLLHKTSDEEVSLTKVVKKTQKTKKVVEIAPGPRTRATVARLAAQREAKAGPSKKAKRTRVQRGAEEALEEANKEEPDSTEQTPSRVKRVRLEVRRPTFTTR
DILFERGFDVAQELVPEYVRRRLVDNGWESLFAPTTRVSEALVKEFLHCHQPKPRGCSESNEILVHPSDGQVEEARRLICRPHKKWIVSTTGKLSLKPLDINEQA
TVWMYVVKNRLIPTSHDSSIKRNKAMMVYILMKSVEFNFGELIRNEIRSCSEKMVGHLVCPGLITELCLQAGVEADDANVVMPNKPFTSLTRVRGYSIVREEDSP
ITAVDPETRWVVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAANLPSSSRFPTDYTNHESSDDE