; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr1:2833532..2840559
RNA-Seq ExpressionMoc01g04330
SyntenyMoc01g04330
Gene Ontology termsGO:0004386 - helicase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]2.9e-12762.42Show/hide
Query:  QRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEV-VKKTQKKKKVSEIAPGAISRPKTRAAVARLA
        + QLNVD EDED GELPQEVHGDEFE+EE+NDDISQYEV   + ++ S   +     T ++E +   V V     ++  S  + GA+SRP+TR AVARLA
Subjt:  QRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEV-VKKTQKKKKVSEIAPGAISRPKTRAAVARLA

Query:  TQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN---------------
         QKEA+AGPSKKAK ARVQR AEEPLEE NEEEPDSTEQTPSRVKRVRLEVRRP FTTRD+LLERGFDEAQEPVPEYVR+R+VEN               
Subjt:  TQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN---------------

Query:  -----------------------VLVHPSDEQVEEVRRLICRPNKAWTVSTTGKLSLKPLDIKEQATVSMYVVKNRLIPTSHDSSIKRNRAMMVYILMKG
                               +LVHPSDEQVEE RRLICRP+K WT+ST GKLSLKPLDI EQATV MYVVKNRLIPTS+DSSIKRNRAM+VYIL+KG
Subjt:  -----------------------VLVHPSDEQVEEVRRLICRPNKAWTVSTTGKLSLKPLDIKEQATVSMYVVKNRLIPTSHDSSIKRNRAMMVYILMKG

Query:  IEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG-------------------------------HKYELLLV
        +EFNFGELIRNEI+SCSEK+ G                VEA DANVV  KKPF SLR+V G                               HKYELLLV
Subjt:  IEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG-------------------------------HKYELLLV

Query:  TQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDDE
        TQRATCAFLKKIYGDEAPSFPDELAADLP SSR PT+S DDES DDE
Subjt:  TQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDDE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]1.3e-3762.42Show/hide
Query:  EEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEE
        EE +   S+ + PSLSSLNVSD N VA   TS+E+V L +VVKK + KK + EI PGA SRP TRA +A LA QKEA+AGP KKAKR +  R +EEPL+E
Subjt:  EEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEE

Query:  DNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN
         N+EE DS EQTPS+ KRVR EV+R NFT R++L+E+GFDEAQEPVP+Y++RRL+EN
Subjt:  DNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN

XP_022156935.1 uncharacterized protein LOC111023761 [Momordica charantia]5.1e-3159.01Show/hide
Query:  FSQEQVSGDSEHDTEPLEHSDSATVEIHSQIAPGTILDETPPATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHG
        F   +VSGDSEHD EPLEHSDSATV+I  QIAP TI+ ETPPATLQ        E+LVALNEA GEDPL+DD NSG    +     ++   G  P +VH 
Subjt:  FSQEQVSGDSEHDTEPLEHSDSATVEIHSQIAPGTILDETPPATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHG

Query:  DEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAP
        +    EE +   SQ +  SLSSLNVSD N VAT E SDEEV+L +VVKKTQKKKKV+EI P
Subjt:  DEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAP

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]4.8e-2137.46Show/hide
Query:  QVNKEGSSEKKLGGVSKVYLQKG-------AVLDEEIARLQERAEMFSQEQVSGDSEHDT-------------EPLEHSDSATVEIHSQIAPGTILDETP
        +VN+ G SEKKL G SKVYL+K        + LDE IAR+ E+ ++ ++E+   D +++                +E+S   + EI  ++          
Subjt:  QVNKEGSSEKKLGGVSKVYLQKG-------AVLDEEIARLQERAEMFSQEQVSGDSEHDT-------------EPLEHSDSATVEIHSQIAPGTILDETP

Query:  PATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEV
            Q +  ++  +ILVALNEA GEDPLEDD NS  AQ +LNVDGEDED+G+LPQEVHGDE EEEEENDDISQYEV     ++ S  +            
Subjt:  PATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEV

Query:  SLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRV
               K +   +V E A   +  P         AT + + +     +K            EE NEEEP STEQ  S+ KRV
Subjt:  SLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRV

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia]1.4e-3654.75Show/hide
Query:  YVVKNRLIPTSHDSSIKRNRAMMVYILMKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG---------
        YVVKN LI TS+DSSI++ R M+VYILMKGIEFNF ELIRNEI  C+EKMVGPL+FP  I ELCL+A VEA   +VV  KK  TS+RRV G         
Subjt:  YVVKNRLIPTSHDSSIKRNRAMMVYILMKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG---------

Query:  -------------------------HKYELLLVTQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDD
                                 H Y+LL  TQ ATC FLKK+YGD APS PDELAADLP SSR     T D+S  D
Subjt:  -------------------------HKYELLLVTQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDD

TrEMBL top hitse value%identityAlignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220071.4e-12762.42Show/hide
Query:  QRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEV-VKKTQKKKKVSEIAPGAISRPKTRAAVARLA
        + QLNVD EDED GELPQEVHGDEFE+EE+NDDISQYEV   + ++ S   +     T ++E +   V V     ++  S  + GA+SRP+TR AVARLA
Subjt:  QRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEV-VKKTQKKKKVSEIAPGAISRPKTRAAVARLA

Query:  TQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN---------------
         QKEA+AGPSKKAK ARVQR AEEPLEE NEEEPDSTEQTPSRVKRVRLEVRRP FTTRD+LLERGFDEAQEPVPEYVR+R+VEN               
Subjt:  TQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN---------------

Query:  -----------------------VLVHPSDEQVEEVRRLICRPNKAWTVSTTGKLSLKPLDIKEQATVSMYVVKNRLIPTSHDSSIKRNRAMMVYILMKG
                               +LVHPSDEQVEE RRLICRP+K WT+ST GKLSLKPLDI EQATV MYVVKNRLIPTS+DSSIKRNRAM+VYIL+KG
Subjt:  -----------------------VLVHPSDEQVEEVRRLICRPNKAWTVSTTGKLSLKPLDIKEQATVSMYVVKNRLIPTSHDSSIKRNRAMMVYILMKG

Query:  IEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG-------------------------------HKYELLLV
        +EFNFGELIRNEI+SCSEK+ G                VEA DANVV  KKPF SLR+V G                               HKYELLLV
Subjt:  IEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG-------------------------------HKYELLLV

Query:  TQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDDE
        TQRATCAFLKKIYGDEAPSFPDELAADLP SSR PT+S DDES DDE
Subjt:  TQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDDE

A0A6J1DRR9 uncharacterized protein LOC1110237612.5e-3159.01Show/hide
Query:  FSQEQVSGDSEHDTEPLEHSDSATVEIHSQIAPGTILDETPPATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHG
        F   +VSGDSEHD EPLEHSDSATV+I  QIAP TI+ ETPPATLQ        E+LVALNEA GEDPL+DD NSG    +     ++   G  P +VH 
Subjt:  FSQEQVSGDSEHDTEPLEHSDSATVEIHSQIAPGTILDETPPATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHG

Query:  DEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAP
        +    EE +   SQ +  SLSSLNVSD N VAT E SDEEV+L +VVKKTQKKKKV+EI P
Subjt:  DEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAP

A0A6J1DW11 uncharacterized protein LOC1110236206.1e-3862.42Show/hide
Query:  EEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEE
        EE +   S+ + PSLSSLNVSD N VA   TS+E+V L +VVKK + KK + EI PGA SRP TRA +A LA QKEA+AGP KKAKR +  R +EEPL+E
Subjt:  EEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEVSLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEE

Query:  DNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN
         N+EE DS EQTPS+ KRVR EV+R NFT R++L+E+GFDEAQEPVP+Y++RRL+EN
Subjt:  DNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPVPEYVRRRLVEN

A0A6J1DW79 uncharacterized protein LOC1110249642.3e-2137.46Show/hide
Query:  QVNKEGSSEKKLGGVSKVYLQKG-------AVLDEEIARLQERAEMFSQEQVSGDSEHDT-------------EPLEHSDSATVEIHSQIAPGTILDETP
        +VN+ G SEKKL G SKVYL+K        + LDE IAR+ E+ ++ ++E+   D +++                +E+S   + EI  ++          
Subjt:  QVNKEGSSEKKLGGVSKVYLQKG-------AVLDEEIARLQERAEMFSQEQVSGDSEHDT-------------EPLEHSDSATVEIHSQIAPGTILDETP

Query:  PATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEV
            Q +  ++  +ILVALNEA GEDPLEDD NS  AQ +LNVDGEDED+G+LPQEVHGDE EEEEENDDISQYEV     ++ S  +            
Subjt:  PATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEV

Query:  SLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRV
               K +   +V E A   +  P         AT + + +     +K            EE NEEEP STEQ  S+ KRV
Subjt:  SLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRV

A0A6J1E204 uncharacterized protein LOC1110257026.7e-3754.75Show/hide
Query:  YVVKNRLIPTSHDSSIKRNRAMMVYILMKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG---------
        YVVKN LI TS+DSSI++ R M+VYILMKGIEFNF ELIRNEI  C+EKMVGPL+FP  I ELCL+A VEA   +VV  KK  TS+RRV G         
Subjt:  YVVKNRLIPTSHDSSIKRNRAMMVYILMKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWG---------

Query:  -------------------------HKYELLLVTQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDD
                                 H Y+LL  TQ ATC FLKK+YGD APS PDELAADLP SSR     T D+S  D
Subjt:  -------------------------HKYELLLVTQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCGCCTGTTCCAGTGCCAATGTGCCCCCTGCCTGTCCGCACGAGCCTCACTCCACCAACACATGCACGCCCCTGCACACCCGTTAGCGCCCCTCGTGTGCCCGC
TGTAACCCTAGCCTCTGCAGCAACCAAAAGCGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAACATAGTGTATTTT
CCATGTTTTGCATCAAAACTAAGAGCCAAATTAAGCAAGTGGCCAATAAGATGAGGAATAGTCCACCAAAAACATTGCCTACCAATATGGAAGAGCAAGGTGAGGCAATC
AATGTAAGGAGCAAGCTTGAAGATGATGAACTTGAGCTCCCAATAGACAATGATGATCCACCAATCCTTGAAATGATAAACGAGGTATGGAAGAATGTGGAAGTAAGCAA
CGGCAAAGAGCAGGTTAATAAAGAAGGTTCTAGTGAGAAGAAATTAGGAGGTGTTAGTAAAGTTTATCTTCAAAAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTC
AAGAGAGAGCGGAGATGTTCAGTCAAGAACAGGTTAGTGGGGACTCAGAACATGACACGGAGCCCTTAGAGCATTCAGATTCAGCCACGGTTGAAATTCATAGTCAAATT
GCGCCTGGCACAATTTTGGATGAGACTCCACCGGCCACTCTACAAGGAAAGGACAATGCCGAAAATGCCGAAATTTTGGTGGCGTTGAATGAAGCAATGGGAGAGGATCC
ATTAGAGGATGACAGAAACAGTGGGGCAGCACAAAGACAATTGAATGTTGATGGAGAAGATGAAGATATTGGAGAATTACCCCAAGAAGTGCATGGAGATGAATTTGAAG
AGGAAGAAGAAAATGACGATATCTCTCAATATGAAGTCCCTTCTTTGTCGAGTTTGAATGTTTCTGACACAAACCTTGTTGCTACTACAGAGACTTCAGATGAGGAGGTG
AGCTTGGCCGAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGTCAGAAATTGCACCAGGCGCAATTTCTAGGCCTAAGACCCGAGCTGCTGTAGCACGTTTAGCTAC
CCAAAAAGAAGCCAAGGCTGGTCCATCTAAAAAAGCCAAGAGGGCTAGGGTGCAAAGAGGGGCAGAAGAGCCACTTGAGGAGGACAACGAAGAGGAGCCTGATTCTACCG
AACAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCAACTTCACAACACGTGATGTCCTCCTTGAGAGGGGTTTTGATGAGGCCCAAGAGCCGGTG
CCGGAATATGTTAGGAGGAGGCTTGTGGAGAATGTTTTGGTGCATCCATCGGACGAGCAAGTGGAGGAGGTGCGTAGACTTATTTGTAGACCAAATAAGGCATGGACCGT
CTCAACCACGGGGAAGCTTTCCTTAAAGCCCCTTGACATTAAAGAGCAAGCGACAGTTTCAATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCA
TTAAGCGCAATAGGGCGATGATGGTGTACATTCTCATGAAGGGCATTGAATTCAACTTTGGGGAGCTCATAAGGAACGAGATACGGAGTTGCTCCGAGAAAATGGTAGGT
CCTCTTGTTTTTCCTGGACTAATAACTGAGTTATGCTTGCAGGCGAGAGTGGAGGCCTATGATGCCAATGTTGTGACGCTCAAGAAGCCGTTCACATCCCTAAGAAGAGT
TTGGGGGCACAAGTACGAGCTTCTTTTGGTTACTCAACGTGCCACATGTGCTTTCCTTAAGAAGATATACGGTGATGAAGCACCTTCTTTCCCCGATGAGCTTGCGGCCG
ACTTACCATTTTCTTCCCGTTTTCCTACCAATTCCACCGACGATGAGTCTTTCGATGATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCGCCTGTTCCAGTGCCAATGTGCCCCCTGCCTGTCCGCACGAGCCTCACTCCACCAACACATGCACGCCCCTGCACACCCGTTAGCGCCCCTCGTGTGCCCGC
TGTAACCCTAGCCTCTGCAGCAACCAAAAGCGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAACATAGTGTATTTT
CCATGTTTTGCATCAAAACTAAGAGCCAAATTAAGCAAGTGGCCAATAAGATGAGGAATAGTCCACCAAAAACATTGCCTACCAATATGGAAGAGCAAGGTGAGGCAATC
AATGTAAGGAGCAAGCTTGAAGATGATGAACTTGAGCTCCCAATAGACAATGATGATCCACCAATCCTTGAAATGATAAACGAGGTATGGAAGAATGTGGAAGTAAGCAA
CGGCAAAGAGCAGGTTAATAAAGAAGGTTCTAGTGAGAAGAAATTAGGAGGTGTTAGTAAAGTTTATCTTCAAAAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTC
AAGAGAGAGCGGAGATGTTCAGTCAAGAACAGGTTAGTGGGGACTCAGAACATGACACGGAGCCCTTAGAGCATTCAGATTCAGCCACGGTTGAAATTCATAGTCAAATT
GCGCCTGGCACAATTTTGGATGAGACTCCACCGGCCACTCTACAAGGAAAGGACAATGCCGAAAATGCCGAAATTTTGGTGGCGTTGAATGAAGCAATGGGAGAGGATCC
ATTAGAGGATGACAGAAACAGTGGGGCAGCACAAAGACAATTGAATGTTGATGGAGAAGATGAAGATATTGGAGAATTACCCCAAGAAGTGCATGGAGATGAATTTGAAG
AGGAAGAAGAAAATGACGATATCTCTCAATATGAAGTCCCTTCTTTGTCGAGTTTGAATGTTTCTGACACAAACCTTGTTGCTACTACAGAGACTTCAGATGAGGAGGTG
AGCTTGGCCGAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGTCAGAAATTGCACCAGGCGCAATTTCTAGGCCTAAGACCCGAGCTGCTGTAGCACGTTTAGCTAC
CCAAAAAGAAGCCAAGGCTGGTCCATCTAAAAAAGCCAAGAGGGCTAGGGTGCAAAGAGGGGCAGAAGAGCCACTTGAGGAGGACAACGAAGAGGAGCCTGATTCTACCG
AACAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCAACTTCACAACACGTGATGTCCTCCTTGAGAGGGGTTTTGATGAGGCCCAAGAGCCGGTG
CCGGAATATGTTAGGAGGAGGCTTGTGGAGAATGTTTTGGTGCATCCATCGGACGAGCAAGTGGAGGAGGTGCGTAGACTTATTTGTAGACCAAATAAGGCATGGACCGT
CTCAACCACGGGGAAGCTTTCCTTAAAGCCCCTTGACATTAAAGAGCAAGCGACAGTTTCAATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCA
TTAAGCGCAATAGGGCGATGATGGTGTACATTCTCATGAAGGGCATTGAATTCAACTTTGGGGAGCTCATAAGGAACGAGATACGGAGTTGCTCCGAGAAAATGGTAGGT
CCTCTTGTTTTTCCTGGACTAATAACTGAGTTATGCTTGCAGGCGAGAGTGGAGGCCTATGATGCCAATGTTGTGACGCTCAAGAAGCCGTTCACATCCCTAAGAAGAGT
TTGGGGGCACAAGTACGAGCTTCTTTTGGTTACTCAACGTGCCACATGTGCTTTCCTTAAGAAGATATACGGTGATGAAGCACCTTCTTTCCCCGATGAGCTTGCGGCCG
ACTTACCATTTTCTTCCCGTTTTCCTACCAATTCCACCGACGATGAGTCTTTCGATGATGAATAG
Protein sequenceShow/hide protein sequence
MPSPVPVPMCPLPVRTSLTPPTHARPCTPVSAPRVPAVTLASAATKSGSERVELKSQEKSGIAPGAFSQHSVFSMFCIKTKSQIKQVANKMRNSPPKTLPTNMEEQGEAI
NVRSKLEDDELELPIDNDDPPILEMINEVWKNVEVSNGKEQVNKEGSSEKKLGGVSKVYLQKGAVLDEEIARLQERAEMFSQEQVSGDSEHDTEPLEHSDSATVEIHSQI
APGTILDETPPATLQGKDNAENAEILVALNEAMGEDPLEDDRNSGAAQRQLNVDGEDEDIGELPQEVHGDEFEEEEENDDISQYEVPSLSSLNVSDTNLVATTETSDEEV
SLAEVVKKTQKKKKVSEIAPGAISRPKTRAAVARLATQKEAKAGPSKKAKRARVQRGAEEPLEEDNEEEPDSTEQTPSRVKRVRLEVRRPNFTTRDVLLERGFDEAQEPV
PEYVRRRLVENVLVHPSDEQVEEVRRLICRPNKAWTVSTTGKLSLKPLDIKEQATVSMYVVKNRLIPTSHDSSIKRNRAMMVYILMKGIEFNFGELIRNEIRSCSEKMVG
PLVFPGLITELCLQARVEAYDANVVTLKKPFTSLRRVWGHKYELLLVTQRATCAFLKKIYGDEAPSFPDELAADLPFSSRFPTNSTDDESFDDE