CuGenDBv2

Moc07g04620 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g04620
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome location: chr7:3954342..3962467
RNA-Seq Expression: Moc07g04620
Synteny: Moc07g04620
Gene Ontology terms: GO:0004386 - helicase activity (molecular function)
InterPro domains: NA
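
The genome location above follows a `chromosome:start..end` pattern. The following is a minimal sketch of how such a string could be split into its parts and the locus span computed. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Parsed form of a 'chrom:start..end' location string (hypothetical helper)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Split a location such as 'chr7:3954342..3962467' into its components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return GenomeLocation(match["chrom"], start, end)


loc = parse_location("chr7:3954342..3962467")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 8126 bp on chr7.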


Homology
GenBank top hits | e value | % identity | Alignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia] | 6.6e-174 | 70.97
Query:  VAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQDEEVSLAEVVKKTQK
        + + QLNVD EDED GELP+++H DEFE+EE+NDDISQYEV+VRTPVHESQQ DE+PP +EQEGTSGPVDV SEAMEESSSSSSQ               
Subjt:  VAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQDEEVSLAEVVKKTQK

Query:  KKKVAEIAPGAISGPRTQAAVARLAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVP
                 GA+S PRT+ AVARLAAQKEA+A                        EE DSTEQTPSRVKRVRLEVRRPTFTTRDILLERGF EAQE VP
Subjt:  KKKVAEIAPGAISGPRTQAAVARLAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVP

Query:  EYVRRRLMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVFNAIGNEILVHPSDKQVEEARRLICRPHKTWTIST
        EYVR+R++ENGWETLFA ITRVSE L KEFYTAINPNRGD VRVR                        GNEILVHPSD+QVEEARRLICRPHKTWTIST
Subjt:  EYVRRRLMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVFNAIGNEILVHPSDKQVEEARRLICRPHKTWTIST

Query:  TGKLSLKPLNINKQAIVWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKK
         GKLSLKPL+IN+QA VWMYVVKNRLIPTS+D SIKRNRAM+VYIL+KGVEFNF ELIRNEI+SCSEK+                 GVEA DANVVMPKK
Subjt:  TGKLSLKPLNINKQAIVWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKK

Query:  PFTSLRRVRGYSIVREEDSPITAADLETRGVVTREQYDELRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSSLLPTDSNDDESSDDE
        PF SLR+VRGYSIVREEDSPITAAD ETRGVVTREQYDELRHKYELLL TQRAT  FLKKI+GDEAPSFPDELA DLPSSS LPTDSNDDESSDDE
Subjt:  PFTSLRRVRGYSIVREEDSPITAADLETRGVVTREQYDELRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSSLLPTDSNDDESSDDE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia] | 2.5e-40 | 51.8
Query:  VHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQ-----------------------DEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVAR
        +HESQQDDE+   QEQEG SG VDV +EA+EESSSSSS+                       +E+V L +VVKK + KK + EI PGA S P T+A +A 
Subjt:  VHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQ-----------------------DEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVAR

Query:  LAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRRLMENGWETLFASITRVS
        LAAQKEA+A                        EE DS EQTPS+ KRVR EV+R  FT R+IL+E+GF EAQE VP+Y++RRL+ENGWETLFA   RVS
Subjt:  LAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRRLMENGWETLFASITRVS

Query:  EDLGKEFYTAINPNRGDVVRVR
        E L KEFY  INPNRGD +  R
Subjt:  EDLGKEFYTAINPNRGDVVRVR

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia] | 1.7e-52 | 34.81
Query:  VDREANSRTMEGSSSSKPYDKEKENKRVLLPPPTKPGMIPLEPPRSSHEKLVFDPREQRRKYEEAIRMNPRRNLSIAGSNSEKVNMESHDARVNKEGSSE
        VD   +  TMEGSS SKP DKE E K+V+LPPP  P                                                  E H ARVN+ G SE
Subjt:  VDREANSRTMEGSSSSKPYDKEKENKRVLLPPPTKPGMIPLEPPRSSHEKLVFDPREQRRKYEEAIRMNPRRNLSIAGSNSEKVNMESHDARVNKEGSSE

Query:  KRLGGVNKVYLRKNQSLEEKGAVLDDEIARLQERAEMFSKNNEIRDKENERVYEKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNLSQDNPVSE
        K+L G +KVYLRKNQS+ +K + LD+ IAR+ E+ ++ +K  EI DK+NE +  KI ELN KWQ FMENS+++SEEIQ+ELN                  
Subjt:  KRLGGVNKVYLRKNQSLEEKGAVLDDEIARLQERAEMFSKNNEIRDKENERVYEKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNLSQDNPVSE

Query:  SLELSIFPPLSTTVAVHVEGQEQVSRDLEHDTEPLEHSDSATVEIQSQIAPGAIMDETLPATLQGILSPSFPDPILTKMPLVLDDSEQERTTSRIAEILV
                                                                                              EQERTTS+I +ILV
Subjt:  SLELSIFPPLSTTVAVHVEGQEQVSRDLEHDTEPLEHSDSATVEIQSQIAPGAIMDETLPATLQGILSPSFPDPILTKMPLVLDDSEQERTTSRIAEILV

Query:  ALNEARGEDLLEDDGNSGVAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSS
        ALNEA GED LEDDGNS  AQ +LNVDGEDEDLG+LP+++H DE EEEEENDDISQYEVR+   VHESQ+D  + P +  EG S PVDV +EA  +SSSS
Subjt:  ALNEARGEDLLEDDGNSGVAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSS

Query:  SSQDEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVARLAAQKEAKAEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRR
        SS+D                                       +++E   EE  STEQ  S+ K                                    
Subjt:  SSQDEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVARLAAQKEAKAEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRR

Query:  LMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRG
                      RV E L KEFY AI+PN+GD VRVRG
Subjt:  LMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRG

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia] | 2.2e-04 | 69.05
Query:  GVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITAADLET
        G++A+D +VV PKK  TS+RRVRGY IVREEDS IT AD ET
Subjt:  GVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITAADLET

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia] | 4.6e-42 | 90.91
Query:  MPEQTQLLESFSLEQTMIDFMARTDESIRRLQIQVELMVNELRNSLPEAFPNN--EQGEAINERSELEDDELELPIDNDDPPIPEVINEVWKNKEVEPEK
        MPEQT LLE FSLEQTMIDFMARTDESIRRLQIQVELMV+ELRNS  EA P N  EQGEA NERSELEDDELELPIDNDDPPIPEVINEVWKNKEVEPEK
Subjt:  MPEQTQLLESFSLEQTMIDFMARTDESIRRLQIQVELMVNELRNSLPEAFPNN--EQGEAINERSELEDDELELPIDNDDPPIPEVINEVWKNKEVEPEK

Query:  PIEVMEHELT
        PIEVMEHELT
Subjt:  PIEVMEHELT

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia] | 1.1e-54 | 71.43
Query:  VWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKKPFTSLRRVRGYSIVRE
        +W YVVKN LI TS+D SI++ R M+VYILMKG+EFNF ELIRNEI  C+EKMVG L+F   I ELCL+ GVEAD  +VVM KK  TS+RRVRGY IVRE
Subjt:  VWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKKPFTSLRRVRGYSIVRE

Query:  EDSPITAADLETRGVVTREQYDE---LRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSS
        EDSPITAAD +TRGVVTREQYDE   LRH Y+LL  TQ AT EFLKK++GD APS PDELA DLPSSS
Subjt:  EDSPITAADLETRGVVTREQYDE---LRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSS
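
The % identity values reported for the hits above can be checked against the alignment midline. This is a rough sketch that assumes the usual BLAST convention: identity is the number of identical columns (shown as letters in the midline) divided by the total number of alignment columns, with `+` marking similar but non-identical residues. The function name `percent_identity` is hypothetical, and the example uses the 42-column alignment to XP_022158483.1 listed above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    Assumption: letters in the midline mark identities, '+' marks
    conservative substitutions, and spaces mark mismatches or gaps.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    if not query:
        raise ValueError("alignment is empty")
    identical = sum(1 for symbol in midline if symbol.isalpha())
    return round(100.0 * identical / len(query), 2)


query = "GVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITAADLET"
midline = "G++A+D +VV PKK  TS+RRVRGY IVREEDS IT AD ET"

print(percent_identity(query, midline))  # 29 identical of 42 columns -> 69.05
```

The result agrees with the 69.05 reported for that alignment. For multi-line alignments, the query and midline rows would need to be concatenated before calling the function.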

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 | 3.2e-174 | 70.97
Query:  VAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQDEEVSLAEVVKKTQK
        + + QLNVD EDED GELP+++H DEFE+EE+NDDISQYEV+VRTPVHESQQ DE+PP +EQEGTSGPVDV SEAMEESSSSSSQ               
Subjt:  VAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQDEEVSLAEVVKKTQK

Query:  KKKVAEIAPGAISGPRTQAAVARLAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVP
                 GA+S PRT+ AVARLAAQKEA+A                        EE DSTEQTPSRVKRVRLEVRRPTFTTRDILLERGF EAQE VP
Subjt:  KKKVAEIAPGAISGPRTQAAVARLAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVP

Query:  EYVRRRLMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVFNAIGNEILVHPSDKQVEEARRLICRPHKTWTIST
        EYVR+R++ENGWETLFA ITRVSE L KEFYTAINPNRGD VRVR                        GNEILVHPSD+QVEEARRLICRPHKTWTIST
Subjt:  EYVRRRLMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVFNAIGNEILVHPSDKQVEEARRLICRPHKTWTIST

Query:  TGKLSLKPLNINKQAIVWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKK
         GKLSLKPL+IN+QA VWMYVVKNRLIPTS+D SIKRNRAM+VYIL+KGVEFNF ELIRNEI+SCSEK+                 GVEA DANVVMPKK
Subjt:  TGKLSLKPLNINKQAIVWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKK

Query:  PFTSLRRVRGYSIVREEDSPITAADLETRGVVTREQYDELRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSSLLPTDSNDDESSDDE
        PF SLR+VRGYSIVREEDSPITAAD ETRGVVTREQYDELRHKYELLL TQRAT  FLKKI+GDEAPSFPDELA DLPSSS LPTDSNDDESSDDE
Subjt:  PFTSLRRVRGYSIVREEDSPITAADLETRGVVTREQYDELRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSSLLPTDSNDDESSDDE

A0A6J1DW11 uncharacterized protein LOC111023620 | 1.2e-40 | 51.8
Query:  VHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQ-----------------------DEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVAR
        +HESQQDDE+   QEQEG SG VDV +EA+EESSSSSS+                       +E+V L +VVKK + KK + EI PGA S P T+A +A 
Subjt:  VHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQ-----------------------DEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVAR

Query:  LAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRRLMENGWETLFASITRVS
        LAAQKEA+A                        EE DS EQTPS+ KRVR EV+R  FT R+IL+E+GF EAQE VP+Y++RRL+ENGWETLFA   RVS
Subjt:  LAAQKEAKA------------------------EETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRRLMENGWETLFASITRVS

Query:  EDLGKEFYTAINPNRGDVVRVR
        E L KEFY  INPNRGD +  R
Subjt:  EDLGKEFYTAINPNRGDVVRVR

A0A6J1DW79 uncharacterized protein LOC111024964 | 8.2e-53 | 34.81
Query:  VDREANSRTMEGSSSSKPYDKEKENKRVLLPPPTKPGMIPLEPPRSSHEKLVFDPREQRRKYEEAIRMNPRRNLSIAGSNSEKVNMESHDARVNKEGSSE
        VD   +  TMEGSS SKP DKE E K+V+LPPP  P                                                  E H ARVN+ G SE
Subjt:  VDREANSRTMEGSSSSKPYDKEKENKRVLLPPPTKPGMIPLEPPRSSHEKLVFDPREQRRKYEEAIRMNPRRNLSIAGSNSEKVNMESHDARVNKEGSSE

Query:  KRLGGVNKVYLRKNQSLEEKGAVLDDEIARLQERAEMFSKNNEIRDKENERVYEKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNLSQDNPVSE
        K+L G +KVYLRKNQS+ +K + LD+ IAR+ E+ ++ +K  EI DK+NE +  KI ELN KWQ FMENS+++SEEIQ+ELN                  
Subjt:  KRLGGVNKVYLRKNQSLEEKGAVLDDEIARLQERAEMFSKNNEIRDKENERVYEKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNLSQDNPVSE

Query:  SLELSIFPPLSTTVAVHVEGQEQVSRDLEHDTEPLEHSDSATVEIQSQIAPGAIMDETLPATLQGILSPSFPDPILTKMPLVLDDSEQERTTSRIAEILV
                                                                                              EQERTTS+I +ILV
Subjt:  SLELSIFPPLSTTVAVHVEGQEQVSRDLEHDTEPLEHSDSATVEIQSQIAPGAIMDETLPATLQGILSPSFPDPILTKMPLVLDDSEQERTTSRIAEILV

Query:  ALNEARGEDLLEDDGNSGVAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSS
        ALNEA GED LEDDGNS  AQ +LNVDGEDEDLG+LP+++H DE EEEEENDDISQYEVR+   VHESQ+D  + P +  EG S PVDV +EA  +SSSS
Subjt:  ALNEARGEDLLEDDGNSGVAQRQLNVDGEDEDLGELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSS

Query:  SSQDEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVARLAAQKEAKAEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRR
        SS+D                                       +++E   EE  STEQ  S+ K                                    
Subjt:  SSQDEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVARLAAQKEAKAEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRR

Query:  LMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRG
                      RV E L KEFY AI+PN+GD VRVRG
Subjt:  LMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRG

A0A6J1DW79 uncharacterized protein LOC111024964 | 1.1e-04 | 69.05
Query:  GVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITAADLET
        G++A+D +VV PKK  TS+RRVRGY IVREEDS IT AD ET
Subjt:  GVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITAADLET

A0A6J1DW79 uncharacterized protein LOC111024964 | 2.2e-42 | 90.91
Query:  MPEQTQLLESFSLEQTMIDFMARTDESIRRLQIQVELMVNELRNSLPEAFPNN--EQGEAINERSELEDDELELPIDNDDPPIPEVINEVWKNKEVEPEK
        MPEQT LLE FSLEQTMIDFMARTDESIRRLQIQVELMV+ELRNS  EA P N  EQGEA NERSELEDDELELPIDNDDPPIPEVINEVWKNKEVEPEK
Subjt:  MPEQTQLLESFSLEQTMIDFMARTDESIRRLQIQVELMVNELRNSLPEAFPNN--EQGEAINERSELEDDELELPIDNDDPPIPEVINEVWKNKEVEPEK

Query:  PIEVMEHELT
        PIEVMEHELT
Subjt:  PIEVMEHELT

A0A6J1E204 uncharacterized protein LOC111025702 | 5.1e-55 | 71.43
Query:  VWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKKPFTSLRRVRGYSIVRE
        +W YVVKN LI TS+D SI++ R M+VYILMKG+EFNF ELIRNEI  C+EKMVG L+F   I ELCL+ GVEAD  +VVM KK  TS+RRVRGY IVRE
Subjt:  VWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKMVGFLVFLRLITELCLQGGVEADDANVVMPKKPFTSLRRVRGYSIVRE

Query:  EDSPITAADLETRGVVTREQYDE---LRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSS
        EDSPITAAD +TRGVVTREQYDE   LRH Y+LL  TQ AT EFLKK++GD APS PDELA DLPSSS
Subjt:  EDSPITAADLETRGVVTREQYDE---LRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPSSS

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences

CDS sequence
ATGTATGCTCTTCTTGGAGGACATACATCAAAAAGATCTTCATTAGCAAAGTATTCATTTAAAGTACCTACCTCTTATTCTTGGGAACTGTCGGATATCTCTCTTGGACT
CATTGGTCTCCTTAGTGTGGATATTGGTGGGGATACCCTTTGGAGATGCTCACTTGAATGTTGGCTTGGTGGGGATATCCCGGCGCGTGCTGGTGCTGAGAGGCATGCGG
GGTGTTGGGCACTGGGCGGCAGAGGCGCTGAAGGAGGTAAGGACGCACAGACGCTTGGAGTGGGGCGCGCGAACAGACATGGCTGTGCGCATGGGTGTTGGACTAACAGG
AGGCGCGCGGACATGTGCGGCAGGCAGGGGTGCACATGGGTGCTGGGCAGGCGCTGGTGTGCAGGCATGAGCGGCAGGCGTGGGCGTGTAAGTGCGGGTGCAAGCAATGG
CGTGCAAGCTGGGGCGGCAGGCAATGGGTGCTTTGGCAGGCAGCACACGCATGGATTGTGTGCTGTGAGGGGGCGTGCTGTGAGGTTCTCGCAGCATGTTGGCGTGCGCA
GCAGGGGTGTGCGCCTAGCGATGCATGCGTGCGGGTGGCCTTTAGATAGGCTTGTCCTTGAATTAAGGCCACTTGCTTGGCTTCTCATGCTCTCCATGCTTTCTCTTGAC
CATAGTGGGTTCTTCATTGGCCTTCAAAATGGTCAAAATTTGTTGATGGGTCCAATAAAGCCTCCTTCGTTTAGTGGGTCTTTCCTCAAGCTTCTCTTTATTGTTGGCGT
AGGTGGCATTTTGGAGTGTTGGGTCAAATATGATCACATCAAACATCATCTGGTAGATGGGCAAAATCATCAAACTGGTTTGCCAACCATGGTTACCATAGTCTCTGCAG
CAACCAAAAGCTGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCGAACATAGTGTGTTCTCCATGTTTTGCATCAAAACT
ATGATCACAATTGCATATTTGAACTATTTCACTGCATTACTTGATTTTGAAAAGCAACAAGTCCCTGAAGATTCGACCTTGGATTACCTCCGAGTATTACTTGTGACCGT
GTGCACTTGCACGAGTCAACAAGGGAACCAACAAGAATTCAACCATTATTCAAATCCATACAACCAAGATTGGAGGGATCATTCGAATTTCCCGTGGGGAGAGCTTCAAC
CTGAAAGTATATTCCATCTATATGAGCATGCATTCCCACCAGAATTCCCGTCACAATCTCAACAGGAATACAATCAACCATGGATGCCAGAACAAACGCAATTATTGGAA
AGTTTTAGCCTTGAACAGACTATGATAGACTTTATGGCGAGAACTGATGAATCAATACGCAGGCTGCAAATCCAAGTGGAGTTGATGGTTAATGAGCTGAGGAATAGTCT
ACCAGAAGCTTTTCCAAACAATGAGCAAGGTGAGGCAATTAATGAAAGGAGCGAGCTTGAAGATGACGAACTTGAGCTCCCAATAGACAATGATGATCCACCAATCCCTG
AAGTGATAAATGAGGTATGGAAGAATAAGGAAGTTGAACCCGAGAAACCGATAGAGGTAATGGAGCATGAGCTTACAAGGTTTATTGGAGGGATTTTCGTTGTCGATCGT
GAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCATACGACAAAGAGAAGGAAAATAAGAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCC
ACTTGAACCTCCTAGGAGTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGAAAATATGAGGAAGCTATAAGAATGAACCCTAGGAGAAACCTATCCATAG
CTGGTTCAAATTCTGAAAAAGTGAATATGGAATCTCATGATGCTAGGGTTAATAAAGAAGGTTCTAGTGAAAAGAGATTAGGAGGTGTTAATAAAGTTTATCTTCGAAAA
AATCAATCTCTAGAAGAAAAAGGTGCTGTCTTAGATGATGAAATAGCTAGACTTCAAGAGAGAGCGGAGATGTTCAGTAAAAATAACGAAATTAGGGATAAAGAGAATGA
GAGAGTTTATGAGAAAATTGAGGAACTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATAC
GTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTTCCGAGTCTTTAGAACTGTCTATCTTTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAGAACAG
GTTAGTAGAGACTTAGAACACGACACGGAGCCCTTGGAGCACTCAGATTCGGCCACGGTCGAAATTCAAAGCCAAATTGCGCCTGGCGCAATTATGGATGAGACTCTACC
GGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAATGCCCCTAGTTCTTGATGATTCAGAACAAGAAAGGACAACATCGAGAATTGCCG
AAATTTTGGTAGCGTTGAATGAAGCAAGGGGAGAAGATCTATTAGAGGATGATGGAAATAGTGGGGTAGCACAAAGACAATTGAATGTTGATGGAGAGGATGAAGATCTT
GGAGAATTACCCCGAAAAATGCATGCAGATGAGTTTGAAGAGGAGGAAGAAAATGATGATATCTCTCAATATGAAGTGAGAGTACGAACTCCAGTGCATGAATCTCAGCA
AGATGATGAGAAGCCCCCTGCACAAGAGCAAGAAGGAACATCTGGTCCTGTCGATGTCCTTAGTGAGGCCATGGAGGAATCATCTTCCTCTTCTTCACAAGATGAGGAGG
TGAGTTTGGCTGAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGGCAGAAATTGCGCCTGGCGCAATTTCTGGGCCTAGGACCCAAGCCGCTGTAGCACGTTTGGCT
GCCCAAAAAGAAGCCAAGGCTGAGGAGACCGATTCTACCGAGCAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCACCTTCACAACACGTGATAT
CCTCCTTGAGAGAGGTTTTCATGAGGCCCAAGAGCTGGTGCCGGAATATGTTAGAAGGAGGCTTATGGAGAATGGTTGGGAGACATTGTTTGCCTCCATTACACGTGTAT
CAGAGGACTTGGGGAAAGAGTTTTACACTGCCATTAACCCAAACCGAGGGGATGTAGTGAGAGTACGAGGTAAAGTGGTAAAATTCTCGCCTTCCATTATTAATACTCAC
TATGGTTTGTTGGATGTTTTTAATGCCATAGGTAATGAAATTTTGGTGCATCCATCGGACAAGCAAGTGGAGGAGGCGCGTAGGCTTATTTGTAGACCACATAAGACATG
GACCATTTCAACCACAGGGAAGCTTTCCCTAAAGCCGCTTAACATCAACAAGCAAGCAATAGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATT
TTTCCATTAAGCGCAACAGGGCGATGATGGTGTACATTCTCATGAAGGGTGTTGAGTTCAACTTTGAGGAGCTCATAAGGAACGAGATTCGGAGTTGCTCCGAGAAAATG
GTAGGTTTTCTTGTTTTTCTTAGACTAATAACTGAGTTATGCTTGCAGGGGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGCCCAAGAAGCCGTTCACATCACTAAG
AAGAGTTCGGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACCGCTGCGGATCTCGAGACCCGAGGGGTGGTGACTAGGGAGCAATATGATGAGCTTAGGCACA
AGTACGAGCTTCTTTTGTTTACTCAACGTGCCACATATGAGTTCCTCAAGAAGATATTCGGTGATGAAGCACCTTCTTTCCCCGATGAGCTTGCGGTCGATTTACCATCT
TCTTCCCTTCTTCCTACAGATTCCAACGACGATGAGTCTTCCGATGATGAATAG

mRNA sequence
ATGTATGCTCTTCTTGGAGGACATACATCAAAAAGATCTTCATTAGCAAAGTATTCATTTAAAGTACCTACCTCTTATTCTTGGGAACTGTCGGATATCTCTCTTGGACT
CATTGGTCTCCTTAGTGTGGATATTGGTGGGGATACCCTTTGGAGATGCTCACTTGAATGTTGGCTTGGTGGGGATATCCCGGCGCGTGCTGGTGCTGAGAGGCATGCGG
GGTGTTGGGCACTGGGCGGCAGAGGCGCTGAAGGAGGTAAGGACGCACAGACGCTTGGAGTGGGGCGCGCGAACAGACATGGCTGTGCGCATGGGTGTTGGACTAACAGG
AGGCGCGCGGACATGTGCGGCAGGCAGGGGTGCACATGGGTGCTGGGCAGGCGCTGGTGTGCAGGCATGAGCGGCAGGCGTGGGCGTGTAAGTGCGGGTGCAAGCAATGG
CGTGCAAGCTGGGGCGGCAGGCAATGGGTGCTTTGGCAGGCAGCACACGCATGGATTGTGTGCTGTGAGGGGGCGTGCTGTGAGGTTCTCGCAGCATGTTGGCGTGCGCA
GCAGGGGTGTGCGCCTAGCGATGCATGCGTGCGGGTGGCCTTTAGATAGGCTTGTCCTTGAATTAAGGCCACTTGCTTGGCTTCTCATGCTCTCCATGCTTTCTCTTGAC
CATAGTGGGTTCTTCATTGGCCTTCAAAATGGTCAAAATTTGTTGATGGGTCCAATAAAGCCTCCTTCGTTTAGTGGGTCTTTCCTCAAGCTTCTCTTTATTGTTGGCGT
AGGTGGCATTTTGGAGTGTTGGGTCAAATATGATCACATCAAACATCATCTGGTAGATGGGCAAAATCATCAAACTGGTTTGCCAACCATGGTTACCATAGTCTCTGCAG
CAACCAAAAGCTGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCGAACATAGTGTGTTCTCCATGTTTTGCATCAAAACT
ATGATCACAATTGCATATTTGAACTATTTCACTGCATTACTTGATTTTGAAAAGCAACAAGTCCCTGAAGATTCGACCTTGGATTACCTCCGAGTATTACTTGTGACCGT
GTGCACTTGCACGAGTCAACAAGGGAACCAACAAGAATTCAACCATTATTCAAATCCATACAACCAAGATTGGAGGGATCATTCGAATTTCCCGTGGGGAGAGCTTCAAC
CTGAAAGTATATTCCATCTATATGAGCATGCATTCCCACCAGAATTCCCGTCACAATCTCAACAGGAATACAATCAACCATGGATGCCAGAACAAACGCAATTATTGGAA
AGTTTTAGCCTTGAACAGACTATGATAGACTTTATGGCGAGAACTGATGAATCAATACGCAGGCTGCAAATCCAAGTGGAGTTGATGGTTAATGAGCTGAGGAATAGTCT
ACCAGAAGCTTTTCCAAACAATGAGCAAGGTGAGGCAATTAATGAAAGGAGCGAGCTTGAAGATGACGAACTTGAGCTCCCAATAGACAATGATGATCCACCAATCCCTG
AAGTGATAAATGAGGTATGGAAGAATAAGGAAGTTGAACCCGAGAAACCGATAGAGGTAATGGAGCATGAGCTTACAAGGTTTATTGGAGGGATTTTCGTTGTCGATCGT
GAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCATACGACAAAGAGAAGGAAAATAAGAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCC
ACTTGAACCTCCTAGGAGTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGAAAATATGAGGAAGCTATAAGAATGAACCCTAGGAGAAACCTATCCATAG
CTGGTTCAAATTCTGAAAAAGTGAATATGGAATCTCATGATGCTAGGGTTAATAAAGAAGGTTCTAGTGAAAAGAGATTAGGAGGTGTTAATAAAGTTTATCTTCGAAAA
AATCAATCTCTAGAAGAAAAAGGTGCTGTCTTAGATGATGAAATAGCTAGACTTCAAGAGAGAGCGGAGATGTTCAGTAAAAATAACGAAATTAGGGATAAAGAGAATGA
GAGAGTTTATGAGAAAATTGAGGAACTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATAC
GTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTTCCGAGTCTTTAGAACTGTCTATCTTTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAGAACAG
GTTAGTAGAGACTTAGAACACGACACGGAGCCCTTGGAGCACTCAGATTCGGCCACGGTCGAAATTCAAAGCCAAATTGCGCCTGGCGCAATTATGGATGAGACTCTACC
GGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAATGCCCCTAGTTCTTGATGATTCAGAACAAGAAAGGACAACATCGAGAATTGCCG
AAATTTTGGTAGCGTTGAATGAAGCAAGGGGAGAAGATCTATTAGAGGATGATGGAAATAGTGGGGTAGCACAAAGACAATTGAATGTTGATGGAGAGGATGAAGATCTT
GGAGAATTACCCCGAAAAATGCATGCAGATGAGTTTGAAGAGGAGGAAGAAAATGATGATATCTCTCAATATGAAGTGAGAGTACGAACTCCAGTGCATGAATCTCAGCA
AGATGATGAGAAGCCCCCTGCACAAGAGCAAGAAGGAACATCTGGTCCTGTCGATGTCCTTAGTGAGGCCATGGAGGAATCATCTTCCTCTTCTTCACAAGATGAGGAGG
TGAGTTTGGCTGAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGGCAGAAATTGCGCCTGGCGCAATTTCTGGGCCTAGGACCCAAGCCGCTGTAGCACGTTTGGCT
GCCCAAAAAGAAGCCAAGGCTGAGGAGACCGATTCTACCGAGCAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCACCTTCACAACACGTGATAT
CCTCCTTGAGAGAGGTTTTCATGAGGCCCAAGAGCTGGTGCCGGAATATGTTAGAAGGAGGCTTATGGAGAATGGTTGGGAGACATTGTTTGCCTCCATTACACGTGTAT
CAGAGGACTTGGGGAAAGAGTTTTACACTGCCATTAACCCAAACCGAGGGGATGTAGTGAGAGTACGAGGTAAAGTGGTAAAATTCTCGCCTTCCATTATTAATACTCAC
TATGGTTTGTTGGATGTTTTTAATGCCATAGGTAATGAAATTTTGGTGCATCCATCGGACAAGCAAGTGGAGGAGGCGCGTAGGCTTATTTGTAGACCACATAAGACATG
GACCATTTCAACCACAGGGAAGCTTTCCCTAAAGCCGCTTAACATCAACAAGCAAGCAATAGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATT
TTTCCATTAAGCGCAACAGGGCGATGATGGTGTACATTCTCATGAAGGGTGTTGAGTTCAACTTTGAGGAGCTCATAAGGAACGAGATTCGGAGTTGCTCCGAGAAAATG
GTAGGTTTTCTTGTTTTTCTTAGACTAATAACTGAGTTATGCTTGCAGGGGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGCCCAAGAAGCCGTTCACATCACTAAG
AAGAGTTCGGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACCGCTGCGGATCTCGAGACCCGAGGGGTGGTGACTAGGGAGCAATATGATGAGCTTAGGCACA
AGTACGAGCTTCTTTTGTTTACTCAACGTGCCACATATGAGTTCCTCAAGAAGATATTCGGTGATGAAGCACCTTCTTTCCCCGATGAGCTTGCGGTCGATTTACCATCT
TCTTCCCTTCTTCCTACAGATTCCAACGACGATGAGTCTTCCGATGATGAATAG

Protein sequence
MYALLGGHTSKRSSLAKYSFKVPTSYSWELSDISLGLIGLLSVDIGGDTLWRCSLECWLGGDIPARAGAERHAGCWALGGRGAEGGKDAQTLGVGRANRHGCAHGCWTNR
RRADMCGRQGCTWVLGRRWCAGMSGRRGRVSAGASNGVQAGAAGNGCFGRQHTHGLCAVRGRAVRFSQHVGVRSRGVRLAMHACGWPLDRLVLELRPLAWLLMLSMLSLD
HSGFFIGLQNGQNLLMGPIKPPSFSGSFLKLLFIVGVGGILECWVKYDHIKHHLVDGQNHQTGLPTMVTIVSAATKSCSERVELKSQEKSGIAPGAFSEHSVFSMFCIKT
MITIAYLNYFTALLDFEKQQVPEDSTLDYLRVLLVTVCTCTSQQGNQQEFNHYSNPYNQDWRDHSNFPWGELQPESIFHLYEHAFPPEFPSQSQQEYNQPWMPEQTQLLE
SFSLEQTMIDFMARTDESIRRLQIQVELMVNELRNSLPEAFPNNEQGEAINERSELEDDELELPIDNDDPPIPEVINEVWKNKEVEPEKPIEVMEHELTRFIGGIFVVDR
EANSRTMEGSSSSKPYDKEKENKRVLLPPPTKPGMIPLEPPRSSHEKLVFDPREQRRKYEEAIRMNPRRNLSIAGSNSEKVNMESHDARVNKEGSSEKRLGGVNKVYLRK
NQSLEEKGAVLDDEIARLQERAEMFSKNNEIRDKENERVYEKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNLSQDNPVSESLELSIFPPLSTTVAVHVEGQEQ
VSRDLEHDTEPLEHSDSATVEIQSQIAPGAIMDETLPATLQGILSPSFPDPILTKMPLVLDDSEQERTTSRIAEILVALNEARGEDLLEDDGNSGVAQRQLNVDGEDEDL
GELPRKMHADEFEEEEENDDISQYEVRVRTPVHESQQDDEKPPAQEQEGTSGPVDVLSEAMEESSSSSSQDEEVSLAEVVKKTQKKKKVAEIAPGAISGPRTQAAVARLA
AQKEAKAEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFHEAQELVPEYVRRRLMENGWETLFASITRVSEDLGKEFYTAINPNRGDVVRVRGKVVKFSPSIINTH
YGLLDVFNAIGNEILVHPSDKQVEEARRLICRPHKTWTISTTGKLSLKPLNINKQAIVWMYVVKNRLIPTSHDFSIKRNRAMMVYILMKGVEFNFEELIRNEIRSCSEKM
VGFLVFLRLITELCLQGGVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITAADLETRGVVTREQYDELRHKYELLLFTQRATYEFLKKIFGDEAPSFPDELAVDLPS
SSLLPTDSNDDESSDDE