; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g08220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g08220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr3:5714077..5716439
RNA-Seq ExpressionMoc03g08220
SyntenyMoc03g08220
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]2.1e-15866.39Show/hide
Query:  QRQLNVDGDDEDLGELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFV
        + QLNVD +DED GELPQEVHGDE E+EE+NDD+SQYEV++RTPVHESQQVDEEPP KEQEG  GPVDVPSEAMEESSSSSSQG                
Subjt:  QRQLNVDGDDEDLGELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFV

Query:  ATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRP
                                       A++RPRTR AVARLAAQKEA                          +E DST+QTPSRVKRVRLEVRRP
Subjt:  ATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRP

Query:  NFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVSEALVKEFYAALNPNRGDVVRVWGKVVKFSLSIINTHYGLLLMKFLVHPSDEQVEEA
         FT RDILLERGFDEAQEPVPEYVR+++VENGWE+LFAP TRVSEALVKEFY A+NPNRGD VRV G                   + LVHPSDEQVEEA
Subjt:  NFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVSEALVKEFYAALNPNRGDVVRVWGKVVKFSLSIINTHYGLLLMKFLVHPSDEQVEEA

Query:  RRLICRPHKTWTVSTTKKLSLKPLDINEQATIWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQ
        RRLICRPHKTWT+ST  KLSLKPLDINEQAT+WMY+VKN+LIPTS+DSSIKRNRAM+VYILVKG+EFNFGELIRNEI+SCSEK+ G              
Subjt:  RRLICRPHKTWTVSTTKKLSLKPLDINEQATIWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQ

Query:  AAVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPETRGVVTREQYDELRHKYKLLLVTQRATCAIFKKI
          VEA DANVVMPKKPF SLR+VRGYSIVREEDSPIT ADPETRGVVTREQYDELRHKY+LLLVTQRATCA  KKI
Subjt:  AAVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPETRGVVTREQYDELRHKYKLLLVTQRATCAIFKKI

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]2.8e-5760.27Show/hide
Query:  VHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVAR
        +HESQQ DEE  V+EQEG  G VDVP+EA+EESSSSSS+GK+P L +LNVSDPNFVA A TS+E+V LTKVVKK + KK + +I   A +RP TR  +A 
Subjt:  VHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVAR

Query:  LAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVS
        LAAQKEA                          +E DS +QTPS+ KRVR EV+R NFTAR+IL+E+GFDEAQEPVP+Y++R+L+ENGWE+LFAPT RVS
Subjt:  LAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVS

Query:  EALVKEFYAALNPNRGDVV
        E LVKEFYA +NPNRGD +
Subjt:  EALVKEFYAALNPNRGDVV

XP_022156935.1 uncharacterized protein LOC111023761 [Momordica charantia]7.4e-3451.98Show/hide
Query:  QVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDLGELPQEVHGDEVE
        +VSGDSEHD EPLEHSDS TV+I+ QI PS I+         E     + E+LV+LNEARGEDPL+DDGNSG                            
Subjt:  QVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDLGELPQEVHGDEVE

Query:  EEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVA
                               Q DEEP  +EQEG  GP+DV SEAMEESSSS SQ KT  L +LNVSDPNFVATAE SDEEV+L KVVKKTQKKKKVA
Subjt:  EEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVA

Query:  KI
        +I
Subjt:  KI

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]8.4e-5435.42Show/hide
Query:  MEGSSSSKPHDKEKENKRMLLPPPTKPGMIPLEPPWISHEKLVFDSREQRRKYEDAIRMNPRRNLSIGGTIYEKINMESNDATVNKEGSSEKKLGGVNKV
        MEGSS SKP DKE E K+++LPPP  P                                                  E + A VN+ G SEKKL G +KV
Subjt:  MEGSSSSKPHDKEKENKRMLLPPPTKPGMIPLEPPWISHEKLVFDSREQRRKYEDAIRMNPRRNLSIGGTIYEKINMESNDATVNKEGSSEKKLGGVNKV

Query:  YLRKNQSLEEKGAVLDEEIARLQERVEIFSKNNEIRDKENERVYAKIEELNIKWQAFMENSKKVSEEIQLELNSMSIRHRMNLSQDNPISNSLELSIPLP
        YLRKNQS+ +K + LDE IAR+ E+V+I +K  EI DK+NE + AKI ELN KWQ FMENS+++SEEIQ+ELN                           
Subjt:  YLRKNQSLEEKGAVLDEEIARLQERVEIFSKNNEIRDKENERVYAKIEELNIKWQAFMENSKKVSEEIQLELNSMSIRHRMNLSQDNPISNSLELSIPLP

Query:  ISTTIAVQVEGQEQVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDL
                                                              +ERTT +I +ILV+LNEA GEDPLEDDGNS  AQ +LNVDG+DEDL
Subjt:  ISTTIAVQVEGQEQVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDL

Query:  GELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLT
        G+LPQEVHGDE EEEEENDD+SQYEVR+   VHESQ+   E P++  EG   PVDVP+EA  +SSSSSS+                    + S EEV+  
Subjt:  GELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLT

Query:  KVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEAQETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAP
                                              +E  ST+Q  S+ K                                                
Subjt:  KVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEAQETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAP

Query:  TTRVSEALVKEFYAALNPNRGDVVRVWG
          RV EALVKEFYAA++PN+GD VRV G
Subjt:  TTRVSEALVKEFYAALNPNRGDVVRVWG

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]9.4e-0570.73Show/hide
Query:  VEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPET
        ++A+D +VV PKK  TS+RRVRGY IVREEDS IT ADPET
Subjt:  VEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPET

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]1.8e-4870.27Show/hide
Query:  IWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQAAVEADDANVVMPKKPFTSLRRVRGYSIVRE
        +W Y+VKN LI TS+DSSI++ R M+VYIL+KGIEFNF ELIRNEI  C+EKMVGPL+FP  I ELCL+A VEAD  +VVM KK  TS+RRVRGY IVRE
Subjt:  IWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQAAVEADDANVVMPKKPFTSLRRVRGYSIVRE

Query:  EDSPITTADPETRGVVTREQYDE---LRHKYKLLLVTQRATCAIFKKI
        EDSPIT ADP+TRGVVTREQYDE   LRH Y LL  TQ ATC   KK+
Subjt:  EDSPITTADPETRGVVTREQYDE---LRHKYKLLLVTQRATCAIFKKI

TrEMBL top hitse value%identityAlignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220071.0e-15866.39Show/hide
Query:  QRQLNVDGDDEDLGELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFV
        + QLNVD +DED GELPQEVHGDE E+EE+NDD+SQYEV++RTPVHESQQVDEEPP KEQEG  GPVDVPSEAMEESSSSSSQG                
Subjt:  QRQLNVDGDDEDLGELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFV

Query:  ATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRP
                                       A++RPRTR AVARLAAQKEA                          +E DST+QTPSRVKRVRLEVRRP
Subjt:  ATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRP

Query:  NFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVSEALVKEFYAALNPNRGDVVRVWGKVVKFSLSIINTHYGLLLMKFLVHPSDEQVEEA
         FT RDILLERGFDEAQEPVPEYVR+++VENGWE+LFAP TRVSEALVKEFY A+NPNRGD VRV G                   + LVHPSDEQVEEA
Subjt:  NFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVSEALVKEFYAALNPNRGDVVRVWGKVVKFSLSIINTHYGLLLMKFLVHPSDEQVEEA

Query:  RRLICRPHKTWTVSTTKKLSLKPLDINEQATIWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQ
        RRLICRPHKTWT+ST  KLSLKPLDINEQAT+WMY+VKN+LIPTS+DSSIKRNRAM+VYILVKG+EFNFGELIRNEI+SCSEK+ G              
Subjt:  RRLICRPHKTWTVSTTKKLSLKPLDINEQATIWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQ

Query:  AAVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPETRGVVTREQYDELRHKYKLLLVTQRATCAIFKKI
          VEA DANVVMPKKPF SLR+VRGYSIVREEDSPIT ADPETRGVVTREQYDELRHKY+LLLVTQRATCA  KKI
Subjt:  AAVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPETRGVVTREQYDELRHKYKLLLVTQRATCAIFKKI

A0A6J1DRR9 uncharacterized protein LOC1110237613.6e-3451.98Show/hide
Query:  QVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDLGELPQEVHGDEVE
        +VSGDSEHD EPLEHSDS TV+I+ QI PS I+         E     + E+LV+LNEARGEDPL+DDGNSG                            
Subjt:  QVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDLGELPQEVHGDEVE

Query:  EEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVA
                               Q DEEP  +EQEG  GP+DV SEAMEESSSS SQ KT  L +LNVSDPNFVATAE SDEEV+L KVVKKTQKKKKVA
Subjt:  EEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVA

Query:  KI
        +I
Subjt:  KI

A0A6J1DW11 uncharacterized protein LOC1110236201.4e-5760.27Show/hide
Query:  VHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVAR
        +HESQQ DEE  V+EQEG  G VDVP+EA+EESSSSSS+GK+P L +LNVSDPNFVA A TS+E+V LTKVVKK + KK + +I   A +RP TR  +A 
Subjt:  VHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVAKIASDAITRPRTRVAVAR

Query:  LAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVS
        LAAQKEA                          +E DS +QTPS+ KRVR EV+R NFTAR+IL+E+GFDEAQEPVP+Y++R+L+ENGWE+LFAPT RVS
Subjt:  LAAQKEA--------------------------QETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVS

Query:  EALVKEFYAALNPNRGDVV
        E LVKEFYA +NPNRGD +
Subjt:  EALVKEFYAALNPNRGDVV

A0A6J1DW79 uncharacterized protein LOC1110249644.1e-5435.42Show/hide
Query:  MEGSSSSKPHDKEKENKRMLLPPPTKPGMIPLEPPWISHEKLVFDSREQRRKYEDAIRMNPRRNLSIGGTIYEKINMESNDATVNKEGSSEKKLGGVNKV
        MEGSS SKP DKE E K+++LPPP  P                                                  E + A VN+ G SEKKL G +KV
Subjt:  MEGSSSSKPHDKEKENKRMLLPPPTKPGMIPLEPPWISHEKLVFDSREQRRKYEDAIRMNPRRNLSIGGTIYEKINMESNDATVNKEGSSEKKLGGVNKV

Query:  YLRKNQSLEEKGAVLDEEIARLQERVEIFSKNNEIRDKENERVYAKIEELNIKWQAFMENSKKVSEEIQLELNSMSIRHRMNLSQDNPISNSLELSIPLP
        YLRKNQS+ +K + LDE IAR+ E+V+I +K  EI DK+NE + AKI ELN KWQ FMENS+++SEEIQ+ELN                           
Subjt:  YLRKNQSLEEKGAVLDEEIARLQERVEIFSKNNEIRDKENERVYAKIEELNIKWQAFMENSKKVSEEIQLELNSMSIRHRMNLSQDNPISNSLELSIPLP

Query:  ISTTIAVQVEGQEQVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDL
                                                              +ERTT +I +ILV+LNEA GEDPLEDDGNS  AQ +LNVDG+DEDL
Subjt:  ISTTIAVQVEGQEQVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDL

Query:  GELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLT
        G+LPQEVHGDE EEEEENDD+SQYEVR+   VHESQ+   E P++  EG   PVDVP+EA  +SSSSSS+                    + S EEV+  
Subjt:  GELPQEVHGDEVEEEEENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLT

Query:  KVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEAQETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAP
                                              +E  ST+Q  S+ K                                                
Subjt:  KVVKKTQKKKKVAKIASDAITRPRTRVAVARLAAQKEAQETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAP

Query:  TTRVSEALVKEFYAALNPNRGDVVRVWG
          RV EALVKEFYAA++PN+GD VRV G
Subjt:  TTRVSEALVKEFYAALNPNRGDVVRVWG

A0A6J1DW79 uncharacterized protein LOC1110249644.6e-0570.73Show/hide
Query:  VEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPET
        ++A+D +VV PKK  TS+RRVRGY IVREEDS IT ADPET
Subjt:  VEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPET

A0A6J1DW79 uncharacterized protein LOC1110249648.8e-4970.27Show/hide
Query:  IWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQAAVEADDANVVMPKKPFTSLRRVRGYSIVRE
        +W Y+VKN LI TS+DSSI++ R M+VYIL+KGIEFNF ELIRNEI  C+EKMVGPL+FP  I ELCL+A VEAD  +VVM KK  TS+RRVRGY IVRE
Subjt:  IWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQAAVEADDANVVMPKKPFTSLRRVRGYSIVRE

Query:  EDSPITTADPETRGVVTREQYDE---LRHKYKLLLVTQRATCAIFKKI
        EDSPIT ADP+TRGVVTREQYDE   LRH Y LL  TQ ATC   KK+
Subjt:  EDSPITTADPETRGVVTREQYDE---LRHKYKLLLVTQRATCAIFKKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTTCATCTTCCTCCAAGCCACACGACAAAGAGAAGGAGAATAAGAGAATGTTGTTGCCTCCACCAACTAAACCGGGTATGATTCCTCTTGAACCTCCT
TGGATTTCTCATGAAAAATTAGTTTTTGATTCTAGGGAACAAAGAAGAAAATATGAGGATGCTATAAGAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACA
ATTTATGAAAAAATTAATATGGAATCTAATGATGCTACAGTTAATAAAGAAGGTTCTAGTGAAAAGAAATTAGGAGGAGTTAATAAAGTTTATCTTCGAAAAAAT
CAATCTCTAGAGGAAAAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTCAAGAGAGAGTGGAGATTTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAAT
GAGAGGGTTTATGCAAAAATTGAGGAACTAAACATAAAATGGCAAGCATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATG
AGTATACGTCATAGGATGAATCTTTCTCAAGATAACCCCATTTCCAACTCTTTAGAACTGTCTATCCCTCTCCCTATTTCCACTACTATTGCTGTGCAGGTTGAA
GGTCAAGAACAGGTTAGTGGAGACTCAGAACACGACACGGAGCCCTTGGAGCACTCAGATTCGACCACGGTCGAAATTCAGAGCCAAATTGAGCCTAGCGCAATT
TTGGATAGACTCCACCAGCCACTCTACAAGGAAAGGACAACGCTGAGAATTGCTGAAATTTTAGTGTCATTGAATGAAGCAAGGGGAGAGGATCCATTAGAGGAT
GATGGAAACAGTGGGGCAGCACAAAGACAATTGAATGTTGATGGAGATGATGAAGATCTTGGAGAATTACCCCAAGAAGTGCATGGAGATGAGGTTGAAGAGGAA
GAAGAAAATGATGATGTCTCTCAATATGAAGTGAGACTACGAACTCCGGTGCACGAATCTCAGCAAGTTGATGAGGAGCCCCCTGTAAAAGAGCAAGAAGGAAAA
TTCGGCCCTGTGGATGTCCCTAGTGAGGCCATGGAGGAATCATCTTCCTCTTCTTCACAAGGTAAGACCCCTTATTTGTACAATTTGAATGTTTCTGACCCAAAC
TTTGTTGCTACTGCAGAGACTTCAGATGAGGAGGTGAGTTTGACCAAAGTGGTAAAGAAAACGCAAAAGAAGAAAAAAGTGGCAAAAATTGCGTCAGACGCAATT
ACTAGGCCTAGGACCCGCGTCGCTGTAGCACGTTTGGCTGCCCAAAAAGAAGCCCAGGAGACCGATTCTACCCAACAAACACCATCAAGAGTAAAAAGGGTGAGA
TTAGAGGTGCGAAGGCCCAACTTCACAGCACGTGATATCCTCCTTGAGAGGGGCTTTGATGAAGCACAAGAGCCCGTGCCAGAATATGTTAGGAGGAAGCTTGTG
GAGAATGGTTGGGAGTCGTTGTTTGCCCCAACTACACGTGTATCGGAGGCCTTGGTGAAGGAGTTTTATGCTGCCCTCAATCCCAACCGAGGGGATGTAGTGAGA
GTATGGGGTAAAGTGGTAAAATTCTCACTTTCCATTATTAATACTCACTATGGTTTGTTGTTAATGAAATTTTTAGTGCATCCATCGGACGAGCAAGTGGAGGAG
GCACGTAGACTTATTTGTAGACCACATAAGACATGGACCGTCTCAACCACGAAGAAGCTTTCCTTAAAGCCCCTTGACATCAATGAGCAAGCGACAATATGGATG
TATATGGTGAAGAACCAGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATAGAGCGATGGTGGTGTACATTCTCGTGAAGGGCATTGAGTTCAACTTT
GGGGAGCTCATAAGGAACGAGATTCGGAGTTGCTCTGAGAAAATGGTAGGTCCTCTTGTTTTTCCTGGACTAATAACTGAGTTATGCTTGCAGGCGGCAGTGGAA
GCTGATGATGCCAATGTTGTGATGCCCAAGAAGCCGTTCACATCCCTAAGAAGAGTTCGGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACTACTGCG
GATCCCGAGACCCGAGGGGTGGTGACTAGGGAGCAGTATGATGAGCTTAGGCACAAGTATAAGCTTCTTTTAGTTACTCAACGTGCCACATGTGCTATCTTCAAG
AAGATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTTCATCTTCCTCCAAGCCACACGACAAAGAGAAGGAGAATAAGAGAATGTTGTTGCCTCCACCAACTAAACCGGGTATGATTCCTCTTGAACCTCCT
TGGATTTCTCATGAAAAATTAGTTTTTGATTCTAGGGAACAAAGAAGAAAATATGAGGATGCTATAAGAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACA
ATTTATGAAAAAATTAATATGGAATCTAATGATGCTACAGTTAATAAAGAAGGTTCTAGTGAAAAGAAATTAGGAGGAGTTAATAAAGTTTATCTTCGAAAAAAT
CAATCTCTAGAGGAAAAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTCAAGAGAGAGTGGAGATTTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAAT
GAGAGGGTTTATGCAAAAATTGAGGAACTAAACATAAAATGGCAAGCATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATG
AGTATACGTCATAGGATGAATCTTTCTCAAGATAACCCCATTTCCAACTCTTTAGAACTGTCTATCCCTCTCCCTATTTCCACTACTATTGCTGTGCAGGTTGAA
GGTCAAGAACAGGTTAGTGGAGACTCAGAACACGACACGGAGCCCTTGGAGCACTCAGATTCGACCACGGTCGAAATTCAGAGCCAAATTGAGCCTAGCGCAATT
TTGGATAGACTCCACCAGCCACTCTACAAGGAAAGGACAACGCTGAGAATTGCTGAAATTTTAGTGTCATTGAATGAAGCAAGGGGAGAGGATCCATTAGAGGAT
GATGGAAACAGTGGGGCAGCACAAAGACAATTGAATGTTGATGGAGATGATGAAGATCTTGGAGAATTACCCCAAGAAGTGCATGGAGATGAGGTTGAAGAGGAA
GAAGAAAATGATGATGTCTCTCAATATGAAGTGAGACTACGAACTCCGGTGCACGAATCTCAGCAAGTTGATGAGGAGCCCCCTGTAAAAGAGCAAGAAGGAAAA
TTCGGCCCTGTGGATGTCCCTAGTGAGGCCATGGAGGAATCATCTTCCTCTTCTTCACAAGGTAAGACCCCTTATTTGTACAATTTGAATGTTTCTGACCCAAAC
TTTGTTGCTACTGCAGAGACTTCAGATGAGGAGGTGAGTTTGACCAAAGTGGTAAAGAAAACGCAAAAGAAGAAAAAAGTGGCAAAAATTGCGTCAGACGCAATT
ACTAGGCCTAGGACCCGCGTCGCTGTAGCACGTTTGGCTGCCCAAAAAGAAGCCCAGGAGACCGATTCTACCCAACAAACACCATCAAGAGTAAAAAGGGTGAGA
TTAGAGGTGCGAAGGCCCAACTTCACAGCACGTGATATCCTCCTTGAGAGGGGCTTTGATGAAGCACAAGAGCCCGTGCCAGAATATGTTAGGAGGAAGCTTGTG
GAGAATGGTTGGGAGTCGTTGTTTGCCCCAACTACACGTGTATCGGAGGCCTTGGTGAAGGAGTTTTATGCTGCCCTCAATCCCAACCGAGGGGATGTAGTGAGA
GTATGGGGTAAAGTGGTAAAATTCTCACTTTCCATTATTAATACTCACTATGGTTTGTTGTTAATGAAATTTTTAGTGCATCCATCGGACGAGCAAGTGGAGGAG
GCACGTAGACTTATTTGTAGACCACATAAGACATGGACCGTCTCAACCACGAAGAAGCTTTCCTTAAAGCCCCTTGACATCAATGAGCAAGCGACAATATGGATG
TATATGGTGAAGAACCAGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATAGAGCGATGGTGGTGTACATTCTCGTGAAGGGCATTGAGTTCAACTTT
GGGGAGCTCATAAGGAACGAGATTCGGAGTTGCTCTGAGAAAATGGTAGGTCCTCTTGTTTTTCCTGGACTAATAACTGAGTTATGCTTGCAGGCGGCAGTGGAA
GCTGATGATGCCAATGTTGTGATGCCCAAGAAGCCGTTCACATCCCTAAGAAGAGTTCGGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACTACTGCG
GATCCCGAGACCCGAGGGGTGGTGACTAGGGAGCAGTATGATGAGCTTAGGCACAAGTATAAGCTTCTTTTAGTTACTCAACGTGCCACATGTGCTATCTTCAAG
AAGATATAA
Protein sequenceShow/hide protein sequence
MEGSSSSKPHDKEKENKRMLLPPPTKPGMIPLEPPWISHEKLVFDSREQRRKYEDAIRMNPRRNLSIGGTIYEKINMESNDATVNKEGSSEKKLGGVNKVYLRKN
QSLEEKGAVLDEEIARLQERVEIFSKNNEIRDKENERVYAKIEELNIKWQAFMENSKKVSEEIQLELNSMSIRHRMNLSQDNPISNSLELSIPLPISTTIAVQVE
GQEQVSGDSEHDTEPLEHSDSTTVEIQSQIEPSAILDRLHQPLYKERTTLRIAEILVSLNEARGEDPLEDDGNSGAAQRQLNVDGDDEDLGELPQEVHGDEVEEE
EENDDVSQYEVRLRTPVHESQQVDEEPPVKEQEGKFGPVDVPSEAMEESSSSSSQGKTPYLYNLNVSDPNFVATAETSDEEVSLTKVVKKTQKKKKVAKIASDAI
TRPRTRVAVARLAAQKEAQETDSTQQTPSRVKRVRLEVRRPNFTARDILLERGFDEAQEPVPEYVRRKLVENGWESLFAPTTRVSEALVKEFYAALNPNRGDVVR
VWGKVVKFSLSIINTHYGLLLMKFLVHPSDEQVEEARRLICRPHKTWTVSTTKKLSLKPLDINEQATIWMYMVKNQLIPTSHDSSIKRNRAMVVYILVKGIEFNF
GELIRNEIRSCSEKMVGPLVFPGLITELCLQAAVEADDANVVMPKKPFTSLRRVRGYSIVREEDSPITTADPETRGVVTREQYDELRHKYKLLLVTQRATCAIFK
KI