; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g18480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g18480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr11:14104255..14106360
RNA-Seq ExpressionMoc11g18480
SyntenyMoc11g18480
Gene Ontology termsGO:0004386 - helicase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]1.3e-14167.35Show/hide
Query:  EDPLEDDKNMHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEP
        ED  E  + +HGDEFEDEE+NDDISQYEV+V+TPVHESQQVDEEPP +EQEGTSGPVD+PSEAMEES SSSSQG                         P
Subjt:  EDPLEDDKNMHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEP

Query:  RTHTAVARLATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESL
        RT TAVARLA QKE E GPSKKAK A+VQR AEE LEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDE QEPVPEYVR+R+V+NG E+L
Subjt:  RTHTAVARLATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESL

Query:  FAPTTCVSEAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWT------------------------------------------
        FAP T VSEA+VKEFYTAINPNRGD VRVRGNEILVH  DEQVEEARRLICRPHKTWT                                          
Subjt:  FAPTTCVSEAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWT------------------------------------------

Query:  ------------------------------AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDELRHKYELFLVTQRATC
                                      AGVE  DANVVMPKK F SLR+VRGYSIVREEDSPIT AD ETRGVVTRE YDELRHKYEL LVTQRATC
Subjt:  ------------------------------AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDELRHKYELFLVTQRATC

Query:  AFLKKIYGDEAPSFPDELAADLPSFSRFPTDSTNDESSDDE
        AFLKKIYGDEAPSFPDELAADLPS SR PTDS +DESSDDE
Subjt:  AFLKKIYGDEAPSFPDELAADLPSFSRFPTDSTNDESSDDE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]1.1e-5358.11Show/hide
Query:  VHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEE--------------------------PRTHTAVAR
        +HESQQ DEE   QEQEG SG VD+P+EA+EES SSSS+GK+PSLSSLNVSDPNFV+   TS+E+                          P T   +A 
Subjt:  VHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEE--------------------------PRTHTAVAR

Query:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS
        LA QKE E GP KKAKR K  R +EE L+E N+EE DS EQTPS+ KRVR EV+R  FT R+IL+E+GFDE QEPVP+Y++RRL++NG E+LFAPT  VS
Subjt:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS

Query:  EAMVKEFYTAINPNRGDAVRVR
        E +VKEFY  INPNRGDA+  R
Subjt:  EAMVKEFYTAINPNRGDAVRVR

XP_022156935.1 uncharacterized protein LOC111023761 [Momordica charantia]8.7e-3250.24Show/hide
Query:  QVSGDSEHVMEPLEHSDLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKNMHGDEF
        +VSGDSEH MEPLEHSD ATV+I+CQIAPS IM ETPP +LQ                                E+LVALNEARGEDPL+DD N      
Subjt:  QVSGDSEHVMEPLEHSDLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKNMHGDEF

Query:  EDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVARLATQKEV
                              S Q DEEP AQEQEGTSGP+D+ SEAMEES SS SQ KT SLSSLNVSDPNFV+T E SDEE      V +   +K+V
Subjt:  EDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVARLATQKEV

Query:  -EVGPSK
         E+ P++
Subjt:  -EVGPSK

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]2.1e-3331.24Show/hide
Query:  SKHNEIRDKENEGVYAKIEELNIKWQEFMENSKKVSEEIQLELNRMSIRRRMNLSQDNPVTESLELSIPPLLSTIVAVHVEGQEQVSGDSEHVMEPLEHS
        +K  EI DK+NE + AKI ELN KWQ FMENS+++SEEIQ+ELN                                                        
Subjt:  SKHNEIRDKENEGVYAKIEELNIKWQEFMENSKKVSEEIQLELNRMSIRRRMNLSQDNPVTESLELSIPPLLSTIVAVHVEGQEQVSGDSEHVMEPLEHS

Query:  DLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKN----------------------
                                                        EQERTT KI +ILVALNEA GEDPLEDD N                      
Subjt:  DLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKN----------------------

Query:  -MHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVAR
         +HGDE E+EEENDDISQYEVR+   VHESQ+   E P +  EG S PVD+P+EA  +S SSSS+  +                                
Subjt:  -MHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVAR

Query:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS
                                    EE NEEEP STEQ  S+ KR                                                  V 
Subjt:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS

Query:  EAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWTAGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRET
        EA+VKEFY AI+PN+GDAVRVR                             G++ +D +VV PKK  TS+RRVRGY IVREEDS IT AD ET
Subjt:  EAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWTAGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRET

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia]2.0e-2868.14Show/hide
Query:  AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDE---LRHKYELFLVTQRATCAFLKKIYGDEAPSFPDELAADLPSFSR
        AGVE D  +VVM KK  TS+RRVRGY IVREEDSPIT AD +TRGVVTRE YDE   LRH Y+L   TQ ATC FLKK+YGD APS PDELAADLPS SR
Subjt:  AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDE---LRHKYELFLVTQRATCAFLKKIYGDEAPSFPDELAADLPSFSR

Query:  FPTDSTNDESSDD
             T D+S  D
Subjt:  FPTDSTNDESSDD

TrEMBL top hitse value%identityAlignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220076.5e-14267.35Show/hide
Query:  EDPLEDDKNMHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEP
        ED  E  + +HGDEFEDEE+NDDISQYEV+V+TPVHESQQVDEEPP +EQEGTSGPVD+PSEAMEES SSSSQG                         P
Subjt:  EDPLEDDKNMHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEP

Query:  RTHTAVARLATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESL
        RT TAVARLA QKE E GPSKKAK A+VQR AEE LEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDE QEPVPEYVR+R+V+NG E+L
Subjt:  RTHTAVARLATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESL

Query:  FAPTTCVSEAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWT------------------------------------------
        FAP T VSEA+VKEFYTAINPNRGD VRVRGNEILVH  DEQVEEARRLICRPHKTWT                                          
Subjt:  FAPTTCVSEAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWT------------------------------------------

Query:  ------------------------------AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDELRHKYELFLVTQRATC
                                      AGVE  DANVVMPKK F SLR+VRGYSIVREEDSPIT AD ETRGVVTRE YDELRHKYEL LVTQRATC
Subjt:  ------------------------------AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDELRHKYELFLVTQRATC

Query:  AFLKKIYGDEAPSFPDELAADLPSFSRFPTDSTNDESSDDE
        AFLKKIYGDEAPSFPDELAADLPS SR PTDS +DESSDDE
Subjt:  AFLKKIYGDEAPSFPDELAADLPSFSRFPTDSTNDESSDDE

A0A6J1DRR9 uncharacterized protein LOC1110237614.2e-3250.24Show/hide
Query:  QVSGDSEHVMEPLEHSDLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKNMHGDEF
        +VSGDSEH MEPLEHSD ATV+I+CQIAPS IM ETPP +LQ                                E+LVALNEARGEDPL+DD N      
Subjt:  QVSGDSEHVMEPLEHSDLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKNMHGDEF

Query:  EDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVARLATQKEV
                              S Q DEEP AQEQEGTSGP+D+ SEAMEES SS SQ KT SLSSLNVSDPNFV+T E SDEE      V +   +K+V
Subjt:  EDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVARLATQKEV

Query:  -EVGPSK
         E+ P++
Subjt:  -EVGPSK

A0A6J1DW11 uncharacterized protein LOC1110236205.1e-5458.11Show/hide
Query:  VHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEE--------------------------PRTHTAVAR
        +HESQQ DEE   QEQEG SG VD+P+EA+EES SSSS+GK+PSLSSLNVSDPNFV+   TS+E+                          P T   +A 
Subjt:  VHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEE--------------------------PRTHTAVAR

Query:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS
        LA QKE E GP KKAKR K  R +EE L+E N+EE DS EQTPS+ KRVR EV+R  FT R+IL+E+GFDE QEPVP+Y++RRL++NG E+LFAPT  VS
Subjt:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS

Query:  EAMVKEFYTAINPNRGDAVRVR
        E +VKEFY  INPNRGDA+  R
Subjt:  EAMVKEFYTAINPNRGDAVRVR

A0A6J1DW79 uncharacterized protein LOC1110249641.0e-3331.24Show/hide
Query:  SKHNEIRDKENEGVYAKIEELNIKWQEFMENSKKVSEEIQLELNRMSIRRRMNLSQDNPVTESLELSIPPLLSTIVAVHVEGQEQVSGDSEHVMEPLEHS
        +K  EI DK+NE + AKI ELN KWQ FMENS+++SEEIQ+ELN                                                        
Subjt:  SKHNEIRDKENEGVYAKIEELNIKWQEFMENSKKVSEEIQLELNRMSIRRRMNLSQDNPVTESLELSIPPLLSTIVAVHVEGQEQVSGDSEHVMEPLEHS

Query:  DLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKN----------------------
                                                        EQERTT KI +ILVALNEA GEDPLEDD N                      
Subjt:  DLATVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKN----------------------

Query:  -MHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVAR
         +HGDE E+EEENDDISQYEVR+   VHESQ+   E P +  EG S PVD+P+EA  +S SSSS+  +                                
Subjt:  -MHGDEFEDEEENDDISQYEVRVQTPVHESQQVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVAR

Query:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS
                                    EE NEEEP STEQ  S+ KR                                                  V 
Subjt:  LATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVS

Query:  EAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWTAGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRET
        EA+VKEFY AI+PN+GDAVRVR                             G++ +D +VV PKK  TS+RRVRGY IVREEDS IT AD ET
Subjt:  EAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRLICRPHKTWTAGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRET

A0A6J1E204 uncharacterized protein LOC1110257029.7e-2968.14Show/hide
Query:  AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDE---LRHKYELFLVTQRATCAFLKKIYGDEAPSFPDELAADLPSFSR
        AGVE D  +VVM KK  TS+RRVRGY IVREEDSPIT AD +TRGVVTRE YDE   LRH Y+L   TQ ATC FLKK+YGD APS PDELAADLPS SR
Subjt:  AGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDE---LRHKYELFLVTQRATCAFLKKIYGDEAPSFPDELAADLPSFSR

Query:  FPTDSTNDESSDD
             T D+S  D
Subjt:  FPTDSTNDESSDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAGTAAACATAACGAAATTAGGGATAAAGAGAATGAGGGGGTTTATGCGAAAATTGAGGAGTTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAG
AAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGAATGAGTATACGTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTACCGAGTCTTTAGAACTGTCTATC
CCTCCCCTTCTTTCCACTATTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGTTAGTGGAGACTCAGAACACGTCATGGAGCCCTTGGAGCATTCAGATTTGGCC
ACGGTCGAAATTCAATGCCAAATTGCGCCTAGCGCAATTATGGATGAGACTCCACCGACCAGTCTACAAGGTATTTTTTCTCCATCTTTTCCAGATCCTATCTTG
ACTAAAAAGCCCTTAGTTTTTGATGATTTAGAACAGGAAAGGACAACGCCGAAAATTGCCGAAATTTTGGTAGCTTTGAATGAAGCAAGGGGAGAAGATCCATTG
GAGGATGATAAAAACATGCATGGAGATGAGTTTGAGGACGAAGAAGAAAATGACGATATCTCTCAATATGAAGTGAGAGTACAAACTCCGGTGCACGAATCTCAG
CAAGTTGATGAGGAGCCCCCTGCACAAGAGCAAGAAGGAACATCCGGTCCTGTGGATATCCCTAGTGAGGCCATGGAGGAATCATTTTCTTCTTCTTCACAAGGT
AAGACCCCTTCTTTGTCAAGTTTGAATGTTTCTGACCCAAACTTTGTTTCTACTATAGAGACTTCAGATGAAGAGCCTAGGACCCACACTGCTGTAGCACGTTTG
GCTACTCAAAAAGAAGTCGAGGTTGGTCCATCTAAAAAAGCCAAGAGGGCTAAGGTGCAAAGAGGGGCAGAAGAGCTACTTGAGGAGGCCAATGAAGAGGAGCCC
GATTCTACCGAACAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCACCTTCACAACACGTGATATCCTCCTTGAGAGAGGTTTCGATGAG
ACCCAAGAACCGGTGCCGGAATATGTTAGGAGAAGACTTGTGGATAATGGTCAGGAGTCGTTGTTTGCCCCAACTACGTGTGTATCCGAGGCCATGGTGAAAGAG
TTTTACACTGCCATCAACCCAAACCGAGGGGATGCAGTGAGAGTACGGGGTAATGAAATTTTGGTGCATCTACCGGACGAGCAAGTGGAGGAGGCTCGTAGGCTT
ATTTGTAGACCACATAAGACATGGACCGCGGGAGTGGAGGTCGATGATGCCAATGTTGTGATGCCCAAGAAGCTGTTCACATCCCTAAGAAGAGTTCGGGGGTAT
TCCATTGTTCGAGAGGAAGATTCTCCCATTACCACTGCAGATCGCGAGACCCGAGGGGTGGTGACTAGGGAGCATTATGATGAACTTAGGCACAAGTACGAGCTT
TTTTTGGTTACTCAACGTGCCACATGTGCTTTCCTCAAGAAGATATACGGTGATGAAGCACCTTCATTCCCCGATGAGCTTGCGGCCGATTTACCATCTTTTTCC
CGTTTTCCTACTGATTCCACCAACGATGAATCTTCCGATGATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCAGTAAACATAACGAAATTAGGGATAAAGAGAATGAGGGGGTTTATGCGAAAATTGAGGAGTTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAG
AAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGAATGAGTATACGTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTACCGAGTCTTTAGAACTGTCTATC
CCTCCCCTTCTTTCCACTATTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGTTAGTGGAGACTCAGAACACGTCATGGAGCCCTTGGAGCATTCAGATTTGGCC
ACGGTCGAAATTCAATGCCAAATTGCGCCTAGCGCAATTATGGATGAGACTCCACCGACCAGTCTACAAGGTATTTTTTCTCCATCTTTTCCAGATCCTATCTTG
ACTAAAAAGCCCTTAGTTTTTGATGATTTAGAACAGGAAAGGACAACGCCGAAAATTGCCGAAATTTTGGTAGCTTTGAATGAAGCAAGGGGAGAAGATCCATTG
GAGGATGATAAAAACATGCATGGAGATGAGTTTGAGGACGAAGAAGAAAATGACGATATCTCTCAATATGAAGTGAGAGTACAAACTCCGGTGCACGAATCTCAG
CAAGTTGATGAGGAGCCCCCTGCACAAGAGCAAGAAGGAACATCCGGTCCTGTGGATATCCCTAGTGAGGCCATGGAGGAATCATTTTCTTCTTCTTCACAAGGT
AAGACCCCTTCTTTGTCAAGTTTGAATGTTTCTGACCCAAACTTTGTTTCTACTATAGAGACTTCAGATGAAGAGCCTAGGACCCACACTGCTGTAGCACGTTTG
GCTACTCAAAAAGAAGTCGAGGTTGGTCCATCTAAAAAAGCCAAGAGGGCTAAGGTGCAAAGAGGGGCAGAAGAGCTACTTGAGGAGGCCAATGAAGAGGAGCCC
GATTCTACCGAACAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCACCTTCACAACACGTGATATCCTCCTTGAGAGAGGTTTCGATGAG
ACCCAAGAACCGGTGCCGGAATATGTTAGGAGAAGACTTGTGGATAATGGTCAGGAGTCGTTGTTTGCCCCAACTACGTGTGTATCCGAGGCCATGGTGAAAGAG
TTTTACACTGCCATCAACCCAAACCGAGGGGATGCAGTGAGAGTACGGGGTAATGAAATTTTGGTGCATCTACCGGACGAGCAAGTGGAGGAGGCTCGTAGGCTT
ATTTGTAGACCACATAAGACATGGACCGCGGGAGTGGAGGTCGATGATGCCAATGTTGTGATGCCCAAGAAGCTGTTCACATCCCTAAGAAGAGTTCGGGGGTAT
TCCATTGTTCGAGAGGAAGATTCTCCCATTACCACTGCAGATCGCGAGACCCGAGGGGTGGTGACTAGGGAGCATTATGATGAACTTAGGCACAAGTACGAGCTT
TTTTTGGTTACTCAACGTGCCACATGTGCTTTCCTCAAGAAGATATACGGTGATGAAGCACCTTCATTCCCCGATGAGCTTGCGGCCGATTTACCATCTTTTTCC
CGTTTTCCTACTGATTCCACCAACGATGAATCTTCCGATGATGAATAG
Protein sequenceShow/hide protein sequence
MFSKHNEIRDKENEGVYAKIEELNIKWQEFMENSKKVSEEIQLELNRMSIRRRMNLSQDNPVTESLELSIPPLLSTIVAVHVEGQEQVSGDSEHVMEPLEHSDLA
TVEIQCQIAPSAIMDETPPTSLQGIFSPSFPDPILTKKPLVFDDLEQERTTPKIAEILVALNEARGEDPLEDDKNMHGDEFEDEEENDDISQYEVRVQTPVHESQ
QVDEEPPAQEQEGTSGPVDIPSEAMEESFSSSSQGKTPSLSSLNVSDPNFVSTIETSDEEPRTHTAVARLATQKEVEVGPSKKAKRAKVQRGAEELLEEANEEEP
DSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDETQEPVPEYVRRRLVDNGQESLFAPTTCVSEAMVKEFYTAINPNRGDAVRVRGNEILVHLPDEQVEEARRL
ICRPHKTWTAGVEVDDANVVMPKKLFTSLRRVRGYSIVREEDSPITTADRETRGVVTREHYDELRHKYELFLVTQRATCAFLKKIYGDEAPSFPDELAADLPSFS
RFPTDSTNDESSDDE