; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g12460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g12460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr4:9501049..9503458
RNA-Seq ExpressionMoc04g12460
SyntenyMoc04g12460
Gene Ontology termsGO:0004386 - helicase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144472.1 LOW QUALITY PROTEIN: ATP-dependent helicase NAM7-like [Momordica charantia]1.1e-1984.62Show/hide
Query:  DPILTKKPLVFDDLEPKRTTSKIAEILVALNETRGEDPLEDDGNSGAAQRQLNVDGDDENLGELP
        DPILTKKP+VFDDLE +RTT KI EILVALNE RGEDPLEDDGN+GAAQ QLNVDG+DE+LGELP
Subjt:  DPILTKKPLVFDDLEPKRTTSKIAEILVALNETRGEDPLEDDGNSGAAQRQLNVDGDDENLGELP

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]2.9e-13463.39Show/hide
Query:  QRQLNVDGDDENLGELPQ------------------------------KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFI
        + QLNVD +DE+ GELPQ                              +QVDEEPP KE EG S PVDVPSEAMEES S SSQG  S   +        +
Subjt:  QRQLNVDGDDENLGELPQ------------------------------KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFI

Query:  ATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSE
        A  + ++   S  A +ARVQ+ AEEPL+EANEEEPDS EQTPSRVKRVRLEVRRPTFTTRDILLERGFDE Q+PV EYVR+++V+NGWE+LFAP TRVSE
Subjt:  ATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSE

Query:  ALVKEFYTAINPNRGDVVRVQGK-----------------------------------LPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMK
        ALVKEFYTAINPNRGD VRV+G                                     PLDINEQ  VWMYVVKNR+IPTS+DSSIKRNR M+VYIL+K
Subjt:  ALVKEFYTAINPNRGDVVRVQGK-----------------------------------LPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMK

Query:  GTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPETRGVVTREKYNELKHKYELLL
        G EFNF ELIRNEI+SCSEK+                AGVEA DANVVM KKPF SLR+VRGYSIVRE D PITA DPETRGVVTRE+Y+EL+HKYELLL
Subjt:  GTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPETRGVVTREKYNELKHKYELLL

Query:  VTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDDE
        VTQRATCAFLKKIY DEAPSFPDELAADLPS S LPTDS DDESSDDE
Subjt:  VTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDDE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]6.3e-4151.16Show/hide
Query:  KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFIATTENSDEEVSLTAV----------------------------SARVQ
        +Q DEE   +E EG S  VDVP+EA+EES S SS+GK+ SLSSLNVSDPNF+A    S+E+V LT V                            S   Q
Subjt:  KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFIATTENSDEEVSLTAV----------------------------SARVQ

Query:  KGA-----------------EEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSEALV
        K A                 EEPL E N+EE DSIEQTPS+ KRVR EV+R  FT R+IL+E+GFDE Q+PV +Y++R+L++NGWE+LFAPT RVSE LV
Subjt:  KGA-----------------EEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSEALV

Query:  KEFYTAINPNRGDVV
        KEFY  INPNRGD +
Subjt:  KEFYTAINPNRGDVV

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]3.2e-3230.25Show/hide
Query:  MEGSSSSKPHDKEKEKKRVLLPPPTKPGMIPLESPRISHEKLVFDTREQRRKYEEAIKMNPRRNLSIGGTNFEKINMDSHDARVNKEGSSEKKLRGVSKV
        MEGSS SKP DKE EKK+V+LPPP  P                                                  + H ARVN+ G SEKKL G SKV
Subjt:  MEGSSSSKPHDKEKEKKRVLLPPPTKPGMIPLESPRISHEKLVFDTREQRRKYEEAIKMNPRRNLSIGGTNFEKINMDSHDARVNKEGSSEKKLRGVSKV

Query:  YLRKNQSLKEKRMNLSQ------DNPVSESLELSIPPPLSTTVAVHV-----------EDPILTKKPLVFDDLEPKRTTSKIAEILVALNETRGEDPLED
        YLRKNQS+ +K  +L +      +    E+ E  I    +  +   +           E+     + +  +  E +RTTSKI +ILVALNE  GEDPLED
Subjt:  YLRKNQSLKEKRMNLSQ------DNPVSESLELSIPPPLSTTVAVHV-----------EDPILTKKPLVFDDLEPKRTTSKIAEILVALNETRGEDPLED

Query:  DGNSGAAQRQLNVDGDDENLGELPQK-----------------------------QVDE-EPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLN
        DGNS  AQ +LNVDG+DE+LG+LPQ+                             Q D+ E P + HEG SDPVDVP+EA  +S S SS+          
Subjt:  DGNSGAAQRQLNVDGDDENLGELPQK-----------------------------QVDE-EPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLN

Query:  VSDPNFIATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFA
                  +NS EEV                    NEEEP S EQ  S+ K                                               
Subjt:  VSDPNFIATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFA

Query:  PTTRVSEALVKEFYTAINPNRGDVVRVQGKLPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFP
           RV EALVKEFY AI+PN+GD VRV+                                                                        
Subjt:  PTTRVSEALVKEFYTAINPNRGDVVRVQGKLPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFP

Query:  GLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPET
                  G++A+D +VV  KK  TS+RRVRGY IVRE D  IT  DPET
Subjt:  GLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPET

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia]3.1e-5667.58Show/hide
Query:  VWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVRE
        +W YVVKN +I TS+DSSI++ RVM+VYILMKG EFNF ELIRNEI  C+EKMVGP+IFP  I ELCL AGVEAD  +VVM+KK  TS+RRVRGY IVRE
Subjt:  VWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVRE

Query:  GDFPITAMDPETRGVVTREKYNE---LKHKYELLLVTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDD
         D PITA DP+TRGVVTRE+Y+E   L+H Y+LL  TQ ATC FLKK+Y D APS PDELAADLPS S      T D+S  D
Subjt:  GDFPITAMDPETRGVVTREKYNE---LKHKYELLLVTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDD

TrEMBL top hitse value%identityAlignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220071.4e-13463.39Show/hide
Query:  QRQLNVDGDDENLGELPQ------------------------------KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFI
        + QLNVD +DE+ GELPQ                              +QVDEEPP KE EG S PVDVPSEAMEES S SSQG  S   +        +
Subjt:  QRQLNVDGDDENLGELPQ------------------------------KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFI

Query:  ATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSE
        A  + ++   S  A +ARVQ+ AEEPL+EANEEEPDS EQTPSRVKRVRLEVRRPTFTTRDILLERGFDE Q+PV EYVR+++V+NGWE+LFAP TRVSE
Subjt:  ATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSE

Query:  ALVKEFYTAINPNRGDVVRVQGK-----------------------------------LPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMK
        ALVKEFYTAINPNRGD VRV+G                                     PLDINEQ  VWMYVVKNR+IPTS+DSSIKRNR M+VYIL+K
Subjt:  ALVKEFYTAINPNRGDVVRVQGK-----------------------------------LPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMK

Query:  GTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPETRGVVTREKYNELKHKYELLL
        G EFNF ELIRNEI+SCSEK+                AGVEA DANVVM KKPF SLR+VRGYSIVRE D PITA DPETRGVVTRE+Y+EL+HKYELLL
Subjt:  GTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPETRGVVTREKYNELKHKYELLL

Query:  VTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDDE
        VTQRATCAFLKKIY DEAPSFPDELAADLPS S LPTDS DDESSDDE
Subjt:  VTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDDE

A0A6J1DRR9 uncharacterized protein LOC1110237616.7e-2058.97Show/hide
Query:  IAEILVALNETRGEDPLEDDGNSGAAQRQLNVDGDDENLGELPQKQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFIATTE
        + E+LVALNE RGEDPL+DDGNSG                     Q DEEP A+E EG S P+DV SEAMEES S  SQ KTSSLSSLNVSDPNF+AT E
Subjt:  IAEILVALNETRGEDPLEDDGNSGAAQRQLNVDGDDENLGELPQKQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFIATTE

Query:  NSDEEVSLTAVSARVQK
         SDEEV+L  V  + QK
Subjt:  NSDEEVSLTAVSARVQK

A0A6J1DW11 uncharacterized protein LOC1110236203.1e-4151.16Show/hide
Query:  KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFIATTENSDEEVSLTAV----------------------------SARVQ
        +Q DEE   +E EG S  VDVP+EA+EES S SS+GK+ SLSSLNVSDPNF+A    S+E+V LT V                            S   Q
Subjt:  KQVDEEPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFIATTENSDEEVSLTAV----------------------------SARVQ

Query:  KGA-----------------EEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSEALV
        K A                 EEPL E N+EE DSIEQTPS+ KRVR EV+R  FT R+IL+E+GFDE Q+PV +Y++R+L++NGWE+LFAPT RVSE LV
Subjt:  KGA-----------------EEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFAPTTRVSEALV

Query:  KEFYTAINPNRGDVV
        KEFY  INPNRGD +
Subjt:  KEFYTAINPNRGDVV

A0A6J1DW79 uncharacterized protein LOC1110249641.5e-3230.25Show/hide
Query:  MEGSSSSKPHDKEKEKKRVLLPPPTKPGMIPLESPRISHEKLVFDTREQRRKYEEAIKMNPRRNLSIGGTNFEKINMDSHDARVNKEGSSEKKLRGVSKV
        MEGSS SKP DKE EKK+V+LPPP  P                                                  + H ARVN+ G SEKKL G SKV
Subjt:  MEGSSSSKPHDKEKEKKRVLLPPPTKPGMIPLESPRISHEKLVFDTREQRRKYEEAIKMNPRRNLSIGGTNFEKINMDSHDARVNKEGSSEKKLRGVSKV

Query:  YLRKNQSLKEKRMNLSQ------DNPVSESLELSIPPPLSTTVAVHV-----------EDPILTKKPLVFDDLEPKRTTSKIAEILVALNETRGEDPLED
        YLRKNQS+ +K  +L +      +    E+ E  I    +  +   +           E+     + +  +  E +RTTSKI +ILVALNE  GEDPLED
Subjt:  YLRKNQSLKEKRMNLSQ------DNPVSESLELSIPPPLSTTVAVHV-----------EDPILTKKPLVFDDLEPKRTTSKIAEILVALNETRGEDPLED

Query:  DGNSGAAQRQLNVDGDDENLGELPQK-----------------------------QVDE-EPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLN
        DGNS  AQ +LNVDG+DE+LG+LPQ+                             Q D+ E P + HEG SDPVDVP+EA  +S S SS+          
Subjt:  DGNSGAAQRQLNVDGDDENLGELPQK-----------------------------QVDE-EPPAKEHEGISDPVDVPSEAMEESPSPSSQGKTSSLSSLN

Query:  VSDPNFIATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFA
                  +NS EEV                    NEEEP S EQ  S+ K                                               
Subjt:  VSDPNFIATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEVQKPVLEYVRRKLVDNGWESLFA

Query:  PTTRVSEALVKEFYTAINPNRGDVVRVQGKLPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFP
           RV EALVKEFY AI+PN+GD VRV+                                                                        
Subjt:  PTTRVSEALVKEFYTAINPNRGDVVRVQGKLPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFP

Query:  GLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPET
                  G++A+D +VV  KK  TS+RRVRGY IVRE D  IT  DPET
Subjt:  GLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPET

A0A6J1E204 uncharacterized protein LOC1110257021.5e-5667.58Show/hide
Query:  VWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVRE
        +W YVVKN +I TS+DSSI++ RVM+VYILMKG EFNF ELIRNEI  C+EKMVGP+IFP  I ELCL AGVEAD  +VVM+KK  TS+RRVRGY IVRE
Subjt:  VWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIRSCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVRE

Query:  GDFPITAMDPETRGVVTREKYNE---LKHKYELLLVTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDD
         D PITA DP+TRGVVTRE+Y+E   L+H Y+LL  TQ ATC FLKK+Y D APS PDELAADLPS S      T D+S  D
Subjt:  GDFPITAMDPETRGVVTREKYNE---LKHKYELLLVTQRATCAFLKKIYDDEAPSFPDELAADLPSCSHLPTDSTDDESSDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTTCATCTTCCTCCAAGCCGCACGACAAAGAGAAGGAAAAGAAGAGAGTGTTATTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAATCTCCTAGGAT
TTCTCATGAAAAGTTAGTTTTTGATACTAGGGAACAAAGAAGGAAATATGAGGAAGCTATAAAAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACAAATTTTGAAA
AAATCAATATGGATTCTCATGATGCTCGAGTTAATAAAGAAGGTTCTAGTGAGAAAAAATTAAGAGGTGTTAGTAAAGTCTATCTTCGAAAAAATCAATCTTTAAAGGAA
AAAAGGATGAATCTTTCTCAAGATAACCCTGTTTCCGAGTCTTTAGAACTGTCTATCCCTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGATCCTATCTTGAC
TAAAAAGCCCCTAGTTTTTGATGATTTAGAACCGAAAAGGACAACGTCGAAAATTGCTGAAATTTTGGTGGCGTTGAATGAAACAAGGGGAGAGGATCCATTGGAGGATG
ATGGAAACAGTGGGGCAGCACAAAGACAATTGAATGTTGATGGAGATGATGAAAATCTTGGAGAATTACCCCAAAAACAAGTTGATGAGGAGCCCCCTGCAAAAGAGCAC
GAAGGAATATCTGATCCTGTGGATGTCCCTAGTGAGGCCATGGAGGAATCACCTTCTCCTTCTTCACAAGGTAAGACCTCTTCTTTGTCAAGTTTGAATGTTTCTGACCC
AAACTTCATTGCTACTACAGAGAATTCAGATGAGGAGGTGAGCTTGACTGCAGTGAGCGCTAGGGTGCAAAAAGGGGCTGAAGAACCACTTAAGGAGGCCAACGAAGAGG
AGCCCGATTCTATCGAACAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCACCTTCACAACACGTGATATCCTCCTTGAGAGAGGTTTTGATGAA
GTCCAAAAGCCGGTGTTGGAATATGTTAGGAGAAAGCTTGTGGATAATGGTTGGGAGTCGTTGTTTGCCCCAACTACACGTGTATCAGAAGCCTTGGTGAAAGAGTTTTA
CACTGCCATCAACCCAAACAGAGGGGATGTAGTGAGAGTACAGGGTAAATTGCCCCTTGACATTAATGAGCAAACGATGGTTTGGATGTATGTGGTGAAGAACCGGATGA
TCCCCACTTCTCACGATTCCTCCATTAAGCGCAATAGGGTGATGATGGTGTACATTCTCATGAAGGGCACTGAGTTCAACTTTAGGGAACTCATAAGAAACGAGATACGA
AGTTGCTCTGAGAAAATGGTAGGTCCTATTATTTTTCCTGGACTAATAACTGAGTTATGCTTGCCGGCGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGGCCAAGAA
GCCGTTCACATCCCTAAGAAGAGTTCGGGGGTATTCCATTGTTCGAGAGGGAGACTTTCCCATTACCGCCATGGATCCCGAGACCAGAGGGGTGGTGACTAGGGAGAAAT
ATAATGAGCTTAAGCACAAGTACGAGCTTCTTTTGGTTACTCAACGTGCCACATGTGCTTTCCTCAAGAAGATATACGATGATGAAGCACCTTCTTTCCCCGATGAGCTT
GCGGCCGACTTACCATCTTGTTCCCATCTTCCTACCGATTCCACCGACGATGAGTCTTCCGATGATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTTCATCTTCCTCCAAGCCGCACGACAAAGAGAAGGAAAAGAAGAGAGTGTTATTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAATCTCCTAGGAT
TTCTCATGAAAAGTTAGTTTTTGATACTAGGGAACAAAGAAGGAAATATGAGGAAGCTATAAAAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACAAATTTTGAAA
AAATCAATATGGATTCTCATGATGCTCGAGTTAATAAAGAAGGTTCTAGTGAGAAAAAATTAAGAGGTGTTAGTAAAGTCTATCTTCGAAAAAATCAATCTTTAAAGGAA
AAAAGGATGAATCTTTCTCAAGATAACCCTGTTTCCGAGTCTTTAGAACTGTCTATCCCTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGATCCTATCTTGAC
TAAAAAGCCCCTAGTTTTTGATGATTTAGAACCGAAAAGGACAACGTCGAAAATTGCTGAAATTTTGGTGGCGTTGAATGAAACAAGGGGAGAGGATCCATTGGAGGATG
ATGGAAACAGTGGGGCAGCACAAAGACAATTGAATGTTGATGGAGATGATGAAAATCTTGGAGAATTACCCCAAAAACAAGTTGATGAGGAGCCCCCTGCAAAAGAGCAC
GAAGGAATATCTGATCCTGTGGATGTCCCTAGTGAGGCCATGGAGGAATCACCTTCTCCTTCTTCACAAGGTAAGACCTCTTCTTTGTCAAGTTTGAATGTTTCTGACCC
AAACTTCATTGCTACTACAGAGAATTCAGATGAGGAGGTGAGCTTGACTGCAGTGAGCGCTAGGGTGCAAAAAGGGGCTGAAGAACCACTTAAGGAGGCCAACGAAGAGG
AGCCCGATTCTATCGAACAAACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGGAGGCCCACCTTCACAACACGTGATATCCTCCTTGAGAGAGGTTTTGATGAA
GTCCAAAAGCCGGTGTTGGAATATGTTAGGAGAAAGCTTGTGGATAATGGTTGGGAGTCGTTGTTTGCCCCAACTACACGTGTATCAGAAGCCTTGGTGAAAGAGTTTTA
CACTGCCATCAACCCAAACAGAGGGGATGTAGTGAGAGTACAGGGTAAATTGCCCCTTGACATTAATGAGCAAACGATGGTTTGGATGTATGTGGTGAAGAACCGGATGA
TCCCCACTTCTCACGATTCCTCCATTAAGCGCAATAGGGTGATGATGGTGTACATTCTCATGAAGGGCACTGAGTTCAACTTTAGGGAACTCATAAGAAACGAGATACGA
AGTTGCTCTGAGAAAATGGTAGGTCCTATTATTTTTCCTGGACTAATAACTGAGTTATGCTTGCCGGCGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGGCCAAGAA
GCCGTTCACATCCCTAAGAAGAGTTCGGGGGTATTCCATTGTTCGAGAGGGAGACTTTCCCATTACCGCCATGGATCCCGAGACCAGAGGGGTGGTGACTAGGGAGAAAT
ATAATGAGCTTAAGCACAAGTACGAGCTTCTTTTGGTTACTCAACGTGCCACATGTGCTTTCCTCAAGAAGATATACGATGATGAAGCACCTTCTTTCCCCGATGAGCTT
GCGGCCGACTTACCATCTTGTTCCCATCTTCCTACCGATTCCACCGACGATGAGTCTTCCGATGATGAATAG
Protein sequenceShow/hide protein sequence
MEGSSSSKPHDKEKEKKRVLLPPPTKPGMIPLESPRISHEKLVFDTREQRRKYEEAIKMNPRRNLSIGGTNFEKINMDSHDARVNKEGSSEKKLRGVSKVYLRKNQSLKE
KRMNLSQDNPVSESLELSIPPPLSTTVAVHVEDPILTKKPLVFDDLEPKRTTSKIAEILVALNETRGEDPLEDDGNSGAAQRQLNVDGDDENLGELPQKQVDEEPPAKEH
EGISDPVDVPSEAMEESPSPSSQGKTSSLSSLNVSDPNFIATTENSDEEVSLTAVSARVQKGAEEPLKEANEEEPDSIEQTPSRVKRVRLEVRRPTFTTRDILLERGFDE
VQKPVLEYVRRKLVDNGWESLFAPTTRVSEALVKEFYTAINPNRGDVVRVQGKLPLDINEQTMVWMYVVKNRMIPTSHDSSIKRNRVMMVYILMKGTEFNFRELIRNEIR
SCSEKMVGPIIFPGLITELCLPAGVEADDANVVMAKKPFTSLRRVRGYSIVREGDFPITAMDPETRGVVTREKYNELKHKYELLLVTQRATCAFLKKIYDDEAPSFPDEL
AADLPSCSHLPTDSTDDESSDDE