CuGenDBv2

Lag0002133 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0002133
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DUF3685 domain-containing protein
Genome location: chr4:39690400..39703954
RNA-Seq Expression: Lag0002133
Synteny: Lag0002133
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR022552 - Uncharacterised protein family Ycf55


Homology
GenBank top hits (e value; % identity; alignment)
KAG6575973.1 putative protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.1e-295; % identity: 74.97
Query:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFRGISHFRPNGSPRFVARCSSGDGDSRTVLDAFFLGKALAEALTERIESTIGEVLG
        MSSGV  SVSPS FS++ TK KITHCSS S + F SSST SNL+      FRPNGS RF+A CSSGDGD+RTVLDAFFLGKA AE LTER+EST+GEVL 
Subjt:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFRGISHFRPNGSPRFVARCSSGDGDSRTVLDAFFLGKALAEALTERIESTIGEVLG

Query:  GIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPTIEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVEESSDFLQFEVTGSA
         IGR+QAERQ+QI+DFQEEVI+RAKKAK+KA RDAKE QGPISSSIIS TIEVTSSPT S++ QQ  +P+  SE VVNQDPP                  
Subjt:  GIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPTIEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVEESSDFLQFEVTGSA

Query:  SVFYLRRLNSSTAGVKRYQLFESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSN
                              SSSL P        MAE+V V P I+LQ+ RTPF+ KS APC+FSFKREQR+SSC SYKF RISTWRRR LSGF GSN
Subjt:  SVFYLRRLNSSTAGVKRYQLFESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSN

Query:  LIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKI
        LIV+PAPRK FR+HAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIP+PKSNQPGNIIS T+SASDNPTFSGS MK DDQIN K+ALDVVKGKI
Subjt:  LIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKI

Query:  LDFLDAFERRRSMEN-------------------------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSS
        LDFLDAFERR+S+EN                                     EVNN+S ATIQNMDDLS IFSKFIQKSS PVCMSWLK+ELSM+NNDSS
Subjt:  LDFLDAFERRRSMEN-------------------------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSS

Query:  KVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTL
        K FLS MSEKLKAEDNIL GIKKSGKEELYAELMHFLSFG RRDYCYYD+SL+VKHGISILED LITFADGIASMYLEFISVDS+FFDEVDNIGLALCTL
Subjt:  KVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTL

Query:  STRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYF
        STRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLCTL SQQIELP SRQ NIDNWWMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYF
Subjt:  STRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYF

Query:  SLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        SLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Subjt:  SLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK

KAG7014497.1 putative protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]; e value: 2.1e-280; % identity: 72.53
Query:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFRGISHFRPNGSPRFVARCSSGDGDSRTVLDAFFLGKALAEALTERIESTIGEVLG
        MSSGV  SVSPS FS++ TK KITHCSS S + F SSST SNL+      FRPNGS RF+A CSSGDGD+RTVLDAFFLGKA AE LTER+EST+GEVL 
Subjt:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFRGISHFRPNGSPRFVARCSSGDGDSRTVLDAFFLGKALAEALTERIESTIGEVLG

Query:  GIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPTIEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVEESSDFLQ-FEVTGS
         IGR+QAERQ+QI+DFQEEVI+RAKKAK+KA RDAKE QGPISSSIIS TIEVTSSPT S++ QQ  +P+  SE VV QDPP   EE ++ L+ F+    
Subjt:  GIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPTIEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVEESSDFLQ-FEVTGS

Query:  ASVFYLRRLNSSTAGVKRYQLFESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGS
          + +  +L +S           S+S++         M +      S+  +  RT                + ++SSC SYKF RISTWRRR LSGF GS
Subjt:  ASVFYLRRLNSSTAGVKRYQLFESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGS

Query:  NLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGK
        NLIV+PAPRK FR+HAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIP+PKSNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGK
Subjt:  NLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGK

Query:  ILDFLDAFERRRSMENE-----------------------------------VNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSK
        ILDFLDAFERR+S+ENE                                   VNN+S ATIQNMDDLS IFSKFIQKSS PVCMSWLK+ELSM+NNDSSK
Subjt:  ILDFLDAFERRRSMENE-----------------------------------VNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSK

Query:  VFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLS
         FLS MSEKLKAEDNIL GIKKSGKEELYAELMHFLSFG RRDYCYYD+SL+VKHGISILED LITFADGIASMYLEFISVDS+FFDEVDNIGLALCTLS
Subjt:  VFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLS

Query:  TRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFS
        TRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLCTL SQQIELP SRQ NIDNWWMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFS
Subjt:  TRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFS

Query:  LLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        LLIELSDIT P+IRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Subjt:  LLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK

XP_022151384.1 uncharacterized protein LOC111019333 isoform X1 [Momordica charantia]; e value: 7.5e-233; % identity: 81.36
Query:  FESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSL
        FE  S  P+     L MAEHV VTP I+LQ+ RTPFK+KS  PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF GS LIVNPAPRKTFR+HAYLRSL
Subjt:  FESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSL

Query:  VNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-----
        VNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PKSNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMEN     
Subjt:  VNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-----

Query:  --------------------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQG
                                        EVNNIS+ TIQNMDDLSKIFSKFIQKSS PVC SWLK ELSME NDSSK FLS MSEKLKAEDNILQG
Subjt:  --------------------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQG

Query:  IKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQ
        IKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILED LITFADGIASMYLEFISVDS+F DEVDN+GLALC LSTRALQRLRNEV MNQWLYQ
Subjt:  IKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQ

Query:  NVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDK
        NVEAIVSMYEDRFDLCTLGSQ IELP SRQ  IDNWWM+  LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDK
Subjt:  NVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDK

Query:  ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Subjt:  ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK

XP_022151393.1 uncharacterized protein LOC111019333 isoform X2 [Momordica charantia]; e value: 7.5e-233; % identity: 82.79
Query:  LPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFV
        L MAEHV VTP I+LQ+ RTPFK+KS  PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF GS LIVNPAPRKTFR+HAYLRSLVNVDGT ASEVL V
Subjt:  LPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFV

Query:  DQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-------------------
        DQLLLM SIFLTYMAGVIP+PKSNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMEN                   
Subjt:  DQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-------------------

Query:  ------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELM
                          EVNNIS+ TIQNMDDLSKIFSKFIQKSS PVC SWLK ELSME NDSSK FLS MSEKLKAEDNILQGIKKSGKEELYAELM
Subjt:  ------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELM

Query:  HFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFD
        HFLSFGARRDYCYYDHSLYVKHGISILED LITFADGIASMYLEFISVDS+F DEVDN+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFD
Subjt:  HFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFD

Query:  LCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIG
        LCTLGSQ IELP SRQ  IDNWWM+  LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIG
Subjt:  LCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIG

Query:  RSLGLIYTGIRQSLRWK
        RSLGLIYTGIRQSLRWK
Subjt:  RSLGLIYTGIRQSLRWK

XP_023549012.1 uncharacterized protein LOC111807500 isoform X2 [Cucurbita pepo subsp. pepo]; e value: 5.7e-233; % identity: 82.91
Query:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ
        MAE+V V P I+LQ+ RTPF+ KS APC+FSFKREQR+SSC SYKF RISTWRRR LSGF GSNLIV PAPRK FR+HAYLRSLVNVDGTTASEVLFVDQ
Subjt:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ

Query:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------
        LLLMTSIFLTYMAGVIP+PKSNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGKILDFLDAFERR+S+EN                     
Subjt:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------

Query:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF
                        EVNN+S ATIQNMDDLS IFSKFIQKSSQPVCMSWLK+ELSM+NNDSSK FLS MSEKLKAEDNIL GIKKSGKEELYAELMHF
Subjt:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF

Query:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFG RRDYCYYD+SL+VKHGISILED LITFADGIASMYLEFISVDS+FFDEVDNIGLALCTLSTRALQRLRNEVAMNQW YQN+EAIVSMYEDRFDLC
Subjt:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS
        TL SQQIELP SRQ NIDNWWMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFSLLIELSDITMP+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

TrEMBL top hits (e value; % identity; alignment)
A0A6J1DC25 uncharacterized protein LOC111019333 isoform X2; e value: 3.6e-233; % identity: 82.79
Query:  LPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFV
        L MAEHV VTP I+LQ+ RTPFK+KS  PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF GS LIVNPAPRKTFR+HAYLRSLVNVDGT ASEVL V
Subjt:  LPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFV

Query:  DQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-------------------
        DQLLLM SIFLTYMAGVIP+PKSNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMEN                   
Subjt:  DQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-------------------

Query:  ------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELM
                          EVNNIS+ TIQNMDDLSKIFSKFIQKSS PVC SWLK ELSME NDSSK FLS MSEKLKAEDNILQGIKKSGKEELYAELM
Subjt:  ------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELM

Query:  HFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFD
        HFLSFGARRDYCYYDHSLYVKHGISILED LITFADGIASMYLEFISVDS+F DEVDN+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFD
Subjt:  HFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFD

Query:  LCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIG
        LCTLGSQ IELP SRQ  IDNWWM+  LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIG
Subjt:  LCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIG

Query:  RSLGLIYTGIRQSLRWK
        RSLGLIYTGIRQSLRWK
Subjt:  RSLGLIYTGIRQSLRWK

A0A6J1DCX7 uncharacterized protein LOC111019333 isoform X1; e value: 3.6e-233; % identity: 81.36
Query:  FESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSL
        FE  S  P+     L MAEHV VTP I+LQ+ RTPFK+KS  PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF GS LIVNPAPRKTFR+HAYLRSL
Subjt:  FESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSL

Query:  VNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-----
        VNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PKSNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMEN     
Subjt:  VNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN-----

Query:  --------------------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQG
                                        EVNNIS+ TIQNMDDLSKIFSKFIQKSS PVC SWLK ELSME NDSSK FLS MSEKLKAEDNILQG
Subjt:  --------------------------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQG

Query:  IKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQ
        IKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILED LITFADGIASMYLEFISVDS+F DEVDN+GLALC LSTRALQRLRNEV MNQWLYQ
Subjt:  IKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQ

Query:  NVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDK
        NVEAIVSMYEDRFDLCTLGSQ IELP SRQ  IDNWWM+  LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDK
Subjt:  NVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDK

Query:  ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Subjt:  ISSGISFFLVCLIGRSLGLIYTGIRQSLRWK

A0A6J1DDE2 uncharacterized protein LOC111019333 isoform X3; e value: 6.2e-233; % identity: 82.91
Query:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ
        MAEHV VTP I+LQ+ RTPFK+KS  PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF GS LIVNPAPRKTFR+HAYLRSLVNVDGT ASEVL VDQ
Subjt:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ

Query:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------
        LLLM SIFLTYMAGVIP+PKSNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMEN                     
Subjt:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------

Query:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF
                        EVNNIS+ TIQNMDDLSKIFSKFIQKSS PVC SWLK ELSME NDSSK FLS MSEKLKAEDNILQGIKKSGKEELYAELMHF
Subjt:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF

Query:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFGARRDYCYYDHSLYVKHGISILED LITFADGIASMYLEFISVDS+F DEVDN+GLALC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLC
Subjt:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS
        TLGSQ IELP SRQ  IDNWWM+  LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

A0A6J1GQ87 uncharacterized protein LOC111456110 isoform X2; e value: 6.2e-233; % identity: 82.72
Query:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ
        MAE+V V P I+LQ+ RTPF+ KS APC+FSFKREQR+SSC SYKF RISTWRRR LSGF GSNLIV+PAPRK FR+HAYLRSLVNVDGTTASEVLFVDQ
Subjt:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ

Query:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------
        LLLMTSIFLTYMAGVIP+PKSNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGKILDFLDAFERR+S+EN                     
Subjt:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------

Query:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF
                        EVNN+S ATIQNMDDLS IFSKFIQKSS PVCMSWLK+ELSM+NNDSSK FLS MSEKLKAEDNIL GIKKSGKEELYAELMHF
Subjt:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF

Query:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFG RRDYCYYD+SL+VKHGISILED LITFADGIASMYLEFISVDS+FFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLC
Subjt:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS
        TL SQQIELP SRQ NIDNWWMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

A0A6J1JVH5 uncharacterized protein LOC111488215 isoform X2; e value: 3.6e-233; % identity: 82.91
Query:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ
        MAE+V V P I+LQ+ RTPF+ KS APC+FSFKRE+RKSSC SYKF RISTWRRR LSGF GSNLIV+PAPRK FR+HA LRSLVNVDGTTASEVLFVDQ
Subjt:  MAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASEVLFVDQ

Query:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------
        LLLMTSIFLTYMAGVIP+PKSNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGKILDFLDAFE R+S+EN                     
Subjt:  LLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMEN---------------------

Query:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF
                        EVNN+S ATIQNMDDLS IFSKFIQKSSQPVCMSWLK+ELSM+NNDSSK FLS MSEKLKAEDNIL GIKKSGKEELYAELMHF
Subjt:  ----------------EVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF

Query:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC
        LSFG RRDYCYYD+SL+VKHGISILED LITFADGIASMYLEFISVDS+FFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLC
Subjt:  LSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLC

Query:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS
        TL SQQIELP SRQ NIDNWWMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS
Subjt:  TLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRS

Query:  LGLIYTGIRQSLRWK
        LGLIYTGIRQSLRWK
Subjt:  LGLIYTGIRQSLRWK

SwissProt top hits (e value; % identity; alignment)
P73628 Thylakoid protein sll1769; e value: 2.1e-04; % identity: 33.75
Query:  VLDAFFLGKALAEALTERIESTIGEVLGGIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPTIE
        VL AFFLG+A AE L+E++E  +   L  +G+  AE+++ +  F  EV  RA     +       V GP+S+  +  T++
Subjt:  VLDAFFLGKALAEALTERIESTIGEVLGGIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPTIE

Q8LDV3 Uncharacterized protein At4g13200, chloroplastic; e value: 2.1e-20; % identity: 42.36
Query:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFR----GISHFRPNGSPRFVARCS-----SGDGDSRTVLDAFFLGKALAEALTERI
        MSS   P  SPS FSL N+  +    +SP    F   ++ SN  F      +   R + S R  + CS     SG+ ++++VLDAFFLGKALAE + ERI
Subjt:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFR----GISHFRPNGSPRFVARCS-----SGDGDSRTVLDAFFLGKALAEALTERI

Query:  ESTIGEVLGGIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPT-----IEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVE
        EST+GEVL  IG+ QAE+QKQ+ + QEEV+ERAKKAK++AAR+  E QG ++S  ++ T     + V S  +TST              V ++   +DVE
Subjt:  ESTIGEVLGGIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPT-----IEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVE

Query:  ESS
        ESS
Subjt:  ESS

Arabidopsis top hits (e value; % identity; alignment)
AT4G13200.1 unknown protein; e value: 1.5e-21; % identity: 42.36
Query:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFR----GISHFRPNGSPRFVARCS-----SGDGDSRTVLDAFFLGKALAEALTERI
        MSS   P  SPS FSL N+  +    +SP    F   ++ SN  F      +   R + S R  + CS     SG+ ++++VLDAFFLGKALAE + ERI
Subjt:  MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFR----GISHFRPNGSPRFVARCS-----SGDGDSRTVLDAFFLGKALAEALTERI

Query:  ESTIGEVLGGIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPT-----IEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVE
        EST+GEVL  IG+ QAE+QKQ+ + QEEV+ERAKKAK++AAR+  E QG ++S  ++ T     + V S  +TST              V ++   +DVE
Subjt:  ESTIGEVLGGIGRVQAERQKQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPT-----IEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVE

Query:  ESS
        ESS
Subjt:  ESS

AT5G48830.1 unknown protein; e value: 7.8e-95; % identity: 43.38
Query:  MAEHVTVTP--SIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASE-VLF
        M  HV V+P  S++L++        S    N   K  QR     S K  + +      L   C            T +   +  SL + DG   S  V  
Subjt:  MAEHVTVTP--SIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASE-VLF

Query:  VDQLLLMTSIFLTYMAGVIPLPK-SNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERR----------------------
         DQ+LL  SIFLTYMAGVIP+ K S       +  +   +  T   SG + D + + K   DVVK K+LD LDA +R                       
Subjt:  VDQLLLMTSIFLTYMAGVIPLPK-SNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERR----------------------

Query:  ---------------RSMENEVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAE
                       + +E E N IS  TI N D+    F++ ++++ Q  C +WLK EL +EN DS       +   L  +D I   I+KSGKE+L+AE
Subjt:  ---------------RSMENEVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAE

Query:  LMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDR
         ++F  FG+      YD S +  HG++ILEDF+IT ADG+AS+YLE ISVDS F +E+++ GL++C+LS+RALQ+LRNEVA+ QWL+QN+EA+VSMYEDR
Subjt:  LMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDR

Query:  FDLCTLGSQQI-ELPSSRQVNIDNWWMKHILRRTETL-SSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLV
        FDL  L +Q I  L  S      +WW K  L +T+   SS L Y +I  FS+PVKRTKEL+AL GWRYYFSL +ELSDI MP+IRVV+DK+SS ISFFLV
Subjt:  FDLCTLGSQQI-ELPSSRQVNIDNWWMKHILRRTETL-SSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLV

Query:  CLIGRSLGLIYTGIRQSLRWK
         LIGRS+GLI+TGIRQSLRWK
Subjt:  CLIGRSLGLIYTGIRQSLRWK

AT5G48830.2 unknown protein; e value: 6.8e-91; % identity: 42.61
Query:  MAEHVTVTP--SIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASE-VLF
        M  HV V+P  S++L++        S    N   K  QR     S K  + +      L   C            T +   +  SL + DG   S  V  
Subjt:  MAEHVTVTP--SIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASE-VLF

Query:  VDQLLLMTSIFLTYMAGVIPLPK-SNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERR----------------------
         DQ+LL  SIFLTYMAGVIP+ K S       +  +   +  T   SG + D + + K   DVVK K+LD LDA +R                       
Subjt:  VDQLLLMTSIFLTYMAGVIPLPK-SNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERR----------------------

Query:  ---------------RSMENEVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSS-------KVFLSSMSEKLKAEDNILQGIKKSG
                       + +E E N IS  TI N D+    F++ ++++ Q  C +WLK EL +EN DS        +     +   L  +D I   I+KSG
Subjt:  ---------------RSMENEVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSS-------KVFLSSMSEKLKAEDNILQGIKKSG

Query:  KEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAI
        KE+L+AE ++F  FG+      YD S +  HG++ILEDF+IT ADG+AS+YLE ISVDS F +E+++ GL++C+LS+RALQ+LRNEVA+ QWL+QN+EA+
Subjt:  KEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAI

Query:  VSMYEDRFDLCTLGSQQI-ELPSSRQVNIDNWWMKHILRRTETL-SSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISS
        VSMYEDRFDL  L +Q I  L  S      +WW K  L +T+   SS L Y +I  FS+PVKRTKEL+AL GW YYFSL +ELSDI MP+IRVV+DK+SS
Subjt:  VSMYEDRFDLCTLGSQQI-ELPSSRQVNIDNWWMKHILRRTETL-SSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISS

Query:  GISFFLVCLIGRSLGLIYTGIRQSLRWK
         ISFFLV LIGRS+GLI+TGIRQSLRWK
Subjt:  GISFFLVCLIGRSLGLIYTGIRQSLRWK


Sequences
CDS sequence
ATGAGTAGTGGGGTTACACCTTCGGTTTCCCCATCCCCTTTCTCTCTTCTGAACACTAAACCAAAAATTACTCATTGTTCATCTCCATCTCAACTTCTATTTTGCTCCTC
TTCCACTCCCTCAAATCTCAGGTTTCGAGGAATTTCACATTTTCGACCAAATGGGTCTCCCAGATTCGTAGCTCGTTGCAGCTCCGGTGATGGTGACAGCAGGACTGTTC
TAGATGCCTTTTTCTTGGGAAAAGCTTTAGCAGAAGCCTTAACTGAGCGTATCGAGTCAACAATTGGAGAGGTCTTAGGCGGGATTGGTAGGGTGCAAGCTGAACGACAA
AAACAAATTCTTGATTTCCAGGAGGAGGTGATAGAAAGAGCCAAAAAAGCCAAGGACAAAGCAGCACGTGATGCCAAAGAAGTACAAGGACCCATCTCCTCTTCAATAAT
ATCGCCTACGATCGAAGTTACTTCATCTCCAACTACTTCGACCGATCGACAACAACCCCTGAACCCCGACCCAGATTCCGAAATTGTTGTGAACCAGGATCCTCCTCTTG
ACGTTGAAGAATCGAGTGATTTTCTTCAGTTCGAGGTCACTGGCTCCGCCTCCGTTTTCTATTTACGGAGATTGAACAGTTCAACGGCTGGTGTAAAGAGGTACCAGCTG
TTTGAAAGCTCTTCCTTGTTCCCGAGCTCTGCGCTAATTCTCCTTCCAATGGCAGAGCATGTGACTGTCACACCATCTATCGAGCTGCAAGTTTGGAGAACTCCCTTCAA
AGTCAAAAGCTTTGCACCATGCAATTTCAGTTTTAAAAGAGAACAGAGAAAATCCTCTTGTGAGAGCTATAAGTTCATAAGGATCTCAACTTGGAGAAGGCGTGAGCTTA
GTGGTTTTTGTGGCTCAAACTTAATTGTAAATCCTGCTCCCAGGAAGACCTTCAGAGATCATGCTTACCTAAGGTCTTTGGTAAACGTTGATGGAACAACAGCCTCTGAG
GTACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTTCTAACATATATGGCTGGAGTAATACCTTTACCAAAGTCTAATCAACCTGGAAATATCATCTCTCAAAC
CGATTCAGCCTCAGATAACCCAACCTTTTCTGGTAGTGGCATGAAGGTTGATGATCAAATTAATCCGAAGCATGCATTAGATGTAGTTAAAGGAAAGATTTTGGATTTTC
TAGATGCTTTTGAACGTAGGAGAAGTATGGAAAATGAGGTCAACAATATTTCTGATGCTACTATTCAGAACATGGACGATTTGTCTAAAATATTTTCTAAATTTATCCAA
AAATCCTCTCAACCTGTTTGCATGTCTTGGCTGAAAAGCGAACTGTCTATGGAAAATAATGATTCTAGTAAGGTGTTTCTTTCTTCGATGTCTGAGAAGCTTAAAGCAGA
AGACAATATTTTACAAGGAATTAAAAAGTCTGGCAAGGAAGAGCTCTATGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGGGATTATTGCTATTATGACCATA
GCCTGTACGTCAAGCATGGGATTTCAATATTAGAAGATTTTCTAATAACCTTTGCTGACGGGATTGCAAGTATGTATCTGGAATTTATTTCTGTTGACAGCACTTTCTTC
GATGAAGTGGATAACATTGGCCTGGCATTGTGTACCCTATCAACACGGGCACTTCAAAGGTTGCGTAATGAGGTAGCTATGAACCAATGGTTGTATCAAAACGTCGAGGC
AATTGTATCGATGTATGAAGACCGATTTGATCTATGTACACTTGGTAGTCAACAGATTGAGCTACCAAGCAGTAGACAGGTCAATATCGATAATTGGTGGATGAAACATA
TCCTCAGAAGAACTGAAACTTTGTCTTCTCAGTTACATTATGTTGTGATACGCTCCTTCTCCATGCCTGTAAAGAGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGAGA
TATTACTTCAGCCTGTTGATTGAATTATCCGACATTACGATGCCATTGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCATTCTTTCTAGTTTGCCTGATTGG
AAGATCTTTAGGGCTCATCTATACAGGAATTAGGCAGTCACTAAGGTGGAAGTAA
mRNA sequence
ATGAGTAGTGGGGTTACACCTTCGGTTTCCCCATCCCCTTTCTCTCTTCTGAACACTAAACCAAAAATTACTCATTGTTCATCTCCATCTCAACTTCTATTTTGCTCCTC
TTCCACTCCCTCAAATCTCAGGTTTCGAGGAATTTCACATTTTCGACCAAATGGGTCTCCCAGATTCGTAGCTCGTTGCAGCTCCGGTGATGGTGACAGCAGGACTGTTC
TAGATGCCTTTTTCTTGGGAAAAGCTTTAGCAGAAGCCTTAACTGAGCGTATCGAGTCAACAATTGGAGAGGTCTTAGGCGGGATTGGTAGGGTGCAAGCTGAACGACAA
AAACAAATTCTTGATTTCCAGGAGGAGGTGATAGAAAGAGCCAAAAAAGCCAAGGACAAAGCAGCACGTGATGCCAAAGAAGTACAAGGACCCATCTCCTCTTCAATAAT
ATCGCCTACGATCGAAGTTACTTCATCTCCAACTACTTCGACCGATCGACAACAACCCCTGAACCCCGACCCAGATTCCGAAATTGTTGTGAACCAGGATCCTCCTCTTG
ACGTTGAAGAATCGAGTGATTTTCTTCAGTTCGAGGTCACTGGCTCCGCCTCCGTTTTCTATTTACGGAGATTGAACAGTTCAACGGCTGGTGTAAAGAGGTACCAGCTG
TTTGAAAGCTCTTCCTTGTTCCCGAGCTCTGCGCTAATTCTCCTTCCAATGGCAGAGCATGTGACTGTCACACCATCTATCGAGCTGCAAGTTTGGAGAACTCCCTTCAA
AGTCAAAAGCTTTGCACCATGCAATTTCAGTTTTAAAAGAGAACAGAGAAAATCCTCTTGTGAGAGCTATAAGTTCATAAGGATCTCAACTTGGAGAAGGCGTGAGCTTA
GTGGTTTTTGTGGCTCAAACTTAATTGTAAATCCTGCTCCCAGGAAGACCTTCAGAGATCATGCTTACCTAAGGTCTTTGGTAAACGTTGATGGAACAACAGCCTCTGAG
GTACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTTCTAACATATATGGCTGGAGTAATACCTTTACCAAAGTCTAATCAACCTGGAAATATCATCTCTCAAAC
CGATTCAGCCTCAGATAACCCAACCTTTTCTGGTAGTGGCATGAAGGTTGATGATCAAATTAATCCGAAGCATGCATTAGATGTAGTTAAAGGAAAGATTTTGGATTTTC
TAGATGCTTTTGAACGTAGGAGAAGTATGGAAAATGAGGTCAACAATATTTCTGATGCTACTATTCAGAACATGGACGATTTGTCTAAAATATTTTCTAAATTTATCCAA
AAATCCTCTCAACCTGTTTGCATGTCTTGGCTGAAAAGCGAACTGTCTATGGAAAATAATGATTCTAGTAAGGTGTTTCTTTCTTCGATGTCTGAGAAGCTTAAAGCAGA
AGACAATATTTTACAAGGAATTAAAAAGTCTGGCAAGGAAGAGCTCTATGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGGGATTATTGCTATTATGACCATA
GCCTGTACGTCAAGCATGGGATTTCAATATTAGAAGATTTTCTAATAACCTTTGCTGACGGGATTGCAAGTATGTATCTGGAATTTATTTCTGTTGACAGCACTTTCTTC
GATGAAGTGGATAACATTGGCCTGGCATTGTGTACCCTATCAACACGGGCACTTCAAAGGTTGCGTAATGAGGTAGCTATGAACCAATGGTTGTATCAAAACGTCGAGGC
AATTGTATCGATGTATGAAGACCGATTTGATCTATGTACACTTGGTAGTCAACAGATTGAGCTACCAAGCAGTAGACAGGTCAATATCGATAATTGGTGGATGAAACATA
TCCTCAGAAGAACTGAAACTTTGTCTTCTCAGTTACATTATGTTGTGATACGCTCCTTCTCCATGCCTGTAAAGAGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGAGA
TATTACTTCAGCCTGTTGATTGAATTATCCGACATTACGATGCCATTGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCATTCTTTCTAGTTTGCCTGATTGG
AAGATCTTTAGGGCTCATCTATACAGGAATTAGGCAGTCACTAAGGTGGAAGTAA
Protein sequence
MSSGVTPSVSPSPFSLLNTKPKITHCSSPSQLLFCSSSTPSNLRFRGISHFRPNGSPRFVARCSSGDGDSRTVLDAFFLGKALAEALTERIESTIGEVLGGIGRVQAERQ
KQILDFQEEVIERAKKAKDKAARDAKEVQGPISSSIISPTIEVTSSPTTSTDRQQPLNPDPDSEIVVNQDPPLDVEESSDFLQFEVTGSASVFYLRRLNSSTAGVKRYQL
FESSSLFPSSALILLPMAEHVTVTPSIELQVWRTPFKVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPAPRKTFRDHAYLRSLVNVDGTTASE
VLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKVDDQINPKHALDVVKGKILDFLDAFERRRSMENEVNNISDATIQNMDDLSKIFSKFIQ
KSSQPVCMSWLKSELSMENNDSSKVFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDFLITFADGIASMYLEFISVDSTFF
DEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIELPSSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWR
YYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK