; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019852 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019852
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHomer protein isoform 2
Genome locationscaffold22:12492..15257
RNA-Seq ExpressionMS019852
SyntenyMS019852
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022140001.1 uncharacterized protein LOC111010771 [Momordica charantia]0.0e+0099.25Show/hide
Query:  MVELQGCAGLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEKN
        MVELQGCAGLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSK MPRSFKRLKRSKEDSVEKN
Subjt:  MVELQGCAGLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEKN

Query:  ETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYSE
        ETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVT DGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHD+EQDIDLDSNAGYSE
Subjt:  ETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYSE

Query:  NRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKIVVFSCLGIRHGC
        NRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKIVVFSCLGIRHGC
Subjt:  NRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKIVVFSCLGIRHGC

Query:  EEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLPQNEYVRQRMGRN
        EEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLPQNEYVRQRMGRN
Subjt:  EEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLPQNEYVRQRMGRN

Query:  KLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQLAGRTIPTKPRG
        KLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQLAGRTIPTKPRG
Subjt:  KLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQLAGRTIPTKPRG

Query:  KKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAFVPLMYWPPPNSF
        KKQSNTNEQPGSSQKTAAHQSSFSTQIG VGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAFVPLMYWPPPNSF
Subjt:  KKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAFVPLMYWPPPNSF

Query:  NPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        NPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPF LKPVEENAKNEDISEETDSNSDSTSSSKD
Subjt:  NPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

XP_022954557.1 uncharacterized protein LOC111456794 isoform X1 [Cucurbita moschata]4.0e-28176.99Show/hide
Query:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK
        MVELQ CA GLVN  ALCAAIDQESKGEN+NVIA+MADEL RERQRN +L++RISFLEAKLLQERV K+ +L D LGSCSK   RSFKRLKRSKE     
Subjt:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK

Query:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS
         E +MK GT   +PK+ N E+RLVSWMSMDETQFVH EKLKECD TVD VD+DETD+E+ YY EE ++PFDIKDWE NGNS+SV+ V+QDI  DSN+   
Subjt:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS

Query:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI
        EN+S  ENKPTFVD+QTE +E GK  EPK+   ER+E + + +Y           +VG  +VSLQRKPPKLAFCPKEVKGIIESE LLQ+NAQSHTMRKI
Subjt:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI

Query:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP
        +VFSCLGIRHGCEE+YELDFNHFSI+RKGEPFISPQ+PGEHVLYENPGVRRKI YPNRHHPTLCPVQILEEEKAMRPLD NCPSCLFLCIKYGGRTRNLP
Subjt:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP

Query:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ
        QNEYVRQRMGRNKLKSFGPLMCRMA LAN RSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDA GEELFLARST D  DDK VQEQ
Subjt:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ

Query:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF
        LAG TI  K RGKKQ++T+EQPGSS KTA H +S ST+ G VGYTSIQT A+AAFQSLPSPSQTP+DSNHPI      SSC VASYQNQ+P NYFPPNAF
Subjt:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF

Query:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        VPLMYWPPPNSFNPGLYPS YTYHSFP SGN ISF    C S P SPF  K +E+ AKNED+ +ETDSNSD+TSSSKD
Subjt:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

XP_022994184.1 uncharacterized protein LOC111489999 isoform X1 [Cucurbita maxima]1.2e-28076.84Show/hide
Query:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK
        MVELQ CA GLVN  ALCAAIDQESKGEN+NVIA+MADEL RERQRNA+L++RISFLEAKLLQERV K+ +LAD LGSCSK   RS KRLKRSKE     
Subjt:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK

Query:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS
         E +MK GT   +PK+ N E+RLVSWM+MDETQFVH EKLKECD TVD VD+DETD+E+DYY EE ++PFDIKDWE NGNS+SV+ V+QDI  DSN    
Subjt:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS

Query:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI
        EN+S AENKPTFVD+QTE +E GK  EPK+   ER+E + + +Y           +VG  +V LQRKPPKLAFCPKEV+GIIESE LLQ+NAQSHTMRKI
Subjt:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI

Query:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP
        +VFSCLGIRHGCEE+YELDFNHFSI+RKGEPFISPQ+PGEHVLYENPGVRRKI YPNRHHPTLCPVQILEEEKAMRPLD NCPSCLFLCIKYGGRTRNLP
Subjt:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP

Query:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ
        QNEYVRQRMGRNKLKSFGPLMCR AMLAN RSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDA GEELFLARST D  DDK VQ++
Subjt:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ

Query:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF
        LAG TI TK RGKKQ++T+EQPGSS KTA H +S ST+ G VGYTSIQT A+AAFQSLPSPSQ P+D+NHPI      SSC VASYQNQ+P NYFPPN F
Subjt:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF

Query:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        VPLMYWPPPNSFNPGLYPS YTYHSFP SGN ISF    C S P SPF  K +E+ AKNEDISEETDSNSD+TSSSKD
Subjt:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

XP_023542600.1 uncharacterized protein LOC111802457 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-28377.29Show/hide
Query:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK
        MVE+Q CA GLVN  ALCAAIDQESKGEN+NVIA+MADEL RERQRNA+L++RISFLEAKLLQERV K+ +L D LGSCSK   RSFKRLKRSKE     
Subjt:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK

Query:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS
         E ++K GT   +PK+ N E+RLVSWMSMDETQFVH EKLKECD TVD VD+DETD+E+DYY EE ++PFDIK WE NGNS+SV+ V+QDI  DSN+   
Subjt:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS

Query:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI
        EN+S AENKPTFVD+QTE +E GK  EPK+   ER+E + + +Y           +VG  +VSLQRKPPKLAFCPKEVKGIIESE LLQ+NAQSHTMRKI
Subjt:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI

Query:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP
        +VFSCLGIRHGCEE+YELDFNHFSI+RKGEPFISPQ+PGEHVLYENPGVRRKI YPNRHHPTLCPVQILEEEKAMRP D NCPSCLFLCIKYGGRTRNLP
Subjt:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP

Query:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ
        QNEYVRQRMGRNKLKSFGPLMCRMA LAN RSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDA GEELFLARST D  DDK VQEQ
Subjt:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ

Query:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF
        LAG TI TK RGKKQ++T+EQPGSS KTA H +S ST+ G VGYTSIQT A+AAFQSLPSPSQTP++SNHPI      SSC VASYQNQ+P NYFPPNAF
Subjt:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF

Query:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        VPLMYWPPPNSFNPGLYPS YTYHSFP SGN ISF    C S PCSPF  K +E+ AKNED+SEETDSNSD+TSSSKD
Subjt:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

XP_038894583.1 uncharacterized protein LOC120083101 [Benincasa hispida]2.3e-29277.02Show/hide
Query:  MVELQGC-AGLVNGP-ALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKED---
        MVELQ C AGLVN P  LC+ ID ES+GENVNVIAQMADELQRERQRNAEL++RISFLEAKLL+E+V K+ QLAD LGSCSKP+ RSFKRLKR+KE+   
Subjt:  MVELQGC-AGLVNGP-ALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKED---

Query:  ---SVEKNETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDL
           +V+KNET+MK GT   +  DAN E++LVSWMSMDETQFVHCEK KECD+TVD  DTDETDDE+DY  E  ++PFD+KDWEINGNSKS +D +QDI L
Subjt:  ---SVEKNETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDL

Query:  DSNAGYSENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTL-----------------VYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEIL
        DSN GY EN+  AENK      QT+A+EYGK  EPK+EVEERSE  +T                  VY VG RNVSLQ+KPPKLAFCPKEVKGIIESE+L
Subjt:  DSNAGYSENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTL-----------------VYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEIL

Query:  LQKNAQSHTMRKIVVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLF
         QKNAQSHTMRKI+VFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQ+PGEHVLYENPGVRR+IFYPNRH+PTLCPVQILEEEKAMRPLD NCPSC F
Subjt:  LQKNAQSHTMRKIVVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLF

Query:  LCIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARS
        LCIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLAN RSGSFFFKALGITLLFMAGFPDD+VRKETKYRNLDLLQKYYRTDKDAEGEELFL   
Subjt:  LCIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARS

Query:  TIDKFDDKSVQEQLAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSC-GVASY
        T D  DDK VQEQ  GRT+  K RGK+ S+TN+QP SS          ST+ GLVGYTSIQT AVAAFQSLPSPSQ PVDSNHPI NP+V SSC  +ASY
Subjt:  TIDKFDDKSVQEQLAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSC-GVASY

Query:  QNQHPFNYFPPNAFVPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        Q Q+PFNYFPPN+FVPLMYWPPPNSFNPGLYPSPY YHSFPSSGN ISFQ+ P  S PCSPF  K +EENAKNEDISEETDSNSDSTSSSKD
Subjt:  QNQHPFNYFPPNAFVPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

TrEMBL top hitse value%identityAlignment
A0A5A7UQE7 Homer protein isoform 26.9e-27974.38Show/hide
Query:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKED----
        MVELQ CA GLVN   LCA ID ESKGENVNVIAQMADELQRERQRNAEL++RISFLEAKLL+ERV K++QLAD LGSCSK + RSFKRLKRSKE+    
Subjt:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKED----

Query:  --SVEKNETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLD
          +VEKNE +MK GTH  + KD N E++LVSWMSMDETQFVHCEKLKECD+ VD VDTDETD+ED YY E  ++P DIKDWEINGNSKS +D +Q I LD
Subjt:  --SVEKNETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLD

Query:  SNAGYSENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVE---ERSEAKKTLVYN--------------VGLRNVSLQRKPPKLAFCPKEVKGIIESEILL
        SN+ Y  N S AEN       QT+A+EYGK QEPK+E+E   E++E KK  +Y+              VG RNVSLQ+KPPKLAFCPKEVK IIESE+LL
Subjt:  SNAGYSENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVE---ERSEAKKTLVYN--------------VGLRNVSLQRKPPKLAFCPKEVKGIIESEILL

Query:  QKNAQSHTMRKIVVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFL
        QKNAQSHTMRKI+VFSCLGIRHGCEEIY+LDFN FS+LRKGEPFISPQ+PGEHVLYENPG+RR+IFYPNRH+PTLCPVQILEEEK+MRPLD NCPSC FL
Subjt:  QKNAQSHTMRKIVVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFL

Query:  CIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARST
        CIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLAN RSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLA ST
Subjt:  CIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARST

Query:  IDKFDDKSVQEQLAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSC-GVASYQ
         D  D+K VQ++L GRT+  K +GKK S+T +QP SS  T        T+ GLVGYTSIQT AVAAFQSLPSPSQ PV+SNHPI NP+V SSC  +A YQ
Subjt:  IDKFDDKSVQEQLAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSC-GVASYQ

Query:  NQHPFNYFPPNAFVPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        + +PFNYFPPNAFVP MYWPPPNSFN G+YPS Y YHS PSSGN ISFQ+ P  S P   F  K +EENAKNEDISEET+SNSD+TSS+KD
Subjt:  NQHPFNYFPPNAFVPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

A0A5D3BHT6 Homer protein isoform 21.0e-27774.24Show/hide
Query:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKED----
        MVELQ CA GLVN   LCA ID ESKGENVNVIAQMADELQRERQRNAEL++RISFLEAKLL+ERV  ++QLAD LGSCSK + RSFKRLKRSKE+    
Subjt:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKED----

Query:  --SVEKNETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLD
          +VEKNE +MK GTH  + KD N E++LVSWMSMDETQFVHCEKLKECD+ VD VDTDETD+ED YY E  ++P DIKDWEINGNSKS +D +Q I LD
Subjt:  --SVEKNETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLD

Query:  SNAGYSENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVE---ERSEAKKTLVYN--------------VGLRNVSLQRKPPKLAFCPKEVKGIIESEILL
        SN+ Y  N S AEN       QT+A+EYGK QEPK+E+E   E++E KK  +Y+              VG RNVSLQ+KPPKLAFCPKEVK IIESE+LL
Subjt:  SNAGYSENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVE---ERSEAKKTLVYN--------------VGLRNVSLQRKPPKLAFCPKEVKGIIESEILL

Query:  QKNAQSHTMRKIVVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFL
        QKNAQSHTMRKI+VFSCLGIRHGCEEIY+LDFN FS+LRKGEPFISPQ+PGEHVLYENPG+RR+IFYPNRH+PTLCPVQILEEEK+MRPLD NCPSC FL
Subjt:  QKNAQSHTMRKIVVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFL

Query:  CIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARST
        CIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLAN RSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLA ST
Subjt:  CIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARST

Query:  IDKFDDKSVQEQLAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSC-GVASYQ
         D  D+K VQ++L GRT+  K +GKK S+T +QP SS  T        T+ GLVGYTSIQT AVAAFQSLPSPSQ PV+SNHPI NP+V SSC  +A YQ
Subjt:  IDKFDDKSVQEQLAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSC-GVASYQ

Query:  NQHPFNYFPPNAFVPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        + +PFNYFPPNAFVP MYWPPPNSFN G+YPS Y YHS PSSGN ISFQ+ P  S P   F  K +EENAKNEDISEET+SNSD+TSS+KD
Subjt:  NQHPFNYFPPNAFVPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

A0A6J1CED1 uncharacterized protein LOC1110107710.0e+0099.25Show/hide
Query:  MVELQGCAGLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEKN
        MVELQGCAGLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSK MPRSFKRLKRSKEDSVEKN
Subjt:  MVELQGCAGLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEKN

Query:  ETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYSE
        ETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVT DGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHD+EQDIDLDSNAGYSE
Subjt:  ETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYSE

Query:  NRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKIVVFSCLGIRHGC
        NRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKIVVFSCLGIRHGC
Subjt:  NRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKIVVFSCLGIRHGC

Query:  EEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLPQNEYVRQRMGRN
        EEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLPQNEYVRQRMGRN
Subjt:  EEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLPQNEYVRQRMGRN

Query:  KLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQLAGRTIPTKPRG
        KLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQLAGRTIPTKPRG
Subjt:  KLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQLAGRTIPTKPRG

Query:  KKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAFVPLMYWPPPNSF
        KKQSNTNEQPGSSQKTAAHQSSFSTQIG VGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAFVPLMYWPPPNSF
Subjt:  KKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAFVPLMYWPPPNSF

Query:  NPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        NPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPF LKPVEENAKNEDISEETDSNSDSTSSSKD
Subjt:  NPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

A0A6J1GRF6 uncharacterized protein LOC111456794 isoform X12.0e-28176.99Show/hide
Query:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK
        MVELQ CA GLVN  ALCAAIDQESKGEN+NVIA+MADEL RERQRN +L++RISFLEAKLLQERV K+ +L D LGSCSK   RSFKRLKRSKE     
Subjt:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK

Query:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS
         E +MK GT   +PK+ N E+RLVSWMSMDETQFVH EKLKECD TVD VD+DETD+E+ YY EE ++PFDIKDWE NGNS+SV+ V+QDI  DSN+   
Subjt:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS

Query:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI
        EN+S  ENKPTFVD+QTE +E GK  EPK+   ER+E + + +Y           +VG  +VSLQRKPPKLAFCPKEVKGIIESE LLQ+NAQSHTMRKI
Subjt:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI

Query:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP
        +VFSCLGIRHGCEE+YELDFNHFSI+RKGEPFISPQ+PGEHVLYENPGVRRKI YPNRHHPTLCPVQILEEEKAMRPLD NCPSCLFLCIKYGGRTRNLP
Subjt:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP

Query:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ
        QNEYVRQRMGRNKLKSFGPLMCRMA LAN RSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDA GEELFLARST D  DDK VQEQ
Subjt:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ

Query:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF
        LAG TI  K RGKKQ++T+EQPGSS KTA H +S ST+ G VGYTSIQT A+AAFQSLPSPSQTP+DSNHPI      SSC VASYQNQ+P NYFPPNAF
Subjt:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF

Query:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        VPLMYWPPPNSFNPGLYPS YTYHSFP SGN ISF    C S P SPF  K +E+ AKNED+ +ETDSNSD+TSSSKD
Subjt:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

A0A6J1K272 uncharacterized protein LOC111489999 isoform X15.7e-28176.84Show/hide
Query:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK
        MVELQ CA GLVN  ALCAAIDQESKGEN+NVIA+MADEL RERQRNA+L++RISFLEAKLLQERV K+ +LAD LGSCSK   RS KRLKRSKE     
Subjt:  MVELQGCA-GLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEK

Query:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS
         E +MK GT   +PK+ N E+RLVSWM+MDETQFVH EKLKECD TVD VD+DETD+E+DYY EE ++PFDIKDWE NGNS+SV+ V+QDI  DSN    
Subjt:  NETVMKCGTHCFTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYS

Query:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI
        EN+S AENKPTFVD+QTE +E GK  EPK+   ER+E + + +Y           +VG  +V LQRKPPKLAFCPKEV+GIIESE LLQ+NAQSHTMRKI
Subjt:  ENRSCAENKPTFVDEQTEAEEYGKGQEPKTEVEERSEAKKTLVY-----------NVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKI

Query:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP
        +VFSCLGIRHGCEE+YELDFNHFSI+RKGEPFISPQ+PGEHVLYENPGVRRKI YPNRHHPTLCPVQILEEEKAMRPLD NCPSCLFLCIKYGGRTRNLP
Subjt:  VVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHVLYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLP

Query:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ
        QNEYVRQRMGRNKLKSFGPLMCR AMLAN RSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDA GEELFLARST D  DDK VQ++
Subjt:  QNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPDDLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQ

Query:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF
        LAG TI TK RGKKQ++T+EQPGSS KTA H +S ST+ G VGYTSIQT A+AAFQSLPSPSQ P+D+NHPI      SSC VASYQNQ+P NYFPPN F
Subjt:  LAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPSQTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAF

Query:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD
        VPLMYWPPPNSFNPGLYPS YTYHSFP SGN ISF    C S P SPF  K +E+ AKNEDISEETDSNSD+TSSSKD
Subjt:  VPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDSTSSSKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAGCTTCAAGGTTGTGCTGGTTTAGTGAATGGGCCTGCATTATGTGCTGCAATAGACCAGGAATCCAAAGGGGAGAATGTGAATGTTATTGCACAAATGGCTGA
TGAGTTACAGAGGGAGAGACAGAGAAATGCTGAACTGCTGGACAGAATATCATTTCTTGAAGCTAAATTGTTACAGGAAAGAGTGTATAAGGAAACCCAACTTGCTGATG
GACTTGGCAGTTGTTCCAAGCCAATGCCAAGAAGCTTCAAGAGGCTTAAAAGAAGCAAAGAAGATAGTGTTGAGAAAAATGAAACAGTAATGAAGTGTGGCACTCACTGT
TTCACCCCCAAAGATGCAAATCCAGAAAATCGATTGGTCAGTTGGATGAGTATGGATGAGACACAGTTTGTGCATTGTGAAAAATTGAAGGAATGTGATGTTACTGTGGA
TGGTGTAGATACAGACGAAACCGATGATGAAGATGATTATTATCAGGAGGAGATTGATGTTCCTTTTGACATAAAGGATTGGGAAATTAATGGGAATTCAAAAAGTGTCC
ATGATGTTGAACAAGATATCGATCTCGATTCGAATGCAGGATATTCGGAGAACCGCTCTTGTGCTGAAAATAAGCCAACATTTGTAGATGAGCAAACAGAAGCTGAAGAA
TATGGAAAAGGACAGGAGCCTAAAACTGAGGTAGAAGAGAGAAGTGAAGCCAAAAAAACTTTAGTGTATAATGTAGGATTACGAAATGTATCACTCCAAAGAAAGCCTCC
AAAGTTAGCTTTCTGTCCTAAAGAAGTAAAGGGGATTATTGAATCAGAAATCTTGCTGCAGAAAAATGCACAGTCCCACACCATGAGGAAGATAGTAGTTTTTTCATGTC
TTGGTATAAGGCATGGTTGTGAGGAGATCTACGAGTTAGACTTCAATCATTTCAGTATTCTAAGAAAAGGAGAGCCATTCATTTCTCCTCAAGATCCTGGGGAGCATGTC
TTGTATGAGAATCCTGGTGTGAGGAGGAAGATCTTCTACCCCAATAGACACCACCCGACATTATGCCCTGTTCAAATACTTGAGGAAGAGAAAGCTATGCGACCATTGGA
TGCTAACTGTCCTTCTTGCTTATTTCTCTGCATCAAATACGGCGGCAGGACGAGGAACCTCCCGCAGAACGAATATGTGAGGCAGCGAATGGGAAGAAACAAGCTCAAGT
CTTTTGGGCCACTCATGTGCAGGATGGCTATGCTGGCTAATACTCGCAGTGGAAGTTTTTTCTTCAAAGCCTTGGGCATTACCCTTCTCTTCATGGCTGGTTTTCCAGAT
GATCTCGTCCGCAAAGAAACCAAATATCGTAATTTGGACTTGCTTCAGAAATACTACAGGACAGACAAAGATGCCGAAGGCGAAGAGTTATTCCTCGCACGCTCAACGAT
TGACAAATTTGATGACAAATCTGTTCAGGAGCAGCTAGCTGGAAGAACTATTCCAACAAAACCAAGGGGGAAAAAACAAAGTAACACCAATGAACAACCTGGCTCTTCAC
AAAAGACAGCAGCCCATCAGTCATCATTTTCAACACAAATTGGATTAGTAGGATATACTTCAATCCAAACTAGTGCTGTGGCAGCATTTCAGTCACTGCCATCTCCATCT
CAGACACCAGTTGATAGTAACCATCCAATTTCCAACCCAGTGGTTGGAAGTTCCTGTGGTGTGGCCTCCTACCAAAATCAACACCCATTTAATTATTTCCCACCAAACGC
ATTTGTTCCTCTGATGTATTGGCCTCCTCCCAACTCATTCAATCCAGGACTTTACCCTTCTCCTTATACTTACCATTCTTTTCCATCTAGTGGAAATTGTATATCCTTCC
AAACTCATCCTTGTGGCAGCCTTCCATGCAGCCCCTTCAAACTAAAACCTGTGGAAGAAAATGCAAAGAATGAGGACATCTCAGAAGAAACGGATAGTAACTCTGACAGT
ACTTCAAGCAGTAAAGAT
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAGCTTCAAGGTTGTGCTGGTTTAGTGAATGGGCCTGCATTATGTGCTGCAATAGACCAGGAATCCAAAGGGGAGAATGTGAATGTTATTGCACAAATGGCTGA
TGAGTTACAGAGGGAGAGACAGAGAAATGCTGAACTGCTGGACAGAATATCATTTCTTGAAGCTAAATTGTTACAGGAAAGAGTGTATAAGGAAACCCAACTTGCTGATG
GACTTGGCAGTTGTTCCAAGCCAATGCCAAGAAGCTTCAAGAGGCTTAAAAGAAGCAAAGAAGATAGTGTTGAGAAAAATGAAACAGTAATGAAGTGTGGCACTCACTGT
TTCACCCCCAAAGATGCAAATCCAGAAAATCGATTGGTCAGTTGGATGAGTATGGATGAGACACAGTTTGTGCATTGTGAAAAATTGAAGGAATGTGATGTTACTGTGGA
TGGTGTAGATACAGACGAAACCGATGATGAAGATGATTATTATCAGGAGGAGATTGATGTTCCTTTTGACATAAAGGATTGGGAAATTAATGGGAATTCAAAAAGTGTCC
ATGATGTTGAACAAGATATCGATCTCGATTCGAATGCAGGATATTCGGAGAACCGCTCTTGTGCTGAAAATAAGCCAACATTTGTAGATGAGCAAACAGAAGCTGAAGAA
TATGGAAAAGGACAGGAGCCTAAAACTGAGGTAGAAGAGAGAAGTGAAGCCAAAAAAACTTTAGTGTATAATGTAGGATTACGAAATGTATCACTCCAAAGAAAGCCTCC
AAAGTTAGCTTTCTGTCCTAAAGAAGTAAAGGGGATTATTGAATCAGAAATCTTGCTGCAGAAAAATGCACAGTCCCACACCATGAGGAAGATAGTAGTTTTTTCATGTC
TTGGTATAAGGCATGGTTGTGAGGAGATCTACGAGTTAGACTTCAATCATTTCAGTATTCTAAGAAAAGGAGAGCCATTCATTTCTCCTCAAGATCCTGGGGAGCATGTC
TTGTATGAGAATCCTGGTGTGAGGAGGAAGATCTTCTACCCCAATAGACACCACCCGACATTATGCCCTGTTCAAATACTTGAGGAAGAGAAAGCTATGCGACCATTGGA
TGCTAACTGTCCTTCTTGCTTATTTCTCTGCATCAAATACGGCGGCAGGACGAGGAACCTCCCGCAGAACGAATATGTGAGGCAGCGAATGGGAAGAAACAAGCTCAAGT
CTTTTGGGCCACTCATGTGCAGGATGGCTATGCTGGCTAATACTCGCAGTGGAAGTTTTTTCTTCAAAGCCTTGGGCATTACCCTTCTCTTCATGGCTGGTTTTCCAGAT
GATCTCGTCCGCAAAGAAACCAAATATCGTAATTTGGACTTGCTTCAGAAATACTACAGGACAGACAAAGATGCCGAAGGCGAAGAGTTATTCCTCGCACGCTCAACGAT
TGACAAATTTGATGACAAATCTGTTCAGGAGCAGCTAGCTGGAAGAACTATTCCAACAAAACCAAGGGGGAAAAAACAAAGTAACACCAATGAACAACCTGGCTCTTCAC
AAAAGACAGCAGCCCATCAGTCATCATTTTCAACACAAATTGGATTAGTAGGATATACTTCAATCCAAACTAGTGCTGTGGCAGCATTTCAGTCACTGCCATCTCCATCT
CAGACACCAGTTGATAGTAACCATCCAATTTCCAACCCAGTGGTTGGAAGTTCCTGTGGTGTGGCCTCCTACCAAAATCAACACCCATTTAATTATTTCCCACCAAACGC
ATTTGTTCCTCTGATGTATTGGCCTCCTCCCAACTCATTCAATCCAGGACTTTACCCTTCTCCTTATACTTACCATTCTTTTCCATCTAGTGGAAATTGTATATCCTTCC
AAACTCATCCTTGTGGCAGCCTTCCATGCAGCCCCTTCAAACTAAAACCTGTGGAAGAAAATGCAAAGAATGAGGACATCTCAGAAGAAACGGATAGTAACTCTGACAGT
ACTTCAAGCAGTAAAGAT
Protein sequenceShow/hide protein sequence
MVELQGCAGLVNGPALCAAIDQESKGENVNVIAQMADELQRERQRNAELLDRISFLEAKLLQERVYKETQLADGLGSCSKPMPRSFKRLKRSKEDSVEKNETVMKCGTHC
FTPKDANPENRLVSWMSMDETQFVHCEKLKECDVTVDGVDTDETDDEDDYYQEEIDVPFDIKDWEINGNSKSVHDVEQDIDLDSNAGYSENRSCAENKPTFVDEQTEAEE
YGKGQEPKTEVEERSEAKKTLVYNVGLRNVSLQRKPPKLAFCPKEVKGIIESEILLQKNAQSHTMRKIVVFSCLGIRHGCEEIYELDFNHFSILRKGEPFISPQDPGEHV
LYENPGVRRKIFYPNRHHPTLCPVQILEEEKAMRPLDANCPSCLFLCIKYGGRTRNLPQNEYVRQRMGRNKLKSFGPLMCRMAMLANTRSGSFFFKALGITLLFMAGFPD
DLVRKETKYRNLDLLQKYYRTDKDAEGEELFLARSTIDKFDDKSVQEQLAGRTIPTKPRGKKQSNTNEQPGSSQKTAAHQSSFSTQIGLVGYTSIQTSAVAAFQSLPSPS
QTPVDSNHPISNPVVGSSCGVASYQNQHPFNYFPPNAFVPLMYWPPPNSFNPGLYPSPYTYHSFPSSGNCISFQTHPCGSLPCSPFKLKPVEENAKNEDISEETDSNSDS
TSSSKD