; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017939 (gene) of Chayote v1 genome

Gene IDSed0017939
OrganismSechium edule (Chayote v1)
DescriptionDUF3741 domain-containing protein/DUF4378 domain-containing protein
Genome locationLG05:40399476..40404909
RNA-Seq ExpressionSed0017939
SyntenySed0017939
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570707.1 Protein TRM32, partial [Cucurbita argyrosperma subsp. sororia]2.5e-30464.66Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNRNVPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N+HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE++EEESQKYWK+NSNSKR LSRT SIH L+ S Y          SR   P   M+L A G+KSNSLNAMD++ YF +RKI  Q TS TEKSNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ ++ RN SS S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK NE FPK +KS S PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE
        SKLVES SPK+ HE++  CDS+S S ++I  Q SSSS GPNR L+H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  P  T+D SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN
        EAQES G   S DG  IRG SET +SEN NL +GVQT TG      AS E+YS+LS+                                     Y     
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN

Query:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG
        REAK +HSQS+RL  E+ I   EK RKS+GRNLS+PDIDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK 
Subjt:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG

Query:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGC
        D N+PVD+SGSLN V+NDEG AW D+LEEKIP          QV  SECV+E    ++TV+   +LS  ++QV   ET ++ DETSKL++SEG+IL   C
Subjt:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGC

Query:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW
        S ANELEPS+DQP EA  EAL + ETI+NHEII DAEKIS  + L+ E  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYW
Subjt:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW

Query:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM
        EKV++DSDHQLL DLVYET HNVFEKSF SFLK+FS +SQ+RP+PLGQY+LED REKVAWYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+
Subjt:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM

Query:  EDLILDGLLDEVIS
        EDLILD LL+EVIS
Subjt:  EDLILDGLLDEVIS

KAG7010552.1 Protein TRM32 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-30464.66Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNRNVPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N+HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE++EEESQKYWK+NSNSKR LSRT SIH L+ S Y          SR   P   M+L A G+KSNSLNAMD++ YF +RKI  Q TSFTEKSNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ ++ RN SS S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK NE FPK +KS S PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE
        SKLVES SPK+ HE++  CDS+S S ++I  Q SSSS GPNR L+H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  P  T+D SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN
        EAQES G   S DG  IRG SET +SEN NL +GVQT TG      AS E+YS+LS+                                     Y     
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN

Query:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG
        REAK +HSQS+RL  E+ I   EK RKS+GRNLS+PDIDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK 
Subjt:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG

Query:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGC
        D N+PVD+SGSLN V+ND G AW D+LEEKIP          QV  SECV+E    ++TV+   +LS  ++QV   ET ++ DETSKL++SEG+IL   C
Subjt:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGC

Query:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW
        S ANELEPS+DQP EA  EAL + ETI+NHEII DAEKIS  + L+ E  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYW
Subjt:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW

Query:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM
        EKV++DSDHQLL DLVYET HNVFEKSF SFLK+FS +SQ+RP+PLGQY+LED REKVAWYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+
Subjt:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM

Query:  EDLILDGLLDEVIS
        EDLILD LL+EVIS
Subjt:  EDLILDGLLDEVIS

XP_022943817.1 uncharacterized protein LOC111448443 isoform X4 [Cucurbita moschata]1.1e-30767.39Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNR+VPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N+HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE+SEEESQ YWK+NSNSKR LSRT SIH L+ S Y          SR   P   M+L A G+KSNSLNAMD++ YF +RKI  Q TSFTE+SNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ ++ RN  S S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK NE FPK +KS S PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE
        SKLVES SPK+ HE++  CDS+S S ++I  Q SSSS GPNR L+H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  P  T+D SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD
        EAQES G   S DG  IRG SET +SEN NL +GVQT TG  ASL A  E+YSQ  E      REAK +HSQS+RL  E+ I   EK RKS+GRNLS+PD
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----
        IDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK D N+PVD+SGSLN V+NDEG AW D+LEEKIP     
Subjt:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----

Query:  -----QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAE
             QV  SECV+E    ++TV+    LS  ++QV   ET ++ DETSKL++SEG+IL   CS ANELEPS+DQP EA  EAL + ETI+NHEII DAE
Subjt:  -----QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAE

Query:  KISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSL
        KIS  + L+ E  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYWEKVD+DSDHQLL DLVYET HNVFEKSF SFLKTFS 
Subjt:  KISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSL

Query:  KSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS
        ++Q+RP+PLGQY+LED REKVAWYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+EDLILD LL+EVIS
Subjt:  KSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS

XP_023511930.1 uncharacterized protein LOC111776795 [Cucurbita pepo subsp. pepo]3.2e-30767.27Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNRNVPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE+SEEESQKYWKLNSNSKR LSRT SIH L+ S Y              +  + M+L A G+KSNSLNAMD++ YF +RKI  Q TSFTEKSNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ ++ RN SS S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK+NE FPK +KS S  QP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDL--SGHELPLDRE
        SKLVES SPK+ HE++  CDS+S S ++I  Q SSSS GPNR L+H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  PT  +  SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDL--SGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD
        EAQES G   S DG  IRG SET +SEN NL +GVQT TG  ASL A  E+YSQ  E      REAK + SQS+RL  E+ I   EK RKS+GRNLS+PD
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----
        IDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK D N+PVD+SGSLN V+NDEG AW D+LEEKIP     
Subjt:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----

Query:  -----QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAE
             QV  SECV+E    + TV+   +LS  ++QV   ET  + DETSKL++SEG+IL   CS ANELEPS+DQP EA  EAL + ETI+NHEII DAE
Subjt:  -----QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAE

Query:  KISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSL
        KIS  + L+ E  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYWEKVD+DSDHQLL DLVYET HNVFEKSF SFLK FS 
Subjt:  KISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSL

Query:  KSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS
        +SQ+RP+PLGQY+LED REKVAWYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+EDLILD LL+EVIS
Subjt:  KSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS

XP_038902824.1 uncharacterized protein LOC120089436 [Benincasa hispida]0.0e+0066.05Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  QRQDSNLQFN+NVPGCFWSIF+TIDYH WH+VKKMLP+RKHSRSKGG K ++N HHVAEV E I+DE++ L+CTAESC  DR S +AHV E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDT-DYFNQRKICIQSTSFTEKSNGVKK
         T E+S EE QKYWKL+SNSKRR SRT SIH L+PS +SP  +    DS+    K  MKLAASG+ SNSLNAMD+ DY  QR+I I+ TS TEKSNGVKK
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDT-DYFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHGNH---TLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE N++ RN SSRS KED HIQEI+KANRKLFAELL G H   TLQT +N K  ASLAKS SFPAPGLA+KGY KLSSLQHK  E FPK QKSVS PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHGNH---TLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDLSGHELPLDREEA
        SKLVESVSPK+ HE++T CDS+S S + IRQQ +SSSLG NR L+H GWNQLV+KRFNFIKQKI+HSFKERKKGN QKT+K I T+D SG EL L REEA
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDLSGHELPLDREEA

Query:  QESSGITPSEDGSDIR-------------------------------------------GCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYG
         ES G T +EDGS  R                                           G SETD +EN NL + VQT T   ASL+AS E+YSQLSEYG
Subjt:  QESSGITPSEDGSDIR-------------------------------------------GCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYG

Query:  LHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP------DQPLDTASLNVLE
           NREAK YHS+S+RL SE+ +   E+P+K FGRNLS PDIDLFCTLFTD PH V  TEKPKR LV  STDNNIR DEN        +PLD+ S  ++E
Subjt:  LHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP------DQPLDTASLNVLE

Query:  KGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVE-GNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCS
        +G+ N+ VD+SGSLN + NDEG AW D L+E+IP          QVS +E +VE  ++T +  S+LSH+NQVL L T  Q DETSK ++S+GAIL PGCS
Subjt:  KGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVE-GNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCS

Query:  IANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWE
        IANELEPS+DQ NEA TEALP+FETI+NHEI+DDAEKIS  + LH E  RIN+ADFNYMR+ILQLSSFIE+GH +DRPL+SS+F+GEEAHFYKKLECYW+
Subjt:  IANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWE

Query:  KVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEME
        KVD+DSDHQLLLDLV ETLHNV+EKSF+ FLKTFS +SQ+RPV LGQY+LE+ RE+V+WYLCLGPELDQSLDDVV RDLRKGDDWMNLQSETE IALE+E
Subjt:  KVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEME

Query:  DLILDGLLDEVIS
        DLILD LLDEV++
Subjt:  DLILDGLLDEVIS

TrEMBL top hitse value%identityAlignment
A0A1S3CMA3 uncharacterized protein LOC1035020863.7e-30164.11Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  QRQDSNLQFN+NVPGCFWSIFHTIDYH WH+VKKMLPHRKHSRSK  PK ++N HH AE+ + +       +C+ ESC  DR+   AHV E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDT-DYFNQRKICIQSTSFTEKSNGVKK
         T  +SEEESQKYWKL S+SKRRLSRT SIH L+PS YSPG +   GDS+    K  MKL ASG++S SL+A+D+ DY NQRKI I  TS TEKS+GVKK
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDT-DYFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHGNH---TLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE N+++RN S+RS KED H+QEI+KANRKLFAELL G H   TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK  E FPK QKSVS+ QP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHGNH---TLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDLSGHELPLDREEA
        SKLVES SPK+ HE++  CDS+S + ++I+ Q +SSSLG NR  KH GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K I   D SGHEL L  EEA
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDLSGHELPLDREEA

Query:  QESSGITPSEDGSDIR------------------------------------------GCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGL
         ES G   SEDGS IR                                          GCS TD SEN NL + VQT TG  ASL+AS EKYSQLS Y  
Subjt:  QESSGITPSEDGSDIR------------------------------------------GCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGL

Query:  HTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVL
          NREAK YHSQS+RL SE+ I   E P+K FGRNLS PDIDLFCTLFTD PH V  TEKPKR L   STDNNIRIDENP         +PLD  S  ++
Subjt:  HTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVL

Query:  EKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG-NKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGC
        E+GD N+ +D+S SLN +  DEGT W D LEEKIP          QV  +E VVE  + TV+  S+LSH  QVL L+T  Q DETSKL++SEGAI+ P C
Subjt:  EKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG-NKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGC

Query:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW
        S+ANE E S+DQ NE  TEALP+FET ++H IIDD EKIS  + LH E  RIN+A+FNYMR+ILQLSSFIE G  +DRPL+ S+F+GEEAHFYKKLECYW
Subjt:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW

Query:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM
        EKVD+DSDHQLLLDLVYETLHN++EKSF  FLKTFS +SQ+RP+PLGQY+LED REKV+WYL LGPELDQSLDDVV RDLRKG++WMNLQSETE IALE+
Subjt:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM

Query:  EDLILDGLLDEVIS
        EDLILD LLDEV+S
Subjt:  EDLILDGLLDEVIS

A0A6J1FSR2 uncharacterized protein LOC111448443 isoform X33.9e-30364.29Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNR+VPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N+HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE+SEEESQ YWK+NSNSKR LSRT SIH L+ S Y          SR   P   M+L A G+KSNSLNAMD++ YF +RKI  Q TSFTE+SNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ ++ RN  S S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK NE FPK +KS S PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE
        SKLVES SPK+ HE++  CDS+S S ++I  Q SSSS GPNR L+H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  P  T+D SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN
        EAQES G   S DG  IRG SET +SEN NL +GVQT TG      AS E+YS+LS+                                     Y     
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN

Query:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG
        REAK +HSQS+RL  E+ I   EK RKS+GRNLS+PDIDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK 
Subjt:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG

Query:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEGNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIAN
        D N+PVD+SGSLN V+NDEG AW D+LEEKIP          QV  SECV+E     E   ++         ET ++ DETSKL++SEG+IL   CS AN
Subjt:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEGNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIAN

Query:  ELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVD
        ELEPS+DQP EA  EAL + ETI+NHEII DAEKIS  + L+ E  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYWEKVD
Subjt:  ELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVD

Query:  EDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLI
        +DSDHQLL DLVYET HNVFEKSF SFLKTFS ++Q+RP+PLGQY+LED REKVAWYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+EDLI
Subjt:  EDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLI

Query:  LDGLLDEVIS
        LD LL+EVIS
Subjt:  LDGLLDEVIS

A0A6J1FU32 uncharacterized protein LOC111448443 isoform X45.3e-30867.39Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNR+VPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N+HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE+SEEESQ YWK+NSNSKR LSRT SIH L+ S Y          SR   P   M+L A G+KSNSLNAMD++ YF +RKI  Q TSFTE+SNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ ++ RN  S S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK NE FPK +KS S PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE
        SKLVES SPK+ HE++  CDS+S S ++I  Q SSSS GPNR L+H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  P  T+D SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD
        EAQES G   S DG  IRG SET +SEN NL +GVQT TG  ASL A  E+YSQ  E      REAK +HSQS+RL  E+ I   EK RKS+GRNLS+PD
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----
        IDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK D N+PVD+SGSLN V+NDEG AW D+LEEKIP     
Subjt:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----

Query:  -----QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAE
             QV  SECV+E    ++TV+    LS  ++QV   ET ++ DETSKL++SEG+IL   CS ANELEPS+DQP EA  EAL + ETI+NHEII DAE
Subjt:  -----QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAE

Query:  KISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSL
        KIS  + L+ E  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYWEKVD+DSDHQLL DLVYET HNVFEKSF SFLKTFS 
Subjt:  KISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSL

Query:  KSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS
        ++Q+RP+PLGQY+LED REKVAWYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+EDLILD LL+EVIS
Subjt:  KSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS

A0A6J1FYA4 uncharacterized protein LOC111448443 isoform X14.7e-30464.55Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNR+VPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N+HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE+SEEESQ YWK+NSNSKR LSRT SIH L+ S Y          SR   P   M+L A G+KSNSLNAMD++ YF +RKI  Q TSFTE+SNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ ++ RN  S S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK NE FPK +KS S PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE
        SKLVES SPK+ HE++  CDS+S S ++I  Q SSSS GPNR L+H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  P  T+D SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN
        EAQES G   S DG  IRG SET +SEN NL +GVQT TG      AS E+YS+LS+                                     Y     
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSE-------------------------------------YGLHTN

Query:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG
        REAK +HSQS+RL  E+ I   EK RKS+GRNLS+PDIDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK 
Subjt:  REAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKG

Query:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGC
        D N+PVD+SGSLN V+NDEG AW D+LEEKIP          QV  SECV+E    ++TV+    LS  ++QV   ET ++ DETSKL++SEG+IL   C
Subjt:  DVNIPVDHSGSLNLVENDEGTAWEDELEEKIP----------QVSDSECVVEG---NKTVEDDSKLS-HINQVLILETSIQADETSKLANSEGAILTPGC

Query:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW
        S ANELEPS+DQP EA  EAL + ETI+NHEII DAEKIS  + L+ E  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYW
Subjt:  SIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYW

Query:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM
        EKVD+DSDHQLL DLVYET HNVFEKSF SFLKTFS ++Q+RP+PLGQY+LED REKVAWYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+
Subjt:  EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEM

Query:  EDLILDGLLDEVIS
        EDLILD LL+EVIS
Subjt:  EDLILDGLLDEVIS

A0A6J1J8H7 uncharacterized protein LOC111484426 isoform X21.3e-30166.67Show/hide
Query:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE
        MFKMEK  +RQDSN+QFNRNVPGCFWSIFHTIDYHPWH+VKKMLP+R+HS+SKG PK ++N HHVAE  EQI+DE+KPL CTAESC    + G+ +V E 
Subjt:  MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREE

Query:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK
         +KE+SEEESQKYWKLNSNSKR LSRT SIH L+ S Y          SR   P   M+L A G+KSNSLNAMD++ YF QRKI  Q TSFTEKSNGV+K
Subjt:  ATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTD-YFNQRKICIQSTSFTEKSNGVKK

Query:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP
        TLE+ +      S S KEDVHIQEI+KANRKLFAELL G     TLQTP+N K SASLAKSRSFPAPGLA+KGY KLSSLQHK NE FPK +KS S PQP
Subjt:  TLEANKVSRNFSSRSLKEDVHIQEIYKANRKLFAELLHG---NHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQP

Query:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE
        SKL ES SPK+ HE++  CDS+S S +++  Q SSSSLGPNR ++H GWNQLV+KRFNFIKQKIRHSFKERKKGN QKT+K  P  T+D SGHELPL RE
Subjt:  SKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIP--TMDLSGHELPLDRE

Query:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD
        EAQES G   S+DG  IRG SET +SEN NL +GVQT TG  ASL A  E+YSQ  E      REAK +HSQS+RL  E+ I   EK RKS+GRNLS+PD
Subjt:  EAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLNASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----
        IDLFCTLFTDP   V  TEKPKR +V  STDNN+RIDENP         +PLD+ S +V+EK D N+PVD+S  LN V+NDEG AW D LEEKIP     
Subjt:  IDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENP--------DQPLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIP-----

Query:  -----QVSDSECVVEGNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISI
             QV  SECV+E     E   ++         ET  + DETSKL++SEG+IL   CS+ANELEPS+DQP EA  EAL + ET +NHEII DAEKIS 
Subjt:  -----QVSDSECVVEGNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISI

Query:  SILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQL
         + L+ +  RIN+ADFNYMRYILQLSSFIESGH +DRPLSSS+F+GEEA FYKKLECYWEKVD+ SDHQLL DLVYET HNVFEKSF SFLKTFS +SQ+
Subjt:  SILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQL

Query:  RPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS
        RP+PLGQY+LED REKV+WYLCLGPELDQSLDDVV RDLRKGDDWMNLQ+E E  ALE+EDLILD LL+EVIS
Subjt:  RPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVIS

SwissProt top hitse value%identityAlignment
F4HSD5 Protein TRM325.4e-1530.6Show/hide
Query:  EDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPS-------FETILNHEIIDDAEKISISILLHPEPDRINS
        + D+       VL  E+ ++A++  K    E        S  N L+ SN +  +   +++PS        ET+L   +       S+ I  H E D    
Subjt:  EDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPS-------FETILNHEIIDDAEKISISILLHPEPDRINS

Query:  ADFNYMRYILQLSSFIES------GHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQ
        A F Y++ +L++S F+E+       +  ++PL+ S+            E   ++ +  +D +LL DLV E +      S + F KTF         P G+
Subjt:  ADFNYMRYILQLSSFIES------GHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQ

Query:  YVLEDAREKVAWYLC-LGPE-LDQSLDDVVKRD-LRKGDDWMNLQSETERIALEMEDLILDGLLDEVI
          L++   +V W L  LG E  D+SLDD+V RD L K D WMNLQ E+E + LE+EDLI D +LDE++
Subjt:  YVLEDAREKVAWYLC-LGPE-LDQSLDDVVKRD-LRKGDDWMNLQSETERIALEMEDLILDGLLDEVI

Arabidopsis top hitse value%identityAlignment
AT1G07620.1 GTP-binding protein Obg/CgtA3.8e-1630.6Show/hide
Query:  EDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPS-------FETILNHEIIDDAEKISISILLHPEPDRINS
        + D+       VL  E+ ++A++  K    E        S  N L+ SN +  +   +++PS        ET+L   +       S+ I  H E D    
Subjt:  EDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPS-------FETILNHEIIDDAEKISISILLHPEPDRINS

Query:  ADFNYMRYILQLSSFIES------GHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQ
        A F Y++ +L++S F+E+       +  ++PL+ S+            E   ++ +  +D +LL DLV E +      S + F KTF         P G+
Subjt:  ADFNYMRYILQLSSFIES------GHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQ

Query:  YVLEDAREKVAWYLC-LGPE-LDQSLDDVVKRD-LRKGDDWMNLQSETERIALEMEDLILDGLLDEVI
          L++   +V W L  LG E  D+SLDD+V RD L K D WMNLQ E+E + LE+EDLI D +LDE++
Subjt:  YVLEDAREKVAWYLC-LGPE-LDQSLDDVVKRD-LRKGDDWMNLQSETERIALEMEDLILDGLLDEVI

AT2G45900.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related6.1e-0631.82Show/hide
Query:  SDHQLLLDLVYETLHNVF-EKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLIL
        SD +LL D + E L        ++SF     +K  +   P  +  +E  +E+V W+L   P    +LD +V++DL +  +WM+L+ +   I  E  ++IL
Subjt:  SDHQLLLDLVYETLHNVF-EKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLIL

Query:  DGLLDEVISS
        D LL+E+IS+
Subjt:  DGLLDEVISS

AT4G00440.1 Protein of unknown function (DUF3741)2.3e-0528.04Show/hide
Query:  DHQLLLDLVYETLHNV-FEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILD
        DH+LL D + E L  +     + SF     +  + R     + ++ + +E V W+L   P L  +LD +V++D+ +  +W++++ + + I  E  +LIL+
Subjt:  DHQLLLDLVYETLHNV-FEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILD

Query:  GLLDEVI
         LL+E+I
Subjt:  GLLDEVI

AT4G00440.2 Protein of unknown function (DUF3741)2.3e-0528.04Show/hide
Query:  DHQLLLDLVYETLHNV-FEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILD
        DH+LL D + E L  +     + SF     +  + R     + ++ + +E V W+L   P L  +LD +V++D+ +  +W++++ + + I  E  +LIL+
Subjt:  DHQLLLDLVYETLHNV-FEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILD

Query:  GLLDEVI
         LL+E+I
Subjt:  GLLDEVI

AT5G02390.1 Protein of unknown function (DUF3741)1.4e-2122.41Show/hide
Query:  RNVPGCFWSIFHTIDYHPWHSVKKMLPHRKH--SRSKGGPKLSVNDH--------HVAEV-------AEQISDEDKP--LLCTAESCSFDRRSGKAHVRE
        +N PG  W +F  + ++ W  +KK LPH++    R+  G K  VN+          V ++        E +    KP   + + ES +   ++ K H  E
Subjt:  RNVPGCFWSIFHTIDYHPWHSVKKMLPHRKH--SRSKGGPKLSVNDH--------HVAEV-------AEQISDEDKP--LLCTAESCSFDRRSGKAHVRE

Query:  EATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTDYFNQ---RKICIQST--------
        E +K+++ EE  +  K +S  KR +        ++      G   +   +   Q  +  K + S + S+  N+ + D  N+   +K+ I           
Subjt:  EATKEMSEEESQKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTDYFNQ---RKICIQST--------

Query:  -SFTEKSNGVKKTLEA-NKVSRNFSSRSLKEDVHIQEIYKANR---------------KLFAE------------------LLHGNH-----TLQTP---
            E SN   +     N +  + ++   K D   +E  K  +               K F+E                  L+H N       LQ P   
Subjt:  -SFTEKSNGVKKTLEA-NKVSRNFSSRSLKEDVHIQEIYKANR---------------KLFAE------------------LLHGNH-----TLQTP---

Query:  -------KNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQPSKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNR
               + +  S ++ K+ SFP  G+ ++ +NK       +++  P    S++    +  ++    K     L L D +S+   + R++          
Subjt:  -------KNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQPSKLVESVSPKSTHENLTLCDSNSASCNHIRQQRSSSSLGPNR

Query:  VLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDLSGHELPLDREEAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYA-
             G NQ+VIKRF  ++QKI+H   E       K  K   TMD    ++P          G +               +++N N   G +      A 
Subjt:  VLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDLSGHELPLDREEAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYA-

Query:  SLNASPEKYSQLSEYGLHTNREAK-SYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENPDQ
        SL  S ++Y QL  Y     REAK S +S+  ++  E++ L  ++  K  GR LS+P +              FH                ++ID+   Q
Subjt:  SLNASPEKYSQLSEYGLHTNREAK-SYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENPDQ

Query:  PLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIPQVSDSECVVEGNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCS
         ++                         E  +G     E+ E   Q S+ E + +     ED    +   Q   + T     ET  L             
Subjt:  PLDTASLNVLEKGDVNIPVDHSGSLNLVENDEGTAWEDELEEKIPQVSDSECVVEGNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCS

Query:  IANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSAD---FNYMRYILQLSSF-----IESGHVVDRPLSSSMFD----GEE
             E S + PN   T  +    ++ +     D E +S S  L  E   I++ D   FNY+R IL++S F     +    +  +PL   +++       
Subjt:  IANELEPSNDQPNEACTEALPSFETILNHEIIDDAEKISISILLHPEPDRINSAD---FNYMRYILQLSSF-----IESGHVVDRPLSSSMFD----GEE

Query:  AHFYKKLECYW-EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGP-ELDQSLDDVVKRDLRKGDDWM
           ++  EC   E+   + +H LL DL+ E L  ++E+S+  + K  S   ++ P+P+G  VL++   +++ YL   P +  QS D V+ RDL + D WM
Subjt:  AHFYKKLECYW-EKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTFSLKSQLRPVPLGQYVLEDAREKVAWYLCLGP-ELDQSLDDVVKRDLRKGDDWM

Query:  NLQSETERIALEMEDLILDGLLDEVISS
        +LQ E+E + +E+EDLI + LL+E++ S
Subjt:  NLQSETERIALEMEDLILDGLLDEVISS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAAAATGGAAAAGCAATTCCAGCGCCAGGACTCCAATCTGCAGTTTAACAGAAACGTTCCAGGTTGCTTCTGGAGCATATTCCATACTATTGACTACCATCCCTG
GCATAGTGTTAAAAAGATGCTTCCTCACAGAAAGCATTCAAGAAGCAAGGGAGGTCCAAAATTGTCTGTGAACGACCACCATGTCGCCGAAGTGGCAGAACAAATTAGTG
ACGAAGACAAGCCTCTACTGTGTACTGCAGAGAGTTGCTCTTTCGACCGAAGATCTGGAAAAGCTCATGTTAGGGAAGAGGCAACTAAAGAGATGTCAGAGGAAGAGAGC
CAAAAATACTGGAAATTGAACTCCAATTCTAAACGAAGGTTGAGTAGAACATACTCGATTCATCGTTTAGATCCTTCGCAGTATTCTCCTGGAATTAGTGTCAACATTGG
AGACAGCAGAGTCGCTCAACCGAAAACTGCCATGAAGTTAGCTGCATCAGGACTGAAGAGTAATTCTCTGAATGCCATGGATACGGACTACTTTAATCAAAGAAAAATTT
GTATCCAGTCGACATCTTTTACAGAAAAATCTAATGGAGTAAAGAAAACCTTGGAAGCTAACAAGGTCAGCAGAAACTTCTCCAGTCGTTCATTGAAGGAAGACGTTCAC
ATCCAAGAGATATACAAGGCAAACAGAAAGCTATTTGCTGAACTGTTACATGGTAACCACACCCTCCAAACCCCAAAAAATACTAAGCCATCTGCAAGTCTAGCGAAATC
AAGGTCCTTTCCTGCTCCGGGTTTAGCTCAAAAAGGATATAATAAGCTTAGCTCACTCCAACACAAGATTAATGAGCACTTTCCTAAAGGACAAAAATCTGTTTCTGTCC
CCCAACCATCAAAGCTGGTTGAGTCTGTATCTCCTAAGAGTACTCATGAAAATTTGACGCTTTGTGACTCTAACAGTGCTTCATGCAATCACATAAGACAACAAAGAAGC
TCTTCTTCCTTGGGCCCAAACCGTGTACTAAAGCATGTGGGGTGGAATCAATTGGTTATCAAGCGTTTCAATTTTATTAAACAGAAAATAAGGCACTCATTCAAGGAGAG
GAAGAAGGGAAATTACCAGAAAACAGCTAAACAAATACCAACCATGGATCTTTCTGGACATGAACTTCCCCTTGACAGAGAAGAGGCACAAGAAAGTTCAGGAATTACTC
CAAGCGAAGATGGCTCGGACATTAGAGGTTGTAGTGAGACTGATAGTTCGGAGAATGGCAATCTCGGTAATGGAGTTCAAACCACGACAGGGGATTATGCTTCATTGAAT
GCTTCTCCGGAAAAATATTCTCAACTGTCTGAGTACGGTCTCCATACAAACAGAGAGGCAAAGTCGTACCACTCACAAAGCATAAGGCTGGCAAGTGAAGATAATATATT
GATTAGAGAGAAGCCTCGAAAATCTTTTGGAAGGAATCTTTCCATGCCTGATATTGATCTCTTTTGTACATTATTTACTGACCCTCCTCATGGTGTTTTTCACACAGAAA
AACCAAAAAGGGTTTTGGTGCGTTTCAGTACAGATAATAATATTCGAATAGATGAGAATCCAGACCAACCATTGGATACTGCTTCATTGAATGTGTTAGAAAAAGGTGAT
GTCAACATACCTGTTGATCATTCAGGTAGTTTAAACTTGGTTGAAAATGATGAAGGAACTGCTTGGGAAGATGAGCTTGAGGAAAAAATACCTCAAGTATCGGATAGTGA
ATGCGTAGTCGAAGGCAACAAGACTGTTGAGGATGACAGCAAGCTTTCACACATCAATCAAGTCTTAATACTTGAAACTTCTATTCAAGCGGATGAAACTTCAAAGCTCG
CCAATTCAGAAGGTGCAATACTAACTCCTGGGTGCAGTATAGCAAATGAACTCGAACCTTCCAATGACCAACCTAATGAGGCCTGCACAGAAGCTTTACCATCTTTTGAA
ACCATTTTAAATCATGAGATTATTGATGATGCTGAGAAGATATCTATCTCTATTCTTCTGCATCCGGAACCTGATAGAATCAATAGTGCCGACTTCAACTATATGAGGTA
CATTCTTCAACTCTCCAGCTTTATCGAAAGTGGTCACGTGGTAGACCGACCACTCAGCTCTTCGATGTTTGATGGAGAGGAGGCCCACTTTTACAAAAAACTTGAATGCT
ACTGGGAGAAGGTGGACGAAGATTCTGATCACCAGCTTCTGCTCGATTTAGTTTATGAGACATTGCATAATGTGTTTGAAAAATCATTCGTTTCTTTCCTCAAAACCTTC
TCCTTGAAGAGCCAACTCCGTCCAGTGCCACTCGGGCAATATGTTCTCGAGGACGCCCGGGAAAAAGTTGCCTGGTACCTGTGCCTAGGACCAGAACTCGACCAGTCTTT
AGATGATGTTGTGAAGCGAGATTTAAGAAAAGGTGATGATTGGATGAACCTCCAATCTGAAACAGAGCGCATAGCACTTGAAATGGAGGATTTGATTCTTGATGGGCTTT
TAGATGAAGTAATAAGTTCTTAG
mRNA sequenceShow/hide mRNA sequence
TTGATTTTTAATTCCCCAATTGGTCACATCCAATTTCTACTGTTTATTTGATTGATTATTCAACGAATCAAACCTCCCCATTCTACGAATCCGACTAAATTTCTAATATT
CAACAGCTTAAGAGAACAGGGACCATTGCTTTTGGGTGGATTTGGTCATATCTTTGTGTAACCTAACAGTTTTTGGAATTTGGTTCATTTTTTGTCACCAACCCTTCTCT
GTCTCTGTAAATTATGGAACCGGTCAACTAACCCTACAGCCTTGTGTTCTTCCTGCCCTGAAGTCAAGGCTATGGAGTTTGGACCAGGCCAAGTGACTCGAGCTTGGATC
AAGCGGACAAGAACCCACCAATCTTTGTGTGTACAGAACCTGGAAGACTTGTCATCATGTTTAAAATGGAAAAGCAATTCCAGCGCCAGGACTCCAATCTGCAGTTTAAC
AGAAACGTTCCAGGTTGCTTCTGGAGCATATTCCATACTATTGACTACCATCCCTGGCATAGTGTTAAAAAGATGCTTCCTCACAGAAAGCATTCAAGAAGCAAGGGAGG
TCCAAAATTGTCTGTGAACGACCACCATGTCGCCGAAGTGGCAGAACAAATTAGTGACGAAGACAAGCCTCTACTGTGTACTGCAGAGAGTTGCTCTTTCGACCGAAGAT
CTGGAAAAGCTCATGTTAGGGAAGAGGCAACTAAAGAGATGTCAGAGGAAGAGAGCCAAAAATACTGGAAATTGAACTCCAATTCTAAACGAAGGTTGAGTAGAACATAC
TCGATTCATCGTTTAGATCCTTCGCAGTATTCTCCTGGAATTAGTGTCAACATTGGAGACAGCAGAGTCGCTCAACCGAAAACTGCCATGAAGTTAGCTGCATCAGGACT
GAAGAGTAATTCTCTGAATGCCATGGATACGGACTACTTTAATCAAAGAAAAATTTGTATCCAGTCGACATCTTTTACAGAAAAATCTAATGGAGTAAAGAAAACCTTGG
AAGCTAACAAGGTCAGCAGAAACTTCTCCAGTCGTTCATTGAAGGAAGACGTTCACATCCAAGAGATATACAAGGCAAACAGAAAGCTATTTGCTGAACTGTTACATGGT
AACCACACCCTCCAAACCCCAAAAAATACTAAGCCATCTGCAAGTCTAGCGAAATCAAGGTCCTTTCCTGCTCCGGGTTTAGCTCAAAAAGGATATAATAAGCTTAGCTC
ACTCCAACACAAGATTAATGAGCACTTTCCTAAAGGACAAAAATCTGTTTCTGTCCCCCAACCATCAAAGCTGGTTGAGTCTGTATCTCCTAAGAGTACTCATGAAAATT
TGACGCTTTGTGACTCTAACAGTGCTTCATGCAATCACATAAGACAACAAAGAAGCTCTTCTTCCTTGGGCCCAAACCGTGTACTAAAGCATGTGGGGTGGAATCAATTG
GTTATCAAGCGTTTCAATTTTATTAAACAGAAAATAAGGCACTCATTCAAGGAGAGGAAGAAGGGAAATTACCAGAAAACAGCTAAACAAATACCAACCATGGATCTTTC
TGGACATGAACTTCCCCTTGACAGAGAAGAGGCACAAGAAAGTTCAGGAATTACTCCAAGCGAAGATGGCTCGGACATTAGAGGTTGTAGTGAGACTGATAGTTCGGAGA
ATGGCAATCTCGGTAATGGAGTTCAAACCACGACAGGGGATTATGCTTCATTGAATGCTTCTCCGGAAAAATATTCTCAACTGTCTGAGTACGGTCTCCATACAAACAGA
GAGGCAAAGTCGTACCACTCACAAAGCATAAGGCTGGCAAGTGAAGATAATATATTGATTAGAGAGAAGCCTCGAAAATCTTTTGGAAGGAATCTTTCCATGCCTGATAT
TGATCTCTTTTGTACATTATTTACTGACCCTCCTCATGGTGTTTTTCACACAGAAAAACCAAAAAGGGTTTTGGTGCGTTTCAGTACAGATAATAATATTCGAATAGATG
AGAATCCAGACCAACCATTGGATACTGCTTCATTGAATGTGTTAGAAAAAGGTGATGTCAACATACCTGTTGATCATTCAGGTAGTTTAAACTTGGTTGAAAATGATGAA
GGAACTGCTTGGGAAGATGAGCTTGAGGAAAAAATACCTCAAGTATCGGATAGTGAATGCGTAGTCGAAGGCAACAAGACTGTTGAGGATGACAGCAAGCTTTCACACAT
CAATCAAGTCTTAATACTTGAAACTTCTATTCAAGCGGATGAAACTTCAAAGCTCGCCAATTCAGAAGGTGCAATACTAACTCCTGGGTGCAGTATAGCAAATGAACTCG
AACCTTCCAATGACCAACCTAATGAGGCCTGCACAGAAGCTTTACCATCTTTTGAAACCATTTTAAATCATGAGATTATTGATGATGCTGAGAAGATATCTATCTCTATT
CTTCTGCATCCGGAACCTGATAGAATCAATAGTGCCGACTTCAACTATATGAGGTACATTCTTCAACTCTCCAGCTTTATCGAAAGTGGTCACGTGGTAGACCGACCACT
CAGCTCTTCGATGTTTGATGGAGAGGAGGCCCACTTTTACAAAAAACTTGAATGCTACTGGGAGAAGGTGGACGAAGATTCTGATCACCAGCTTCTGCTCGATTTAGTTT
ATGAGACATTGCATAATGTGTTTGAAAAATCATTCGTTTCTTTCCTCAAAACCTTCTCCTTGAAGAGCCAACTCCGTCCAGTGCCACTCGGGCAATATGTTCTCGAGGAC
GCCCGGGAAAAAGTTGCCTGGTACCTGTGCCTAGGACCAGAACTCGACCAGTCTTTAGATGATGTTGTGAAGCGAGATTTAAGAAAAGGTGATGATTGGATGAACCTCCA
ATCTGAAACAGAGCGCATAGCACTTGAAATGGAGGATTTGATTCTTGATGGGCTTTTAGATGAAGTAATAAGTTCTTAGAATCTTCTTGGTAGATCTTGCTTGATGGTGT
ATAGTAGCTATCTCCAAATATGTGTACATGAGTTCTGGTTAAAGAAAGCTGTCTACCTTCAACATTCGAGAATTTAAAAGTCATATAGATTAAATATAAACAGAAAAATA
GATACAGTGATAGTGTGTTTGTTTATTTGGTTTTTACTCCCATAAGAAGGGTTAGGGGGGGTTAGATTCTGAATTTCAGTTTCTTGTAAATTAGACATACTCTATTCTGT
TTTTCGAGAAAATATGAATTATTGTAGTTGTTTACACGATTCATATCTTCAAAGATGATTGATTGTAGAACTTGGAATACTCTTTTTTTTATCTGTAAAATCAACCATTG
CAA
Protein sequenceShow/hide protein sequence
MFKMEKQFQRQDSNLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPHRKHSRSKGGPKLSVNDHHVAEVAEQISDEDKPLLCTAESCSFDRRSGKAHVREEATKEMSEEES
QKYWKLNSNSKRRLSRTYSIHRLDPSQYSPGISVNIGDSRVAQPKTAMKLAASGLKSNSLNAMDTDYFNQRKICIQSTSFTEKSNGVKKTLEANKVSRNFSSRSLKEDVH
IQEIYKANRKLFAELLHGNHTLQTPKNTKPSASLAKSRSFPAPGLAQKGYNKLSSLQHKINEHFPKGQKSVSVPQPSKLVESVSPKSTHENLTLCDSNSASCNHIRQQRS
SSSLGPNRVLKHVGWNQLVIKRFNFIKQKIRHSFKERKKGNYQKTAKQIPTMDLSGHELPLDREEAQESSGITPSEDGSDIRGCSETDSSENGNLGNGVQTTTGDYASLN
ASPEKYSQLSEYGLHTNREAKSYHSQSIRLASEDNILIREKPRKSFGRNLSMPDIDLFCTLFTDPPHGVFHTEKPKRVLVRFSTDNNIRIDENPDQPLDTASLNVLEKGD
VNIPVDHSGSLNLVENDEGTAWEDELEEKIPQVSDSECVVEGNKTVEDDSKLSHINQVLILETSIQADETSKLANSEGAILTPGCSIANELEPSNDQPNEACTEALPSFE
TILNHEIIDDAEKISISILLHPEPDRINSADFNYMRYILQLSSFIESGHVVDRPLSSSMFDGEEAHFYKKLECYWEKVDEDSDHQLLLDLVYETLHNVFEKSFVSFLKTF
SLKSQLRPVPLGQYVLEDAREKVAWYLCLGPELDQSLDDVVKRDLRKGDDWMNLQSETERIALEMEDLILDGLLDEVISS