CuGenDBv2

Pay0007343 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0007343
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: NASP-related protein sim3
Genome location: chr11:1431105..1434999
RNA-Seq Expression: Pay0007343
Synteny: Pay0007343
Gene Ontology terms:
  GO:0006335 - DNA replication-dependent nucleosome assembly (biological process)
  GO:0034080 - CENP-A containing nucleosome assembly (biological process)
  GO:0005654 - nucleoplasm (cellular component)
  GO:0042393 - histone binding (molecular function)
InterPro domains:
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR019734 - Tetratricopeptide repeat
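
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` notation uses 1-based inclusive coordinates (an assumption, not something stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, span_bp).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr11:1431105..1434999")
print(chrom, start, end, span)
```

Under that coordinate assumption the gene spans 3895 bp on chr11.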


Homology
GenBank top hits (e value, %identity, alignment)
XP_004148044.1 NASP-related protein sim3 [Cucumis sativus] (e value: 2.2e-236; %identity: 95.02)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSEVSVTVDKPKLDETLNVSEVTTESI QGGL SSCNSPNEKKPIT+PTAQTSD SG+KSL+LAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD
        AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDK +S KSAVNGESSKASVSSNAE VDGVTDDVSETVSKKD+DEEE+D SDAEDLAD
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD

Query:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
        ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
Subjt:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK

Query:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI
        AISICKSRVVRLTDEVKS+IVPTTASSTSGSEPE+PLSSNGSQTDNENA TEKQSEI+TLSGLLVELEKK  LEDLQQ ASNP SILSEILGIGSAKPN+
Subjt:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI

Query:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDL-SSQDKGDSSSA
        EKITPPVPSVFNSSQMGSA+SNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDL SSQDKGDSSSA
Subjt:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDL-SSQDKGDSSSA

XP_008457853.1 PREDICTED: NASP-related protein sim3 [Cucumis melo] (e value: 9.5e-248; %identity: 99.38)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD
        AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD

Query:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
        ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
Subjt:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK

Query:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI
        AISICKSRVVRLTDEVKS+IVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKK  LEDLQQLASNPMSILSEILGIGSAKPNI
Subjt:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI

Query:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA
        EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA
Subjt:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA

XP_022928044.1 protein HGV2 [Cucurbita moschata] (e value: 6.2e-207; %identity: 86.09)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSE+SVT+ KPKLDE LNVS+ TTES   GGLDSSCN  NE KP  E TAQTSDGSGEKSLELAEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE
        AA YGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    QS K ES K+  NGESSKASVSSNAE+VDGV DDVS   SKKDQDEEE DDS  E
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE

Query:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
        DLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
Subjt:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS

Query:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA
        +CQKAISICKSRV+RLTDEVK +IVPTTASSTSGSEPEIPLSSN SQTDN NAATEKQSEIE LSGLLVELEKK  LEDLQQLASNP SILSEILGIG+A
Subjt:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA

Query:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA
        K  +EKI PP+P+V NSSQMGSANSNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVSTNSES D S+PTKK A D S+QDKGD SSA
Subjt:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA

XP_023513004.1 LOW QUALITY PROTEIN: protein HGV2 [Cucurbita pepo subsp. pepo] (e value: 1.1e-206; %identity: 85.89)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSE+SVT+ KPKLDE LNVS+ TTES   GGLDSSCN  NE KP  E TAQTSDGSGEKSLELAEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE
        AA YGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    QS K ES K+A NGESSKASVSSNAE+VDGV DDVS   SKKDQDEEE DDS  E
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE

Query:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
        DLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDIL ALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
Subjt:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS

Query:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA
        +CQKAISICKSRV+RLTDEVK +IVPTTASSTSGSEPE+PLSSN SQTDN NAATEKQSEIE LSGLLVELEKK  LEDLQQLASNP SILSEILGIG+A
Subjt:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA

Query:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA
        K  +EKI PP+P+V NSSQMGSANSNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVSTNSES D S+PTKK A D S+QDKGD SSA
Subjt:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA

XP_038902205.1 NASP-related protein sim3 [Benincasa hispida] (e value: 3.1e-230; %identity: 92.99)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADED PSEVSVTV+KPKLDETLNVSEVTTESI QGGL+SSCNSP+EKK + EPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVS-KKDQDEEETDDSDAEDLA
        AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG+SDK ES K+AVNGESSKASVSSNAE+VDGVTDDVSETVS KKDQDEEE DDSDAEDLA
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVS-KKDQDEEETDDSDAEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQ
        DADEDESDLDLAWKMLDVARAIVEK+S DTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQ
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQ

Query:  KAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPN
        KAISICKSRVVRLTDEVKS IVPTTASSTSGSEPE+PLSSNGSQTDN+NAATEKQSEIETLSGLLVELEKK  LEDLQQLASNP SILSEILGIGSAK N
Subjt:  KAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPN

Query:  IEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA
        +EKITPPVP+VFNSSQMGSANSNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVST SES DSNPTKKLA D SSQDKGD SSA
Subjt:  IEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA

TrEMBL top hits (e value, %identity, alignment)
A0A0A0LMR2 TPR_REGION domain-containing protein (e value: 1.1e-236; %identity: 95.02)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSEVSVTVDKPKLDETLNVSEVTTESI QGGL SSCNSPNEKKPIT+PTAQTSD SG+KSL+LAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD
        AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDK +S KSAVNGESSKASVSSNAE VDGVTDDVSETVSKKD+DEEE+D SDAEDLAD
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD

Query:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
        ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
Subjt:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK

Query:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI
        AISICKSRVVRLTDEVKS+IVPTTASSTSGSEPE+PLSSNGSQTDNENA TEKQSEI+TLSGLLVELEKK  LEDLQQ ASNP SILSEILGIGSAKPN+
Subjt:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI

Query:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDL-SSQDKGDSSSA
        EKITPPVPSVFNSSQMGSA+SNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDL SSQDKGDSSSA
Subjt:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDL-SSQDKGDSSSA

A0A1S3C6L8 NASP-related protein sim3 (e value: 4.6e-248; %identity: 99.38)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD
        AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLAD

Query:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
        ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
Subjt:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK

Query:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI
        AISICKSRVVRLTDEVKS+IVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKK  LEDLQQLASNPMSILSEILGIGSAKPNI
Subjt:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI

Query:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA
        EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA
Subjt:  EKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA

A0A6J1E053 NASP-related protein sim3 (e value: 4.2e-201; %identity: 83.81)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSEVSVTVDKPK+DE+LN SEVT ES AQGG++SS N  N+K   TE TAQTSDGSGEKSLE+AEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE
        AAHYGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    +SDK  S KSAVNGESSKASVSSNAE+VDGVTDDVS   SKKDQDEE+ D+SDAE
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE

Query:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
        DLA+ADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI 
Subjt:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS

Query:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA
        +CQKAISICKSRV+RLTDEVKS++VPTTASSTSGSEP   LSSN SQ D +NAA+EKQSEIETLSGLLVELEKK  LEDLQQLASNP SILSEILGIGSA
Subjt:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA

Query:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA
        +  + + + P P+  NSSQ+ SANSNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVSTNSES +SNP KK A D SSQDKGD SSA
Subjt:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA

A0A6J1EJQ1 protein HGV2 (e value: 3.0e-207; %identity: 86.09)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSE+SVT+ KPKLDE LNVS+ TTES   GGLDSSCN  NE KP  E TAQTSDGSGEKSLELAEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE
        AA YGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    QS K ES K+  NGESSKASVSSNAE+VDGV DDVS   SKKDQDEEE DDS  E
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE

Query:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
        DLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
Subjt:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS

Query:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA
        +CQKAISICKSRV+RLTDEVK +IVPTTASSTSGSEPEIPLSSN SQTDN NAATEKQSEIE LSGLLVELEKK  LEDLQQLASNP SILSEILGIG+A
Subjt:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA

Query:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA
        K  +EKI PP+P+V NSSQMGSANSNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVSTNSES D S+PTKK A D S+QDKGD SSA
Subjt:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA

A0A6J1I412 protein HGV2 (e value: 4.3e-206; %identity: 85.69)
Query:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEDPPSE+SVT+ KPKLDE LNVS+ TTES   GGL+SSCN  NE KP  E TAQTSDGSGEKSLELAEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE
        AA YGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    QS K ES KSA NGESSKASVSSNAE+VDGV DDVS   SKKDQDEEE DDS  E
Subjt:  AAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAE

Query:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
        DLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
Subjt:  DLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS

Query:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA
        +CQKAISICKSRV+RLTDEVK +IVPTTASSTSGSEPE+PLSSN SQ+D+ NAATEKQSEIE LSGLLVELEKK  LEDLQQLASNP SILSEILGIG+A
Subjt:  YCQKAISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSA

Query:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA
        K  +EKI PP+P+V NSSQMGSANSNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVSTNSES D S+PTKK A D S+QDKGD SSA
Subjt:  KPNIEKITPPVPSVFNSSQMGSANSNGGFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSSSA

SwissProt top hits (e value, %identity, alignment)
P27123 Nuclear autoantigenic sperm protein (Fragment) (e value: 3.2e-04; %identity: 24.57)
Query:  EEADPLG-AVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKK-DQDEEET-DDSDAED--LADADEDE-SDLDLAWKMLDVARAI
        EE++  G  V  K  Q    +S +  V   +++ +     ++ +G   + SE   K+ D+ EEET +DS  E+  L + +E+E  +L+LAW MLD+A+ I
Subjt:  EEADPLG-AVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKK-DQDEEET-DDSDAED--LADADEDE-SDLDLAWKMLDVARAI

Query:  VEKDSADTME--KVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQKAISICKSRVVRLTDEVKSL
         ++      +         L EV++E ++   ++ ++Q  LS+ E+ +E  +R LAE ++++ L   + SQ  EA++   K+I + + R+  L +++K  
Subjt:  VEKDSADTME--KVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQKAISICKSRVVRLTDEVKSL

Query:  IVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNIEKITPPVPSVFNSSQMGSA
                                 + E ++TE + EIE L  LL E+ +K       Q + N   +  +   +GS+            S F  S  GS+
Subjt:  IVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNIEKITPPVPSVFNSSQMGSA

Query:  NSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKD
         S      P    + +N VT +  + R  K+     ES   +  KK  ++
Subjt:  NSNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKD

Q17886 Protein NASP homolog 1 (e value: 3.1e-07; %identity: 24.36)
Query:  TSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGES
        T + + E+  +   ELL  G +A+K ND ++A D  S A E+ +  YGE         Y YG A L  A+EE+  L    +KE             +G+ 
Subjt:  TSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGES

Query:  SKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLADADEDESD-LDLAWKMLDVARAI---------VEKDSADTMEK-----VDILSALAEV
         +A          G +DD      K D++  ET+  D E+  + ++D+ D + L+W++L+ AR I          E+     +E+      D+L  L E 
Subjt:  SKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLADADEDESD-LDLAWKMLDVARAI---------VEKDSADTMEK-----VDILSALAEV

Query:  ALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQKAISICKSRVVRLTDEVK
         +       +  D  +AL+I   ++ P +R++A+    +       +   E + Y  K   +  +R   L  E++
Subjt:  ALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQKAISICKSRVVRLTDEVK

Q9USQ4 NASP-related protein sim3 (e value: 6.4e-05; %identity: 24.5)
Query:  LDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELASECVKLYYKYGCALLYKAQEEADPLG-AV
        + S   +    K  +   A T + S   S  + E+L+ +G+ A    ++ EAVD + +AL    + +G  + E   + + YG +L   A E +  LG A+
Subjt:  LDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELASECVKLYYKYGCALLYKAQEEADPLG-AV

Query:  PKKEGQSDKAESAK--SAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLADADEDESDLDLAWKMLDVARAIVEK------DSAD
          KE  S   ES +   A+   +       N   V+     ++    +K+ +E+ET+++       ++EDE D ++AW++LD+ R +  K      DS D
Subjt:  PKKEGQSDKAESAK--SAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLADADEDESDLDLAWKMLDVARAIVEK------DSAD

Query:  -TMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVE-PDNRQLAELNFRVCLCLEF-----GSQPQEAISYCQKAISICKSRVVRLTDEVKSLIV
          +   DI   L E++LE E+   +  D + AL   E++    +N  L+E ++++ L LEF      S    A  + +KA  I K+ +    +EV     
Subjt:  -TMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVE-PDNRQLAELNFRVCLCLEF-----GSQPQEAISYCQKAISICKSRVVRLTDEVKSLIV

Query:  PTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNIEKITPPVPSVFNSSQMGSANS
                           G Q   E+  T   S++E L  +L ELE+K                    L +    P++E+    V S  + S + S   
Subjt:  PTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNIEKITPPVPSVFNSSQMGSANS

Query:  NGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQD
            DS +++ A    V +   +G  VKR  T  E   S+  K+  KD   +D
Subjt:  NGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQD

Arabidopsis top hits (e value, %identity, alignment)
AT4G37210.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 4.9e-109; %identity: 52.56)
Query:  DPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNS-PNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAH
        +P +E++ T     L+  L   E T ES+ QGG +S+CN+  N          +  D   EK+LE AEEL EKGS  +K+NDF EAVDCFSRALEIR AH
Subjt:  DPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNS-PNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAH

Query:  YGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQ--DEEETDDSDAEDL-AD
        YGEL +EC+  YY+YG ALL KAQ EADPLG +PKKEG+  +  S     NGES   SV S      G +    E    KDQ  D E+  D D  D   D
Subjt:  YGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQ--DEEETDDSDAEDL-AD

Query:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
        ADEDESDLD+AWKMLD+AR I +K S +TMEKVDIL +LAEV+LEREDI +SLSDY+ ALSILERLVEPD+R+ AELNFR+C+CLE G QP+EAI YCQK
Subjt:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK

Query:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI
        A+ ICK+R+ RL++E+K      T+S+ S  +  I  SSN    D   +A++K+ EI  L+GL  +LEKK  LEDL+Q A NP  +L+E++G+ SAKPN 
Subjt:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNI

Query:  EKITPPVPSVFNSSQMGSANSNGGFD--SPTVSTAHT--------NGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSS
             P  +  +SS+MG+ N+N G D  SPTVSTAHT        +GVTHLGVVGRGVKRV  N+ S +S+ +KK A + S +  G+SS
Subjt:  EKITPPVPSVFNSSQMGSANSNGGFD--SPTVSTAHT--------NGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSS

AT4G37210.2 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 8.4e-85; %identity: 53.49)
Query:  DPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNS-PNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAH
        +P +E++ T     L+  L   E T ES+ QGG +S+CN+  N          +  D   EK+LE AEEL EKGS  +K+NDF EAVDCFSRALEIR AH
Subjt:  DPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNS-PNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAH

Query:  YGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQ--DEEETDDSDAEDL-AD
        YGEL +EC+  YY+YG ALL KAQ EADPLG +PKKEG+  +  S     NGES   SV S      G +    E    KDQ  D E+  D D  D   D
Subjt:  YGELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQ--DEEETDDSDAEDL-AD

Query:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK
        ADEDESDLD+AWKMLD+AR I +K S +TMEKVDIL +LAEV+LEREDI +SLSDY+ ALSILERLVEPD+R+ AELNFR+C+CLE G QP+EAI YCQK
Subjt:  ADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQK

Query:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKAS
        A+ ICK+R+ RL++E+K      T+S+ S  +  I  SSN    D   +A++K+ EI  L+GL  +LEKKA+
Subjt:  AISICKSRVVRLTDEVKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKAS


Sequences
CDS sequence
ATGGCCGACGAGGATCCACCATCGGAGGTCTCAGTGACGGTGGACAAACCCAAACTTGACGAAACTCTCAACGTCAGTGAGGTCACCACTGAGTCCATTGCTCAA
GGAGGCCTCGACTCGTCTTGCAATTCTCCGAATGAAAAGAAGCCCATTACTGAACCCACTGCTCAGACCTCTGATGGAAGCGGGGAGAAATCGTTAGAGCTGGCG
GAAGAGTTGCTTGAGAAGGGATCCAAGGCCATGAAGGATAATGATTTCAATGAGGCTGTCGATTGCTTCAGCCGTGCCCTTGAGATTCGAGCTGCGCATTATGGT
GAACTGGCTTCAGAATGTGTCAAATTGTACTACAAATATGGATGTGCTTTATTGTACAAAGCCCAGGAGGAGGCAGACCCACTGGGAGCTGTCCCAAAGAAGGAA
GGTCAATCTGACAAGGCTGAATCTGCAAAGAGTGCTGTTAATGGTGAATCATCAAAAGCTTCTGTTTCCAGCAATGCTGAAGTGGTGGATGGGGTTACAGATGAT
GTTTCCGAGACAGTCAGTAAGAAGGATCAGGATGAAGAAGAAACTGATGATAGTGATGCTGAGGACTTGGCAGATGCAGATGAAGATGAATCTGACCTTGATTTA
GCTTGGAAAATGCTAGATGTTGCCAGAGCAATCGTGGAAAAAGACTCAGCTGACACTATGGAGAAAGTGGACATACTCTCAGCCTTGGCAGAAGTTGCATTAGAA
AGAGAGGACATTGGAACTTCCCTCAGTGACTACCAGAAAGCGTTATCAATTTTAGAAAGACTTGTTGAACCTGACAATCGACAGCTTGCTGAACTAAATTTCCGT
GTATGCTTGTGTTTGGAGTTTGGTTCTCAGCCGCAGGAAGCCATTTCATATTGCCAGAAGGCAATATCAATTTGCAAGTCACGTGTGGTGCGGCTCACTGACGAA
GTAAAGAGTCTCATTGTACCAACGACAGCTTCGTCTACATCAGGGTCAGAACCAGAGATCCCACTATCCTCCAATGGCTCCCAGACTGACAACGAAAATGCTGCA
ACGGAGAAACAATCTGAGATTGAAACTCTATCTGGGCTTTTGGTTGAGCTAGAAAAGAAGGCGAGTCTTGAAGATCTTCAACAGCTGGCCTCAAACCCAATGTCA
ATTCTCTCAGAGATCCTCGGTATTGGATCAGCAAAGCCAAATATCGAAAAGATCACACCTCCAGTTCCATCAGTGTTCAACTCCTCACAAATGGGTTCAGCTAAC
AGCAATGGAGGATTCGACTCTCCAACAGTCTCAACTGCCCACACGAACGGCGTGACGCACCTTGGTGTTGTTGGAAGAGGAGTGAAACGAGTATCAACAAATTCA
GAGTCTAATGACTCGAACCCAACGAAGAAACTGGCAAAAGATTTATCATCACAAGATAAAGGCGACAGTAGTTCCGCCTGA
mRNA sequence
ATGGCCGACGAGGATCCACCATCGGAGGTCTCAGTGACGGTGGACAAACCCAAACTTGACGAAACTCTCAACGTCAGTGAGGTCACCACTGAGTCCATTGCTCAA
GGAGGCCTCGACTCGTCTTGCAATTCTCCGAATGAAAAGAAGCCCATTACTGAACCCACTGCTCAGACCTCTGATGGAAGCGGGGAGAAATCGTTAGAGCTGGCG
GAAGAGTTGCTTGAGAAGGGATCCAAGGCCATGAAGGATAATGATTTCAATGAGGCTGTCGATTGCTTCAGCCGTGCCCTTGAGATTCGAGCTGCGCATTATGGT
GAACTGGCTTCAGAATGTGTCAAATTGTACTACAAATATGGATGTGCTTTATTGTACAAAGCCCAGGAGGAGGCAGACCCACTGGGAGCTGTCCCAAAGAAGGAA
GGTCAATCTGACAAGGCTGAATCTGCAAAGAGTGCTGTTAATGGTGAATCATCAAAAGCTTCTGTTTCCAGCAATGCTGAAGTGGTGGATGGGGTTACAGATGAT
GTTTCCGAGACAGTCAGTAAGAAGGATCAGGATGAAGAAGAAACTGATGATAGTGATGCTGAGGACTTGGCAGATGCAGATGAAGATGAATCTGACCTTGATTTA
GCTTGGAAAATGCTAGATGTTGCCAGAGCAATCGTGGAAAAAGACTCAGCTGACACTATGGAGAAAGTGGACATACTCTCAGCCTTGGCAGAAGTTGCATTAGAA
AGAGAGGACATTGGAACTTCCCTCAGTGACTACCAGAAAGCGTTATCAATTTTAGAAAGACTTGTTGAACCTGACAATCGACAGCTTGCTGAACTAAATTTCCGT
GTATGCTTGTGTTTGGAGTTTGGTTCTCAGCCGCAGGAAGCCATTTCATATTGCCAGAAGGCAATATCAATTTGCAAGTCACGTGTGGTGCGGCTCACTGACGAA
GTAAAGAGTCTCATTGTACCAACGACAGCTTCGTCTACATCAGGGTCAGAACCAGAGATCCCACTATCCTCCAATGGCTCCCAGACTGACAACGAAAATGCTGCA
ACGGAGAAACAATCTGAGATTGAAACTCTATCTGGGCTTTTGGTTGAGCTAGAAAAGAAGGCGAGTCTTGAAGATCTTCAACAGCTGGCCTCAAACCCAATGTCA
ATTCTCTCAGAGATCCTCGGTATTGGATCAGCAAAGCCAAATATCGAAAAGATCACACCTCCAGTTCCATCAGTGTTCAACTCCTCACAAATGGGTTCAGCTAAC
AGCAATGGAGGATTCGACTCTCCAACAGTCTCAACTGCCCACACGAACGGCGTGACGCACCTTGGTGTTGTTGGAAGAGGAGTGAAACGAGTATCAACAAATTCA
GAGTCTAATGACTCGAACCCAACGAAGAAACTGGCAAAAGATTTATCATCACAAGATAAAGGCGACAGTAGTTCCGCCTGA
Protein sequence
MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSDGSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYG
ELASECVKLYYKYGCALLYKAQEEADPLGAVPKKEGQSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDVSETVSKKDQDEEETDDSDAEDLADADEDESDLDL
AWKMLDVARAIVEKDSADTMEKVDILSALAEVALEREDIGTSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQKAISICKSRVVRLTDE
VKSLIVPTTASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKKASLEDLQQLASNPMSILSEILGIGSAKPNIEKITPPVPSVFNSSQMGSAN
SNGGFDSPTVSTAHTNGVTHLGVVGRGVKRVSTNSESNDSNPTKKLAKDLSSQDKGDSSSA
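
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a sequence read in frame from its first base; the helper name `translate` is hypothetical, and only the first ten codons of the CDS are used here for brevity.

```python
# Standard genetic code (NCBI table 1), with codons ordered by
# first, second, third base over the alphabet T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA sequence; stop at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGCCGACGAGGATCCACCATCGGAGGTC"))  # MADEDPPSEV
```

Pasting the full CDS into `translate` should reproduce the protein sequence listed above, provided the standard code applies to this gene.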