; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020386 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020386
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTetratricopeptide repeat (TPR)-like superfamily protein
Genome locationscaffold211:156583..161182
RNA-Seq ExpressionMS020386
SyntenyMS020386
Gene Ontology termsGO:0006335 - DNA replication-dependent nucleosome assembly (biological process)
GO:0034080 - CENP-A containing nucleosome assembly (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0042393 - histone binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011181.1 Nuclear autoantigenic sperm protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-20584.74Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSE+SVT+ KPK+DE LN S+ T ES+  GG++SS NCSN+     E+TAQTSDGSGEKSLE+AEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
        AA YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE HQ+S KD SVK+  NGESSKASVSSNAELVDGV DDVSSKKDQDEE+ D+S  EDLA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA

Query:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
        +ADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI FCQ
Subjt:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ

Query:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
        KAISICKSRV+RLTDEVK I+VPTTASSTSGSEP   LSSN SQ D  NAA+EKQSEIE LSGLLVELEKKLEDLQQLASNPKSILSEILGIG+A++KV 
Subjt:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN

Query:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA
        EK AP  PA LNSSQ+ SANSNGGFDSPTVSTAHTNGA GVTHLGVVGRGVKRVSTNSESA+ S+P KK A DSS+QDKGDGSSA
Subjt:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA

XP_022158694.1 NASP-related protein sim3 [Momordica charantia]3.8e-249100Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
        AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA

Query:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
        EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
Subjt:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ

Query:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
        KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
Subjt:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN

Query:  EKSAPPAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA
        EKSAPPAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA
Subjt:  EKSAPPAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA

XP_022971716.1 protein HGV2 [Cucurbita maxima]4.0e-20685.36Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSE+SVT+ KPK+DE LN S+ T ES+  GG+ESS NCSN+     E+TAQTSDGSGEKSLE+AEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
        AA YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE HQ+S KD SVKSA NGESSKASVSSNAELVDGV DDVSSKKDQDEE+ D+S  EDLA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA

Query:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
        +ADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI FCQ
Subjt:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ

Query:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
        KAISICKSRV+RLTDEVK I+VPTTASSTSGSEP   LSSN SQ D  NAA+EKQSEIE LSGLLVELEKKLEDLQQLASNPKSILSEILGIG+A++KV 
Subjt:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN

Query:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA
        EK AP  PA LNSSQ+ SANSNGGFDSPTVSTAHTNGA GVTHLGVVGRGVKRVSTNSESA+ S+P KK A DSS+QDKGDGSSA
Subjt:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA

XP_023513004.1 LOW QUALITY PROTEIN: protein HGV2 [Cucurbita pepo subsp. pepo]1.2e-20584.95Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSE+SVT+ KPK+DE LN S+ T ES+  GG++SS NCSN+   A E+TAQTSDGSGEKSLE+AEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
        AA YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE HQ+S KD SVK+A NGESSKASVSSNAELVDGV DDVSSKKDQDEE+ D+S  EDLA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA

Query:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
        +ADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDIL ALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI FCQ
Subjt:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ

Query:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
        KAISICKSRV+RLTDEVK I+VPTTASSTSGSEP   LSSN SQ D  NAA+EKQSEIE LSGLLVELEKKLEDLQQLASNPKSILSEILGIG+A++KV 
Subjt:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN

Query:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA
        EK AP  PA LNSSQ+ SANSNGGFDSPTVSTAHTNGA GVTHLGVVGRGVKRVSTNSESA+ S+P KK A DSS+QDKGDGSSA
Subjt:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA

XP_038902205.1 NASP-related protein sim3 [Benincasa hispida]1.1e-20685.42Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE  PSEVSVTV+KPK+DE+LN SEVT ES  QGG+ESS N  ++K A  E TAQTSDGSGEKSLE+AEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDD----VSSKKDQDEEDADNSDA
        AAHYGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    ESDKD SVK+AVNGESSKASVSSNAE+VDGVTDD    VS KKDQDEE+AD+SDA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDD----VSSKKDQDEEDADNSDA

Query:  EDLAEADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAI
        EDLA+ADEDESDLDLAWKMLDVARAIVEK+SGDTMEKVDILSALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI
Subjt:  EDLAEADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAI

Query:  PFCQKAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSAR
         +CQKAISICKSRV+RLTDEVKS +VPTTASSTSGSEP   LSSN SQ D DNAA+EKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSA+
Subjt:  PFCQKAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSAR

Query:  SKVNEKSAP-PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA
        + V + + P PA  NSSQ+ SANSNGGFDSPTVSTAHTNGA GVTHLGVVGRGVKRVST SESA+SNP KK A DSSSQDKGDGSSA
Subjt:  SKVNEKSAP-PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA

TrEMBL top hitse value%identityAlignment
A0A0A0LMR2 TPR_REGION domain-containing protein4.4e-19882.75Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSEVSVTVDKPK+DE+LN SEVT ES  QGG++SS N  N+K   T+ TAQTSD SG+KSL++AEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVS---SKKDQDEEDADNSDAE
        AAHYGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    +SDKD SVKSAVNGESSKASVSSNAE VDGVTDDVS   SKKD+DEE++D SDAE
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVS---SKKDQDEEDADNSDAE

Query:  DLAEADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIP
        DLA+ADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI 
Subjt:  DLAEADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIP

Query:  FCQKAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARS
        +CQKAISICKSRV+RLTDEVKS++VPTTASSTSGSEP   LSSN SQ D +NA +EKQSEI+TLSGLLVELEKKLEDLQQ ASNPKSILSEILGIGSA+ 
Subjt:  FCQKAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARS

Query:  KVNEKSAP-PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAID-SSSQDKGDGSSA
         + + + P P+  NSSQ+ SA+SNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVSTNSES +SNP KK A D SSSQDKGD SSA
Subjt:  KVNEKSAP-PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAID-SSSQDKGDGSSA

A0A1S3C6L8 NASP-related protein sim36.5e-20284.16Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSEVSVTVDKPK+DE+LN SEVT ES AQGG++SS N  N+K   TE TAQTSDGSGEKSLE+AEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVS---SKKDQDEEDADNSDAE
        AAHYGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    +SDK  S KSAVNGESSKASVSSNAE+VDGVTDDVS   SKKDQDEE+ D+SDAE
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVS---SKKDQDEEDADNSDAE

Query:  DLAEADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIP
        DLA+ADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALEREDI TSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI 
Subjt:  DLAEADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIP

Query:  FCQKAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARS
        +CQKAISICKSRV+RLTDEVKS++VPTTASSTSGSEP   LSSN SQ D +NAA+EKQSEIETLSGLLVELEKKLEDLQQLASNP SILSEILGIGSA+ 
Subjt:  FCQKAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARS

Query:  KVNEKSAP-PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA
         + + + P P+  NSSQ+ SANSNGGFDSPTVSTAHTN   GVTHLGVVGRGVKRVSTNSES +SNP KK A D SSQDKGD SSA
Subjt:  KVNEKSAP-PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA

A0A6J1E053 NASP-related protein sim31.9e-249100Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
        AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA

Query:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
        EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
Subjt:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ

Query:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
        KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
Subjt:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN

Query:  EKSAPPAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA
        EKSAPPAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA
Subjt:  EKSAPPAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA

A0A6J1EJQ1 protein HGV22.8e-20584.74Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSE+SVT+ KPK+DE LN S+ T ES+  GG++SS NCSN+   A E+TAQTSDGSGEKSLE+AEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
        AA YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE +Q+S KD SVK+  NGESSKASVSSNAELVDGV DDVSSKKDQDEE+ D+S  EDLA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA

Query:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
        +ADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI FCQ
Subjt:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ

Query:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
        KAISICKSRV+RLTDEVK I+VPTTASSTSGSEP   LSSN SQ D  NAA+EKQSEIE LSGLLVELEKKLEDLQQLASNPKSILSEILGIG+A++KV 
Subjt:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN

Query:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA
        EK AP  PA LNSSQ+ SANSNGGFDSPTVSTAHTNGA GVTHLGVVGRGVKRVSTNSESA+ S+P KK A DSS+QDKGDGSSA
Subjt:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA

A0A6J1I412 protein HGV21.9e-20685.36Show/hide
Query:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR
        MADE PPSE+SVT+ KPK+DE LN S+ T ES+  GG+ESS NCSN+     E+TAQTSDGSGEKSLE+AEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIR

Query:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA
        AA YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE HQ+S KD SVKSA NGESSKASVSSNAELVDGV DDVSSKKDQDEE+ D+S  EDLA
Subjt:  AAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLA

Query:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ
        +ADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFR+CLCLEFGS+PQEAI FCQ
Subjt:  EADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQ

Query:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN
        KAISICKSRV+RLTDEVK I+VPTTASSTSGSEP   LSSN SQ D  NAA+EKQSEIE LSGLLVELEKKLEDLQQLASNPKSILSEILGIG+A++KV 
Subjt:  KAISICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVN

Query:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA
        EK AP  PA LNSSQ+ SANSNGGFDSPTVSTAHTNGA GVTHLGVVGRGVKRVSTNSESA+ S+P KK A DSS+QDKGDGSSA
Subjt:  EKSAP--PAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA

SwissProt top hitse value%identityAlignment
Q17886 Protein NASP homolog 18.4e-0521.31Show/hide
Query:  VESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVP
        V+ +S  S++K   T    +T +   +K   +A ELL  G +A+K ND ++A D  S A E+ +  YGE         Y YG A L  A+EE+  L    
Subjt:  VESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELAPECVKLYYKYGCALLYKAQEEADPLGAVP

Query:  KKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEADEDESDLDLAWKMLDVAR--------AIVEKDSGD
        +KE    +++          NGE+ K                         ED + S  E+    D+D+  + L+W++L+ AR        A+  + SG 
Subjt:  KKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEADEDESDLDLAWKMLDVAR--------AIVEKDSGD

Query:  T------MEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAISICKSRVLRLTDEVKSILVP
        +      ++  D+L  L E  +       +  D  +AL+I   ++ P +R++A+    +       +   E + +  K   +  +R   L  E++     
Subjt:  T------MEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAISICKSRVLRLTDEVKSILVP

Query:  TTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLED----LQQLASNPKSILSEILGIGSARSKVNEKSAPPAALNS-SQLASAN
                             +D     SE ++E++ L  ++  +E+ + D      Q+    K+I ++  G     +K+ +++      N  S L    
Subjt:  TTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLED----LQQLASNPKSILSEILGIGSARSKVNEKSAPPAALNS-SQLASAN

Query:  SNGGFDSPTVSTA
        +    D+PT + A
Subjt:  SNGGFDSPTVSTA

Q9USQ4 NASP-related protein sim38.9e-0725.95Show/hide
Query:  NAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELAPECVKLYYKYGCALLYKAQEEADPLG-AVPKKEG-----E
        N+AT+A  +    S  +++   E+L+ +G+ A    ++ EAVD + +AL    + +G  + E   + + YG +L   A E +  LG A+  KE      E
Subjt:  NAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELAPECVKLYYKYGCALLYKAQEEADPLG-AVPKKEG-----E

Query:  SHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEADEDESDLDLAWKMLDVARAIVEK------DSGD-TMEKVD
        S +E +  GS   +     +K +V+          ++ SS    ++E  +    E    ++EDE D ++AW++LD+ R +  K      DS D  +   D
Subjt:  SHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEADEDESDLDLAWKMLDVARAIVEK------DSGD-TMEKVD

Query:  ILSALAEVALEREDIETSLSDYQKALSILERLVE-PDNRQLAELNFRLCLCLEF-----GSKPQEAIPFCQKAISICKSRVLRLTDEVKSILVPTTASST
        I   L E++LE E+   +  D + AL   E++    +N  L+E +++L L LEF      S    A    +KA  I K+ +    +EV       T    
Subjt:  ILSALAEVALEREDIETSLSDYQKALSILERLVE-PDNRQLAELNFRLCLCLEF-----GSKPQEAIPFCQKAISICKSRVLRLTDEVKSILVPTTASST

Query:  SGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVNEKSAPPAALNSSQLASANSNGG
         G + A +              S   S++E L  +L ELE+K  DL+  A + +  +   +   S  SK +   A   A     + +AN  GG
Subjt:  SGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVNEKSAPPAALNSSQLASANSNGG

Arabidopsis top hitse value%identityAlignment
AT4G37210.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.1e-11654.62Show/hide
Query:  PPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESS-SNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHY
        P +E++ T     ++ +L + E T+ES  QGG ES+ +N +N+ NAA  A  +  D   EK+LE AEEL EKGS  +K+NDF EAVDCFSRALEIR AHY
Subjt:  PPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESS-SNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHY

Query:  GELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSV-KSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEAD
        GEL  EC+  YY+YG ALL KAQ EADPLG +PKKEGE  QES    S+  S V+G+  +   SS  E   G       +  +D +D D SDA+   +AD
Subjt:  GELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSV-KSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEAD

Query:  EDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAI
        EDESDLD+AWKMLD+AR I +K S +TMEKVDIL +LAEV+LEREDIE+SLSDY+ ALSILERLVEPD+R+ AELNFR+C+CLE G +P+EAIP+CQKA+
Subjt:  EDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAI

Query:  SICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVNEKS
         ICK+R+ RL++E+K      T+S+ S  +   Q SSNV  I  D +AS+K+ EI  L+GL  +LEKKLEDL+Q A NPK +L+E++G+ SA+   ++K 
Subjt:  SICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVNEKS

Query:  APPAA-LNSSQLASANSNGGFD--SPTVSTAHT-----NGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA
         P AA ++SS++ + N+N G D  SPTVSTAHT       A GVTHLGVVGRGVKRV  N+ S ES+  KKPA++ S  DK DG+S+
Subjt:  APPAA-LNSSQLASANSNGGFD--SPTVSTAHT-----NGAPGVTHLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA

AT4G37210.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-8855.16Show/hide
Query:  PPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESS-SNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHY
        P +E++ T     ++ +L + E T+ES  QGG ES+ +N +N+ NAA  A  +  D   EK+LE AEEL EKGS  +K+NDF EAVDCFSRALEIR AHY
Subjt:  PPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESS-SNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHY

Query:  GELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSV-KSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEAD
        GEL  EC+  YY+YG ALL KAQ EADPLG +PKKEGE  QES    S+  S V+G+  +   SS  E   G       +  +D +D D SDA+   +AD
Subjt:  GELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSV-KSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEAD

Query:  EDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAI
        EDESDLD+AWKMLD+AR I +K S +TMEKVDIL +LAEV+LEREDIE+SLSDY+ ALSILERLVEPD+R+ AELNFR+C+CLE G +P+EAIP+CQKA+
Subjt:  EDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAI

Query:  SICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKK
         ICK+R+ RL++E+K      T+S+ S  +   Q SSNV  I  D +AS+K+ EI  L+GL  +LEKK
Subjt:  SICKSRVLRLTDEVKSILVPTTASSTSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACGAGGGTCCACCATCGGAGGTCTCGGTGACGGTGGATAAACCGAAGGTGGACGAAAGCCTGAACGCCAGTGAGGTCACCATTGAGTCCAATGCCCAGGGAGG
CGTCGAGTCGTCTTCCAATTGTTCCAATGACAAGAACGCCGCTACTGAAGCCACTGCTCAGACCTCCGATGGAAGCGGTGAGAAATCGTTGGAGATGGCGGAAGAGTTGC
TGGAGAAAGGATCCAAGGCCATGAAGGATAATGATTTTAACGAGGCTGTCGATTGCTTCAGCCGCGCCCTTGAGATTAGAGCTGCACATTATGGTGAACTGGCTCCAGAA
TGCGTCAAATTGTACTACAAATATGGATGTGCTTTATTGTATAAAGCCCAGGAGGAGGCAGATCCACTGGGAGCTGTTCCAAAGAAAGAGGGTGAATCCCATCAAGAATC
AGATAAAGATGGATCTGTGAAGAGTGCTGTAAATGGTGAATCTTCAAAAGCTTCTGTTTCAAGTAATGCTGAACTGGTGGATGGAGTAACGGATGATGTTTCCAGTAAGA
AAGATCAGGATGAAGAAGATGCCGATAATAGTGATGCCGAAGACTTGGCAGAGGCAGATGAAGATGAATCTGACCTGGATTTAGCTTGGAAAATGCTGGACGTTGCCAGA
GCAATAGTTGAAAAAGATTCGGGTGACACAATGGAGAAAGTGGACATATTGTCGGCCTTGGCAGAAGTTGCTCTAGAAAGAGAGGACATTGAAACTTCCCTCAGTGACTA
CCAGAAAGCGTTATCAATTTTAGAAAGACTTGTTGAACCCGACAATCGACAACTTGCCGAACTAAACTTCCGTTTATGCTTGTGCCTGGAGTTCGGTTCCAAGCCGCAGG
AAGCCATTCCATTTTGCCAGAAGGCAATATCAATCTGCAAGTCACGTGTGCTGCGGCTCACCGACGAAGTAAAGAGTATCCTGGTACCAACGACAGCTTCTTCTACATCG
GGGTCAGAACCAGCTGCCCAGCTGTCCTCCAATGTCTCCCAGATTGACACTGACAACGCTGCATCAGAGAAACAATCTGAGATTGAAACTCTATCTGGGCTTTTGGTTGA
GCTAGAAAAGAAGCTTGAAGATCTGCAACAGCTAGCCTCAAACCCGAAGTCAATTCTCTCAGAAATCCTTGGGATAGGATCAGCTAGGTCGAAAGTCAATGAAAAGAGCG
CACCTCCAGCAGCGTTGAACTCTTCCCAGTTGGCTTCAGCTAACAGCAATGGAGGATTTGACTCTCCGACTGTCTCAACTGCCCACACAAACGGTGCACCCGGAGTGACA
CACCTTGGTGTCGTCGGAAGGGGAGTCAAACGAGTATCTACAAACTCAGAATCTGCTGAATCAAACCCAATGAAGAAACCGGCAATAGATTCATCATCACAAGATAAAGG
TGATGGCAGTTCTGCC
mRNA sequenceShow/hide mRNA sequence
ATGGCCGACGAGGGTCCACCATCGGAGGTCTCGGTGACGGTGGATAAACCGAAGGTGGACGAAAGCCTGAACGCCAGTGAGGTCACCATTGAGTCCAATGCCCAGGGAGG
CGTCGAGTCGTCTTCCAATTGTTCCAATGACAAGAACGCCGCTACTGAAGCCACTGCTCAGACCTCCGATGGAAGCGGTGAGAAATCGTTGGAGATGGCGGAAGAGTTGC
TGGAGAAAGGATCCAAGGCCATGAAGGATAATGATTTTAACGAGGCTGTCGATTGCTTCAGCCGCGCCCTTGAGATTAGAGCTGCACATTATGGTGAACTGGCTCCAGAA
TGCGTCAAATTGTACTACAAATATGGATGTGCTTTATTGTATAAAGCCCAGGAGGAGGCAGATCCACTGGGAGCTGTTCCAAAGAAAGAGGGTGAATCCCATCAAGAATC
AGATAAAGATGGATCTGTGAAGAGTGCTGTAAATGGTGAATCTTCAAAAGCTTCTGTTTCAAGTAATGCTGAACTGGTGGATGGAGTAACGGATGATGTTTCCAGTAAGA
AAGATCAGGATGAAGAAGATGCCGATAATAGTGATGCCGAAGACTTGGCAGAGGCAGATGAAGATGAATCTGACCTGGATTTAGCTTGGAAAATGCTGGACGTTGCCAGA
GCAATAGTTGAAAAAGATTCGGGTGACACAATGGAGAAAGTGGACATATTGTCGGCCTTGGCAGAAGTTGCTCTAGAAAGAGAGGACATTGAAACTTCCCTCAGTGACTA
CCAGAAAGCGTTATCAATTTTAGAAAGACTTGTTGAACCCGACAATCGACAACTTGCCGAACTAAACTTCCGTTTATGCTTGTGCCTGGAGTTCGGTTCCAAGCCGCAGG
AAGCCATTCCATTTTGCCAGAAGGCAATATCAATCTGCAAGTCACGTGTGCTGCGGCTCACCGACGAAGTAAAGAGTATCCTGGTACCAACGACAGCTTCTTCTACATCG
GGGTCAGAACCAGCTGCCCAGCTGTCCTCCAATGTCTCCCAGATTGACACTGACAACGCTGCATCAGAGAAACAATCTGAGATTGAAACTCTATCTGGGCTTTTGGTTGA
GCTAGAAAAGAAGCTTGAAGATCTGCAACAGCTAGCCTCAAACCCGAAGTCAATTCTCTCAGAAATCCTTGGGATAGGATCAGCTAGGTCGAAAGTCAATGAAAAGAGCG
CACCTCCAGCAGCGTTGAACTCTTCCCAGTTGGCTTCAGCTAACAGCAATGGAGGATTTGACTCTCCGACTGTCTCAACTGCCCACACAAACGGTGCACCCGGAGTGACA
CACCTTGGTGTCGTCGGAAGGGGAGTCAAACGAGTATCTACAAACTCAGAATCTGCTGAATCAAACCCAATGAAGAAACCGGCAATAGATTCATCATCACAAGATAAAGG
TGATGGCAGTTCTGCC
Protein sequenceShow/hide protein sequence
MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSDGSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELAPE
CVKLYYKYGCALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDVSSKKDQDEEDADNSDAEDLAEADEDESDLDLAWKMLDVAR
AIVEKDSGDTMEKVDILSALAEVALEREDIETSLSDYQKALSILERLVEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAISICKSRVLRLTDEVKSILVPTTASSTS
GSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKKLEDLQQLASNPKSILSEILGIGSARSKVNEKSAPPAALNSSQLASANSNGGFDSPTVSTAHTNGAPGVT
HLGVVGRGVKRVSTNSESAESNPMKKPAIDSSSQDKGDGSSA