CuGenDBv2

Lag0023427 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0023427
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: PAP/OAS1 substrate-binding domain superfamily
Genome location: chr7:48134154..48137482
RNA-Seq Expression: Lag0023427
Synteny: Lag0023427
Gene Ontology terms: NA
InterPro domains: NA
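
The genome location above is given as a "sequence:start..end" string. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names GenomeLocation and parse_location are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'chr7:48134154..48137482'."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("chr7:48134154..48137482")
print(loc.seqid, loc.start, loc.end, loc.length)
# chr7 48134154 48137482 3329
```

Under the inclusive-coordinate assumption the gene spans 3329 bp on chr7.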


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607368.1 hypothetical protein SDJN03_00710, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.6e-187; % identity: 71.98)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S R  P+   +   H   +  L  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPT DAR GVSV+SSA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF
        P KT FQ KACLEPTL  RD K    + SF+ +   G K   N++K ETAVVLDTSGTNDT + SSL SNQ N VLEN+Y APQNGFLGLLGSQMT DCF
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF

Query:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH
        NNDGV          K FN+RCNGG  SFDDVGKLLDLCGDYD CF+NLRYSQ+CD+Y+ SPP LPLSPPMSPHRQ+NYPW A HQS+IRN NVP GI+ 
Subjt:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH

Query:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT
        NGFAMGLQS+PVNNF  V EEKRRPQGIGTYFPRTNT +YRDR+SQAKG+SQGQMTRSQL R D S NQ SATP+EL IYT GG EFSEAEFPVLGNGKT
Subjt:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT

Query:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-------SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQES
        G SGSPP   SYLSRWKTPH   +NDSWPHDE +E WPI PEPCDATIPEA       S+PSEQ  TS LQG SS GIG SA E LKR EENNQES
Subjt:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-------SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQES

KAG7037039.1 Superoxide dismutase [Mn], mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.1e-186; % identity: 71.72)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S R  P+   +   H   +  L  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPT DAR GVSV+SSA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF
        P KT FQ KACLEPTL  RD K    + SF+ +   G K   N++K ETAVVLDTSGTNDT + SS  SNQ N VLEN+Y APQNGFLGLLGSQMT DCF
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF

Query:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH
        NNDGV          K FN+RCNGG  SFDDVGKLLDLCGDYD CF+NLRYSQ+CD+Y+ SPP LPLSPPMSPHRQ+NYPW A HQS+IRN NVP GI+ 
Subjt:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH

Query:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT
        NGFAMGLQS+PVNNF  V EEKRRPQGIGTYFPRTNT +YRDR+SQAKG+SQGQMTRSQL R D S NQ SATP+EL IYT GG EFSEAEFPVLGNGKT
Subjt:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT

Query:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-------SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE
        G SGSPP   SYLSRWKTPH   +NDSWPHDE +E WPI PEPCDATIPEA       S+PSEQ  TS LQG SS GIG SA E LKR EENNQE
Subjt:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-------SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE

XP_022949357.1 uncharacterized protein LOC111452735 [Cucurbita moschata] (e value: 2.0e-185; % identity: 72.19)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S R  P+   +   H   +  L  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPT+DAR GVSV+SSA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF
        P KT FQ KACLEPTL  RD K    + SF+ +   G K   N++K ETAVVLDTSGTNDT + SS  SNQ N VLEN+Y APQNGFLGLLGSQMT DCF
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF

Query:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH
        NNDGV          K FN+RCNGG  SFDDVGKLLDLCGDYD CF+NLRYSQ+CD+Y+ SPP LPLSPPMSPHRQ+NYPW A HQS+IRN NVP GI+ 
Subjt:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH

Query:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT
        NGFAMGLQS+ VNNF  V EEKRRPQGIGTYFPRTNT +YRDR+SQAKG+SQGQMTRSQL R D S NQ S TP+EL IYT GG EFSEAEFPVLGNGKT
Subjt:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT

Query:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE
        G SGSPP   SYLSRWKTPH   +NDSWPHDE +E WPI PEPCDATIPEA S+PSEQ  TS LQG SS GIG SA E LKR EENNQE
Subjt:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE

XP_022997710.1 uncharacterized protein LOC111492589 [Cucurbita maxima] (e value: 3.2e-188; % identity: 73.01)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S R  P+   +   H   +  L  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPT DAR GVSV+SSA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF
        P KT FQ KACLEPTL  RD K    + SF+ +   G K   N++KLETAVVLDTSGTNDT + SS HSNQ N VLEN+Y AP+NGFLGLLGSQMT DCF
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF

Query:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH
        NNDGV          K FN+RCNGG  SFDDVGKLLDLCGDYD CF+NLRYSQ+CDRY+ SPP LPLSPPMSPHRQ+NYPW A HQS+IRN NVPAGI+ 
Subjt:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH

Query:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT
        NGFAMGLQS+PVNNF  V EEKRRPQGIGTYFPRTNT +YRDR+SQAKG+SQGQMTRSQL R D S NQ SATP+EL IYT GG EFSEAEFPVLGNGKT
Subjt:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT

Query:  GLSGSPPS-SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA--SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE
        G SGSPP  SYLSRWKTPH   +NDSWPHDE +E WPI PEPCDATIPEA  S+P EQ  TS LQG SS GIG SA E LKR EENNQE
Subjt:  GLSGSPPS-SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA--SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE

XP_023523561.1 uncharacterized protein LOC111787748 [Cucurbita pepo subsp. pepo] (e value: 4.8e-184; % identity: 71.26)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S R  P+   +   H   +  L  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPT DAR GVSV+SSA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF
        P KT FQ KACLEPTL   D K    + SF+ +   G K   N++K ETAVVLDTSGTNDT + SS  SNQ N VLEN+Y APQNGFLGLLGSQMT DCF
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF

Query:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH
        NNDG+          K FN+RCNGG  S DDVGKLLDLCGDYD CF+NLRYSQ+CDRY+ SPP LPLSPPMSPHRQ+NYPW A HQS+IRN NVPAGI+ 
Subjt:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH

Query:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT
        NGFAMGLQS+PVNNF  V EEKRRPQGIGTYFPRTNT +YRDR+SQAKG+SQGQMTRSQL R D S NQ SATP+EL IYT GG EFSEAEFPVLGNGKT
Subjt:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT

Query:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA------SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE
        G SGSPP   SYLSRWKTPH   +NDSWPHDE +E WPI PEPCDATIPEA      S+P EQ  TS LQG SS GIG SA E LKR +ENNQE
Subjt:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA------SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE

TrEMBL top hits (e value, % identity, alignment)
A0A2N9G170 Uncharacterized protein (e value: 7.8e-55; % identity: 33.04)
Query:  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRH---CWTN--AEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKTT--ETKGS
        NF+RIRSAFKYGARKLGWIL+LP ER+  EL KFF+NTLDRH   CWT+  ++F ++          S  S    + K  L+ T GF D K +  E    
Subjt:  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRH---CWTN--AEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKTT--ETKGS

Query:  FSN---------ISST---------------GCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCA-----------------------
          N         +SS                G +F G+ ++L T+ +L    TND+ +   L SN   + L NS+ A                       
Subjt:  FSN---------ISST---------------GCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCA-----------------------

Query:  -PQNGFLG-------------LLGSQMTSDCFNNDGVFFTLEVEQKDKDFNKRCN------GGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSP
         P N  +              L+G+ + S   N +G+     V          CN      G + S + +  LLDL GDY+   RNL+Y Q+C  Y+ SP
Subjt:  -PQNGFLG-------------LLGSQMTSDCFNNDGVFFTLEVEQKDKDFNKRCN------GGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSP

Query:  PALPLSPPMSPHRQKNYPWQAVHQS-RIRNNVPAGIDHNGFAMGLQ----SDPVNN-FAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGR-----SQ
        P LP SPP+SP  Q   PW+ +  S RI+ N  + +  N  A G +    S P ++  A  LEEK+RP+G GTYFP+ N   YRDR    KGR     + 
Subjt:  PALPLSPPMSPHRQKNYPWQAVHQS-RIRNNVPAGIDHNGFAMGLQ----SDPVNN-FAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGR-----SQ

Query:  GQMTR--------------------------------------------SQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGLSGSPPSS
        GQ+ R                                            SQL R   SN L+  PRE+ +   G HE S  E+PVLG GK G S S   S
Subjt:  GQMTR--------------------------------------------SQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGLSGSPPSS

Query:  YLSRWKTPHDNNDSWPHDELESWPIYPEPCDATIPEASNPSEQATSHLQGSSCGIGLSAAESLKRVEENNQE
        + S+  + H N  S P ++LES  + P+P  A +PE S+  E  TSH  GS+        +S K    NN+E
Subjt:  YLSRWKTPHDNNDSWPHDELESWPIYPEPCDATIPEASNPSEQATSHLQGSSCGIGLSAAESLKRVEENNQE

A0A6J1DLE7 uncharacterized protein LOC111022008 (e value: 4.0e-184; % identity: 67.8)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S+R  P+   +   H   +  +  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAEL KFFANTLDRHCW+NAEFPTMDA FGVSV++SA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISST--GCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSD
        P +T    K CLEPTL  RD KT+  + SFSNISST  GC+FTGN++KL    VL+TS TND  +LSSLHSNQIN+V EN +CAPQNGF  LLGS+MTSD
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISST--GCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSD

Query:  CFNNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRNN-VPAGI
        C N+D +  TLEVE KD+ FNKRCNG TRSF+D GKLLDLCGDYD  FRNLRYSQ+CDRY+ S P LPLSPPMSPHRQKNYPW+  H+S   N+ +P+GI
Subjt:  CFNNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRNN-VPAGI

Query:  DHNGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGK
        D NGF MGLQS+PVN+FAIVLEE +RPQGIGTYFPRTNT SYRDR+SQAKG+ QGQMT+ QL  QD SN+LSAT REL +   G HEFSEAEFP LGNGK
Subjt:  DHNGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGK

Query:  TGLSGSPPSSYLSRWKTPHDNNDSWPHDELESWPIYPEPCDATIPEASNPSEQATSHLQGSSCGIGLSAAESLKRVEENNQESDATVD-C----GDRNFP
        TG SGSPPS  +S+WKTPH N+ SWPHDEL +WPIYPEP DATI EASNP EQ  SH    S  IGLSA E+LKRVEE+NQ  +  V+ C     D +FP
Subjt:  TGLSGSPPSSYLSRWKTPHDNNDSWPHDELESWPIYPEPCDATIPEASNPSEQATSHLQGSSCGIGLSAAESLKRVEENNQESDATVD-C----GDRNFP

A0A6J1GBT6 uncharacterized protein LOC111452735 (e value: 9.5e-186; % identity: 72.19)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S R  P+   +   H   +  L  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPT+DAR GVSV+SSA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF
        P KT FQ KACLEPTL  RD K    + SF+ +   G K   N++K ETAVVLDTSGTNDT + SS  SNQ N VLEN+Y APQNGFLGLLGSQMT DCF
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF

Query:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH
        NNDGV          K FN+RCNGG  SFDDVGKLLDLCGDYD CF+NLRYSQ+CD+Y+ SPP LPLSPPMSPHRQ+NYPW A HQS+IRN NVP GI+ 
Subjt:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH

Query:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT
        NGFAMGLQS+ VNNF  V EEKRRPQGIGTYFPRTNT +YRDR+SQAKG+SQGQMTRSQL R D S NQ S TP+EL IYT GG EFSEAEFPVLGNGKT
Subjt:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT

Query:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE
        G SGSPP   SYLSRWKTPH   +NDSWPHDE +E WPI PEPCDATIPEA S+PSEQ  TS LQG SS GIG SA E LKR EENNQE
Subjt:  GLSGSPPS--SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA-SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE

A0A6J1KAQ2 uncharacterized protein LOC111492589 (e value: 1.6e-188; % identity: 73.01)
Query:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA
        S R  P+   +   H   +  L  N NN   S +  NFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPT DAR GVSV+SSA
Subjt:  SQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSA

Query:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF
        P KT FQ KACLEPTL  RD K    + SF+ +   G K   N++KLETAVVLDTSGTNDT + SS HSNQ N VLEN+Y AP+NGFLGLLGSQMT DCF
Subjt:  PSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCF

Query:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH
        NNDGV          K FN+RCNGG  SFDDVGKLLDLCGDYD CF+NLRYSQ+CDRY+ SPP LPLSPPMSPHRQ+NYPW A HQS+IRN NVPAGI+ 
Subjt:  NNDGVFFTLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRN-NVPAGIDH

Query:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT
        NGFAMGLQS+PVNNF  V EEKRRPQGIGTYFPRTNT +YRDR+SQAKG+SQGQMTRSQL R D S NQ SATP+EL IYT GG EFSEAEFPVLGNGKT
Subjt:  NGFAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHS-NQLSATPRELGIYTVGGHEFSEAEFPVLGNGKT

Query:  GLSGSPPS-SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA--SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE
        G SGSPP  SYLSRWKTPH   +NDSWPHDE +E WPI PEPCDATIPEA  S+P EQ  TS LQG SS GIG SA E LKR EENNQE
Subjt:  GLSGSPPS-SYLSRWKTPH--DNNDSWPHDE-LESWPIYPEPCDATIPEA--SNPSEQ-ATSHLQG-SSCGIGLSAAESLKRVEENNQE

A0A7N2KNM2 Uncharacterized protein (e value: 3.4e-50; % identity: 31.88)
Query:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRH---CWTNAEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKT
        NN   S    NFYRIRSAF+YGARKLGWIL+LP ER+  EL KFF+NTLDRH   CWT+        +    + S        + K  L+   GF D K 
Subjt:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRH---CWTNAEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKT

Query:  TETKGSFSNIS------------------------STGCK--FTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCA---------------
        +  + +   I+                        +  CK  F G+ ++L ++  +D   TND+H   S  S   ++VL N Y A               
Subjt:  TETKGSFSNIS------------------------STGCK--FTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCA---------------

Query:  ---------PQNGFLG----------------------------------LLGSQMTSDCFNNDGVFFTLEVEQKDKDFNKRCNGG------TRSFDDVG
                 PQN  +                                   L G      C N++G+  T  V          CNG       + S + + 
Subjt:  ---------PQNGFLG----------------------------------LLGSQMTSDCFNNDGVFFTLEVEQKDKDFNKRCNGG------TRSFDDVG

Query:  KLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAV-HQSRIRNNVPAGIDHNGFAMGLQ----SDPVNNFAIV-LEEKRRPQGI
         LLDL GDYD   RNL+Y Q+C  Y+ SP  LP  PP+SP  Q   P + +    +I+ NV + +  NG A+G +    S P ++ A   LEEK+ P+G 
Subjt:  KLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAV-HQSRIRNNVPAGIDHNGFAMGLQ----SDPVNNFAIV-LEEKRRPQGI

Query:  GTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGLSGSPPSSYLSRWKTPHDNNDSWPHDEL
        GT+ P+ N  SY+ R    KGR+       QL R  ++N L+  PRE+     G HE S  EFPVL  GK+G S S    +LS+W + H N  S P ++ 
Subjt:  GTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGLSGSPPSSYLSRWKTPHDNNDSWPHDEL

Query:  ESWPIYPEPCDATIPEASNPSEQATSHLQGSSCGIGLSAAESLKRVEENNQE
        ES  + P+P  A +PE S+  E  TS    S+        +S K +  NN+E
Subjt:  ESWPIYPEPCDATIPEASNPSEQATSHLQGSSCGIGLSAAESLKRVEENNQE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G51620.1 PAP/OAS1 substrate-binding domain superfamily (e value: 2.1e-20; % identity: 25.59)
Query:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKTTET
        NN   S +  NFYRIRSAF YGARKLG + L  +E + +EL+KFF+N L RH   + + P         V  + P     ++ A L  +  F++ +    
Subjt:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKTTET

Query:  KGSFSNISSTGCKFTGNIRKLETAVVL------DTSG--------------TNDTHELSSLHSNQINV---------VLENSYCAPQNGFLGLLGSQMTS
          S S+  +TG         L+  V +      D SG              + D  +L++L   ++ +         + +    +P NG           
Subjt:  KGSFSNISSTGCKFTGNIRKLETAVVL------DTSG--------------TNDTHELSSLHSNQINV---------VLENSYCAPQNGFLGLLGSQMTS

Query:  DCFNNDGV-------FFTLEVEQKDKDFNKRCNGGTRSFDDV--------------------------GKLLDLCGDYDICFRNLRYSQVCDRYSFSPPA
        +  N +GV       + T     KD   N+  N     ++D+                            L DL GDY+    +LR+ +    Y  + P 
Subjt:  DCFNNDGV-------FFTLEVEQKDKDFNKRCNGGTRSFDDV--------------------------GKLLDLCGDYDICFRNLRYSQVCDRYSFSPPA

Query:  LPLSPPMSPHRQKNYPWQAV-HQSRIRNNVPAGIDHNG-------FAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTR
         PLSPP  P    N  W+ + H    R N P  ++ NG       F +  Q  P   F I  EE  +P+G GTYFP  N + YRDR    +GR+  Q   
Subjt:  LPLSPPMSPHRQKNYPWQAV-HQSRIRNNVPAGIDHNG-------FAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTR

Query:  SQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGL------SGSPPSSYLSRWKTPHDNNDS--WPHDELESW-PIYPEPCDA-TIPEASN
                    + +PR  G      H  SE  FP     +  L      +GS   S+     +  D N S   P+++   + P  P P +  + PE S 
Subjt:  SQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGL------SGSPPSSYLSRWKTPHDNNDS--WPHDELESW-PIYPEPCDA-TIPEASN

Query:  PSEQATSH
        P +    H
Subjt:  PSEQATSH

AT3G51620.2 PAP/OAS1 substrate-binding domain superfamily (e value: 2.1e-20; % identity: 25.59)
Query:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKTTET
        NN   S +  NFYRIRSAF YGARKLG + L  +E + +EL+KFF+N L RH   + + P         V  + P     ++ A L  +  F++ +    
Subjt:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVRSSAPSKTNFQHKACLEPTLGFRDRKTTET

Query:  KGSFSNISSTGCKFTGNIRKLETAVVL------DTSG--------------TNDTHELSSLHSNQINV---------VLENSYCAPQNGFLGLLGSQMTS
          S S+  +TG         L+  V +      D SG              + D  +L++L   ++ +         + +    +P NG           
Subjt:  KGSFSNISSTGCKFTGNIRKLETAVVL------DTSG--------------TNDTHELSSLHSNQINV---------VLENSYCAPQNGFLGLLGSQMTS

Query:  DCFNNDGV-------FFTLEVEQKDKDFNKRCNGGTRSFDDV--------------------------GKLLDLCGDYDICFRNLRYSQVCDRYSFSPPA
        +  N +GV       + T     KD   N+  N     ++D+                            L DL GDY+    +LR+ +    Y  + P 
Subjt:  DCFNNDGV-------FFTLEVEQKDKDFNKRCNGGTRSFDDV--------------------------GKLLDLCGDYDICFRNLRYSQVCDRYSFSPPA

Query:  LPLSPPMSPHRQKNYPWQAV-HQSRIRNNVPAGIDHNG-------FAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTR
         PLSPP  P    N  W+ + H    R N P  ++ NG       F +  Q  P   F I  EE  +P+G GTYFP  N + YRDR    +GR+  Q   
Subjt:  LPLSPPMSPHRQKNYPWQAV-HQSRIRNNVPAGIDHNG-------FAMGLQSDPVNNFAIVLEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTR

Query:  SQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGL------SGSPPSSYLSRWKTPHDNNDS--WPHDELESW-PIYPEPCDA-TIPEASN
                    + +PR  G      H  SE  FP     +  L      +GS   S+     +  D N S   P+++   + P  P P +  + PE S 
Subjt:  SQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGL------SGSPPSSYLSRWKTPHDNNDS--WPHDELESW-PIYPEPCDA-TIPEASN

Query:  PSEQATSH
        P +    H
Subjt:  PSEQATSH

AT3G61690.1 nucleotidyltransferases (e value: 6.6e-06; % identity: 48.08)
Query:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRH
        NN   S +  NF+RIRSAF  GA+KL  +L  P+E +  E+ +FF NT +RH
Subjt:  NNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRH


Sequences
CDS sequence
ATGCGTTTGCAACCCGCTGTGATCAGAGACCCATTCGAGTCACAGAGGATCCACCCCACCCCACCTGCTTACGATTTTTCACACCACAAGGCATTGATCATTTTGTCTTT
CAACATGAACAATCAGAAAGAAAGTCAAGCCCTAAGAAACTTTTATCGAATTCGCAGTGCTTTCAAATATGGAGCTCGTAAGCTAGGTTGGATTCTTTTATTACCAGAAG
AAAGAATGGAAGCTGAACTTAAGAAATTCTTTGCCAACACTTTAGACAGGCACTGTTGGACTAATGCAGAATTTCCTACTATGGATGCTCGCTTTGGGGTTTCAGTCCGG
TCATCTGCACCATCCAAGACAAATTTTCAACACAAAGCCTGTTTGGAGCCAACATTGGGCTTCAGAGATCGAAAGACAACAGAAACTAAAGGTTCTTTCTCAAACATAAG
TTCTACAGGGTGTAAATTCACAGGGAACATCCGGAAGCTTGAGACTGCTGTTGTTCTGGATACAAGTGGCACTAATGACACACATGAATTGTCTTCCTTGCATTCCAACC
AGATTAATGTGGTCTTAGAGAACTCTTATTGTGCCCCACAGAATGGTTTTCTAGGCTTGTTAGGAAGTCAAATGACCTCAGATTGCTTTAATAATGATGGGGTGTTTTTC
ACGTTAGAGGTTGAACAGAAAGATAAGGACTTCAACAAAAGATGCAATGGAGGCACCAGAAGCTTTGATGATGTAGGAAAGTTATTAGATCTTTGTGGGGATTATGATAT
TTGCTTTAGGAACCTTAGATATAGTCAGGTTTGTGACAGGTACTCCTTTTCTCCTCCAGCACTACCGTTAAGTCCACCCATGTCACCTCATAGGCAGAAGAATTATCCTT
GGCAAGCCGTCCATCAATCCCGAATACGCAATAATGTACCTGCTGGAATTGACCATAATGGATTCGCCATGGGACTACAGTCGGACCCGGTTAATAATTTTGCTATTGTT
TTGGAGGAAAAGAGAAGGCCACAAGGAATTGGTACTTACTTCCCTAGAACGAATACTAGCTCATATAGAGACAGGCGATCTCAAGCAAAGGGAAGGAGTCAAGGACAGAT
GACTCGGAGTCAATTGCTAAGGCAAGATCACAGTAATCAATTGTCTGCAACTCCACGAGAACTCGGCATATATACAGTTGGCGGCCATGAATTTTCGGAAGCTGAATTTC
CGGTTCTTGGTAATGGCAAAACGGGATTATCAGGTTCACCACCCTCGTCTTACCTTTCAAGATGGAAAACCCCGCATGACAATAATGATTCCTGGCCACACGATGAACTT
GAATCTTGGCCTATTTATCCTGAGCCTTGTGATGCAACTATCCCAGAAGCAAGCAACCCATCAGAGCAAGCTACTTCACACCTTCAGGGCTCATCATGTGGGATTGGGTT
GTCAGCTGCAGAAAGTCTAAAACGAGTAGAAGAAAACAACCAAGAAAGCGATGCAACGGTGGACTGTGGAGATAGGAACTTCCCTGTAGATAATTATGCATTAGAAATCA
CAGTAATAACATGTAAGGTCCGGTTGATAGAGAATATCATGGTTGATTTACCAATGAAATGGATAGTATGA
mRNA sequence
ATGCGTTTGCAACCCGCTGTGATCAGAGACCCATTCGAGTCACAGAGGATCCACCCCACCCCACCTGCTTACGATTTTTCACACCACAAGGCATTGATCATTTTGTCTTT
CAACATGAACAATCAGAAAGAAAGTCAAGCCCTAAGAAACTTTTATCGAATTCGCAGTGCTTTCAAATATGGAGCTCGTAAGCTAGGTTGGATTCTTTTATTACCAGAAG
AAAGAATGGAAGCTGAACTTAAGAAATTCTTTGCCAACACTTTAGACAGGCACTGTTGGACTAATGCAGAATTTCCTACTATGGATGCTCGCTTTGGGGTTTCAGTCCGG
TCATCTGCACCATCCAAGACAAATTTTCAACACAAAGCCTGTTTGGAGCCAACATTGGGCTTCAGAGATCGAAAGACAACAGAAACTAAAGGTTCTTTCTCAAACATAAG
TTCTACAGGGTGTAAATTCACAGGGAACATCCGGAAGCTTGAGACTGCTGTTGTTCTGGATACAAGTGGCACTAATGACACACATGAATTGTCTTCCTTGCATTCCAACC
AGATTAATGTGGTCTTAGAGAACTCTTATTGTGCCCCACAGAATGGTTTTCTAGGCTTGTTAGGAAGTCAAATGACCTCAGATTGCTTTAATAATGATGGGGTGTTTTTC
ACGTTAGAGGTTGAACAGAAAGATAAGGACTTCAACAAAAGATGCAATGGAGGCACCAGAAGCTTTGATGATGTAGGAAAGTTATTAGATCTTTGTGGGGATTATGATAT
TTGCTTTAGGAACCTTAGATATAGTCAGGTTTGTGACAGGTACTCCTTTTCTCCTCCAGCACTACCGTTAAGTCCACCCATGTCACCTCATAGGCAGAAGAATTATCCTT
GGCAAGCCGTCCATCAATCCCGAATACGCAATAATGTACCTGCTGGAATTGACCATAATGGATTCGCCATGGGACTACAGTCGGACCCGGTTAATAATTTTGCTATTGTT
TTGGAGGAAAAGAGAAGGCCACAAGGAATTGGTACTTACTTCCCTAGAACGAATACTAGCTCATATAGAGACAGGCGATCTCAAGCAAAGGGAAGGAGTCAAGGACAGAT
GACTCGGAGTCAATTGCTAAGGCAAGATCACAGTAATCAATTGTCTGCAACTCCACGAGAACTCGGCATATATACAGTTGGCGGCCATGAATTTTCGGAAGCTGAATTTC
CGGTTCTTGGTAATGGCAAAACGGGATTATCAGGTTCACCACCCTCGTCTTACCTTTCAAGATGGAAAACCCCGCATGACAATAATGATTCCTGGCCACACGATGAACTT
GAATCTTGGCCTATTTATCCTGAGCCTTGTGATGCAACTATCCCAGAAGCAAGCAACCCATCAGAGCAAGCTACTTCACACCTTCAGGGCTCATCATGTGGGATTGGGTT
GTCAGCTGCAGAAAGTCTAAAACGAGTAGAAGAAAACAACCAAGAAAGCGATGCAACGGTGGACTGTGGAGATAGGAACTTCCCTGTAGATAATTATGCATTAGAAATCA
CAGTAATAACATGTAAGGTCCGGTTGATAGAGAATATCATGGTTGATTTACCAATGAAATGGATAGTATGA
Protein sequence
MRLQPAVIRDPFESQRIHPTPPAYDFSHHKALIILSFNMNNQKESQALRNFYRIRSAFKYGARKLGWILLLPEERMEAELKKFFANTLDRHCWTNAEFPTMDARFGVSVR
SSAPSKTNFQHKACLEPTLGFRDRKTTETKGSFSNISSTGCKFTGNIRKLETAVVLDTSGTNDTHELSSLHSNQINVVLENSYCAPQNGFLGLLGSQMTSDCFNNDGVFF
TLEVEQKDKDFNKRCNGGTRSFDDVGKLLDLCGDYDICFRNLRYSQVCDRYSFSPPALPLSPPMSPHRQKNYPWQAVHQSRIRNNVPAGIDHNGFAMGLQSDPVNNFAIV
LEEKRRPQGIGTYFPRTNTSSYRDRRSQAKGRSQGQMTRSQLLRQDHSNQLSATPRELGIYTVGGHEFSEAEFPVLGNGKTGLSGSPPSSYLSRWKTPHDNNDSWPHDEL
ESWPIYPEPCDATIPEASNPSEQATSHLQGSSCGIGLSAAESLKRVEENNQESDATVDCGDRNFPVDNYALEITVITCKVRLIENIMVDLPMKWIV
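
The CDS and protein sequences above can be checked against each other by translating the CDS. The Python below is a minimal sketch of that check, assuming the standard genetic code applies to this nuclear gene; the function name translate is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, then normalise case.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) raise KeyError in this sketch.
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGCGTTTGCAACCCGCTGTGATCAGAGAC"))  # MRLQPAVIRD
# Last five codons of the CDS listed above, ending in the stop codon TGA.
print(translate("AAATGGATAGTATGA"))  # KWIV
```

Both fragments agree with the start (MRLQPAVIRD) and end (KWIV) of the protein sequence above. Passing the full CDS, line breaks included, to the same function would allow the complete comparison.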