; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036473 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036473
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNuclear factor of activated T-cells 5 isoform X1
Genome locationscaffold5:40479274..40488908
RNA-Seq ExpressionSpg036473
SyntenySpg036473
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR003107 - HAT (Half-A-TPR) repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583926.1 Pre-mRNA-processing factor 39, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.89Show/hide
Query:  PSGTFGDSAGFFHLVAALLSPVLCQTKAIAMANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSF
        P  T  D  G   LV+   +P+LCQT+AIAMANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFD+WTSLISEIE K+ DVIE+ISLVYDSF
Subjt:  PSGTFGDSAGFFHLVAALLSPVLCQTKAIAMANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSF

Query:  LSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLA
        LSEFPLCHGYWRKYAAHK RLCSVDKV+DVFEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLA
Subjt:  LSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLA

Query:  LIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYD
        LIYIQTLRFPTKKLSYYH      SFRKLT SLRENI S+TGCNPS   ELEALP+GE PICCTD+E+SSVIKDLLDLS G  RYSALQKYV AGEKLYD
Subjt:  LIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYD

Query:  EACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVP
        EA QLEEK++HFERKIRRTYFHVK+LDA QLKNWHSYLDFVE+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVP
Subjt:  EACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVP

Query:  VIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTD
        VIHLFN+RFKEQIRDLSGAR AFL LDG+LDS FVENI+LKANMEKRMGKSTAA NVY+EALE+ALMK KLDVLPSLYIHFSR KHM             
Subjt:  VIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTD

Query:  FDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIH
            ITG  DAA+EVLIDGIRNVPLCKLLLE          ELINFVMV GAPKLINLVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIH
Subjt:  FDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIH

Query:  DVMKVWNRHIKLFPQSIRAMSYEDLIPGTEALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISM
        DVM+VWNRHIKLFPQSIRAM YED    TEAL++TKG KQT DSTV NQ  KDG FDLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG      
Subjt:  DVMKVWNRHIKLFPQSIRAMSYEDLIPGTEALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISM

Query:  KKSTIDQIHSGEAETGVEVRVQQNSPKVYEHYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSS
        K+S ID IHSGEAE G E RVQQ+SPKV EHY EGGN V+LA MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSS
Subjt:  KKSTIDQIHSGEAETGVEVRVQQNSPKVYEHYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSS

Query:  ESVCNTDEGALMHNPRGVRSYGSIQISNEVASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRT
        ESVCNTDEGAL+HNP+GVRS GSIQIS EVASPSSSPSHDK IHTQAPSQFH+ AT NR WHHK  AGNLHHDS  RFQ HSRRR HRTWQ SPR+YQ  
Subjt:  ESVCNTDEGALMHNPRGVRSYGSIQISNEVASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRT

Query:  RSGQTPDSQDYNSESVASQKPHVERSSQERSQIQSAQQQNFPTTSQPQIPSQ
        RSGQTPDSQDY SES+ASQ+P VERSSQE +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  RSGQTPDSQDYNSESVASQKPHVERSSQERSQIQSAQQQNFPTTSQPQIPSQ

KAG7019540.1 Pre-mRNA-processing factor 39 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0080.61Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISEIE K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH SIASVSFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS   ELEALP+GE PICCTD+E+SSVIKDLLDLS G  RYSALQKYV AGEKLYDEA QLEEK++HFERKIRRTYFHVK+LDA Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDF+E+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEMALMK KLDV+PSLYIHFSR KHM                 ITG  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLW-KQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGT
        E          ELINFVMV GAPKLI+LVDPIVANAISLKPDVS       +  I++L+ KQA+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    T
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLW-KQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGT

Query:  EALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVY
        EAL++TKG KQT DSTV NQ  KDG FDLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG      K+S ID IHSGEAE G E RVQQ+SPKV 
Subjt:  EALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVY

Query:  EHYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNE
        EHY EGGN V+LA MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS E
Subjt:  EHYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNE

Query:  VASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQE
        VASPSSSPSHDK IHTQAPSQFH+ AT NR WHHK  AGNLHHDS  RFQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQDY SES+ASQ+P VERSSQE
Subjt:  VASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQE

Query:  RSQIQSAQQQNFPTTSQPQIPSQ
         +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  RSQIQSAQQQNFPTTSQPQIPSQ

XP_022927450.1 uncharacterized protein LOC111434274 isoform X1 [Cucurbita moschata]0.0e+0079.93Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISEIE K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS   ELEALP+GE PICCTD+E+SSVIKDLLDLS G  RYSALQKYV AGEKLYDEA QLEEK++HFERKIRRTYFHVK+LDA Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDF+E+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEMALMK KLDV+PSLYIHFSR KHM                 ITG  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE
        E          ELINFVMV GAPKLI+LVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    TE
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE

Query:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE
        AL++TKG KQT DSTV NQ  KDG FDLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG      K+S ID IHSGEAE G E RVQQ+SPKV E
Subjt:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE

Query:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV
        HY EGGN V+LA MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS EV
Subjt:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV

Query:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER
        ASPSSSPSHDK IHTQAPSQFH+ AT NR WHHK  AGNLHHDS  RFQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQDY SES+ASQ+P VERSSQE 
Subjt:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER

Query:  SQIQSAQQQNFPTTSQPQIPSQ
        +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  SQIQSAQQQNFPTTSQPQIPSQ

XP_022927451.1 uncharacterized protein LOC111434274 isoform X2 [Cucurbita moschata]0.0e+0079.93Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISEIE K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS   ELEALP+GE PICCTD+E+SSVIKDLLDLS G  RYSALQKYV AGEKLYDEA QLEEK++HFERKIRRTYFHVK+LDA Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDF+E+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEMALMK KLDV+PSLYIHFSR KHM                 ITG  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE
        E          ELINFVMV GAPKLI+LVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    TE
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE

Query:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE
        AL++TKG KQT DSTV NQ  KDG FDLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG      K+S ID IHSGEAE G E RVQQ+SPKV E
Subjt:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE

Query:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV
        HY EGGN V+LA MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS EV
Subjt:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV

Query:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER
        ASPSSSPSHDK IHTQAPSQFH+ AT NR WHHK  AGNLHHDS  RFQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQDY SES+ASQ+P VERSSQE 
Subjt:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER

Query:  SQIQSAQQQNFPTTSQPQIPSQ
        +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  SQIQSAQQQNFPTTSQPQIPSQ

XP_023519752.1 uncharacterized protein LOC111783101 [Cucurbita pepo subsp. pepo]0.0e+0080.04Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISEIE K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS   ELEALP+GE PICCTD+E+SSVIKDLLDLSTG  RYS LQKYV AGEKLYDEA QLEEK++HFERKIRRTYFHVK+LDA Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDFVE+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEMALMK KLDVLPSLYIHFSR KHM                 I+G  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE
        E          ELINFVMV GAPKLINLVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    TE
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE

Query:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE
        AL++TKG KQT DSTV NQ  KDG FDLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG      KKS ID IHSGEAE   E RVQQ+SPKV E
Subjt:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE

Query:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV
        HY EGGN VELA MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKA  EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS EV
Subjt:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV

Query:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER
        ASPSSSPS DK IHTQAPSQFH+ AT NR WHHK  AGNLHHDS  RFQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQDY SES+ASQ+P VERSSQE 
Subjt:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER

Query:  SQIQSAQQQNFPTTSQPQIPSQ
        +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  SQIQSAQQQNFPTTSQPQIPSQ

TrEMBL top hitse value%identityAlignment
A0A6J1CDP7 uncharacterized protein LOC1110107370.0e+0077.51Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MAND+QLLDNS+TKAQPIESE AVGLDESKLHE VP+ GL+FD+WTSLISE EKKYPD I+KISLVYDSFLSEFPLCHGYWRKYAA K RLCSVDK V V
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IW DYCSFSISAFE+ SD+RRL RR ISFVGKDYLS+SLWDKYI FE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLST-GMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAG
         SL+E+I S+TGCNPS   ELEA PDGE+PI CTD+EV+SVIKDLLD  T    RYSAL+KYV AGEKLYDEACQLE KVVHFERKIRRTYFHVK+LDA 
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLST-GMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAG

Query:  QLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQL--D
        QLKNWHSYLDFVE+ GDFDWAVKLYERCLIPCANYPEFWMRY EFM++KGGREIAM ALERATE FLK+VP IHLFN+RFKEQI DLSGAR+AFLQL  D
Subjt:  QLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQL--D

Query:  GDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCK
         DLDS FVENI+LKANMEKRMGKSTAAFNVY+EALEMALMKKKLDVLPSLYIHFSR K+                  ITGS DAAMEVLIDGIRNVPLCK
Subjt:  GDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCK

Query:  LLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIP
        LLLE          ELI FVM+H APKLINLVDPIVANAISL  D S       +  I++L+ +AVDLCGTIHDVMKVWNRHIKLFPQSIR MSYED IP
Subjt:  LLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIP

Query:  GTEALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPK
        GTEALR+TKG KQT D+TVPNQ   DGDFD STQL LE+NK+S LE+QNFQNDQSANGNEPTS  LG H I MK+ TID I+S EAE   + RVQQ SPK
Subjt:  GTEALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPK

Query:  VYEHYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQIS
        V EHY EGGNGVEL+ MP++NSKED+YG+ LGQSLKN+SIGNLSL+PKN  KIDLLPKAS +GEAPLENSMSSESV +TDEGALMHNP G+RS GSI+IS
Subjt:  VYEHYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQIS

Query:  NEVASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSS
        NEVASPSSSPSHDK IHTQ PS+FHM    NRKWHHKR AGN H +S   F GHSRRR HRTWQGSP++YQ T+SGQ PD QDY S+S+ASQKP +ERSS
Subjt:  NEVASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSS

Query:  QERSQIQSAQQQNFPTTSQPQIPSQ
        QE +QIQSAQQQNFPTTSQ Q+PSQ
Subjt:  QERSQIQSAQQQNFPTTSQPQIPSQ

A0A6J1EHQ2 uncharacterized protein LOC111434274 isoform X20.0e+0079.93Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISEIE K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS   ELEALP+GE PICCTD+E+SSVIKDLLDLS G  RYSALQKYV AGEKLYDEA QLEEK++HFERKIRRTYFHVK+LDA Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDF+E+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEMALMK KLDV+PSLYIHFSR KHM                 ITG  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE
        E          ELINFVMV GAPKLI+LVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    TE
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE

Query:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE
        AL++TKG KQT DSTV NQ  KDG FDLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG      K+S ID IHSGEAE G E RVQQ+SPKV E
Subjt:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE

Query:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV
        HY EGGN V+LA MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS EV
Subjt:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV

Query:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER
        ASPSSSPSHDK IHTQAPSQFH+ AT NR WHHK  AGNLHHDS  RFQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQDY SES+ASQ+P VERSSQE 
Subjt:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER

Query:  SQIQSAQQQNFPTTSQPQIPSQ
        +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  SQIQSAQQQNFPTTSQPQIPSQ

A0A6J1ENZ2 uncharacterized protein LOC111434274 isoform X10.0e+0079.93Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISEIE K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS   ELEALP+GE PICCTD+E+SSVIKDLLDLS G  RYSALQKYV AGEKLYDEA QLEEK++HFERKIRRTYFHVK+LDA Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDF+E+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEMALMK KLDV+PSLYIHFSR KHM                 ITG  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE
        E          ELINFVMV GAPKLI+LVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    TE
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE

Query:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE
        AL++TKG KQT DSTV NQ  KDG FDLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG      K+S ID IHSGEAE G E RVQQ+SPKV E
Subjt:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE

Query:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV
        HY EGGN V+LA MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS EV
Subjt:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV

Query:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER
        ASPSSSPSHDK IHTQAPSQFH+ AT NR WHHK  AGNLHHDS  RFQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQDY SES+ASQ+P VERSSQE 
Subjt:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER

Query:  SQIQSAQQQNFPTTSQPQIPSQ
        +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  SQIQSAQQQNFPTTSQPQIPSQ

A0A6J1KIG4 uncharacterized protein LOC111495535 isoform X20.0e+0079.5Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISE E K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS + ELEALP+GE PICCTD E+SSVIKDLLDLSTG  RYSALQKYV AGEKLYDEA QLEEK++HFE KIRRTYFHVK+L+A Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDFVE+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEM LMK KLDVLPSLYIHFSR KHM                 ITG  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE
        E          ELINFVMV GAP LINLVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    TE
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE

Query:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE
        AL++TKG KQT DSTV NQ  KDG  DLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG     MKKS ID IHSGEAE   E RVQQ+SPKV E
Subjt:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE

Query:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV
        HY EGGN VEL  MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS EV
Subjt:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV

Query:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER
        ASPSSSPSHDK IHTQAPSQFH  AT NR WHHK  +GNLHHDS  +FQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQD+ SES+ASQ+P VERSSQE 
Subjt:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER

Query:  SQIQSAQQQNFPTTSQPQIPSQ
        +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  SQIQSAQQQNFPTTSQPQIPSQ

A0A6J1KL11 uncharacterized protein LOC111495535 isoform X10.0e+0079.5Show/hide
Query:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV
        MANDLQLL+NSSTKAQP ES+ AVGLDESKLHE VPKCGLNFDEWTSLISE E K+ DVIE+ISLVYDSFLSEFPLCHGYWRKYAAHK RLCSVDKV+DV
Subjt:  MANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDV

Query:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT
        FEQAVQSATYSV IWVDYCSFSIS FE+P+DVRRL +R ISFVGKDYLS+SLWDKYIEFE SQQQWDSLALIYIQTLRFPTKKLSYYH      SFRKLT
Subjt:  FEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLT

Query:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ
         SLRENI S+TGCNPS + ELEALP+GE PICCTD E+SSVIKDLLDLSTG  RYSALQKYV AGEKLYDEA QLEEK++HFE KIRRTYFHVK+L+A Q
Subjt:  TSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQ

Query:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL
        LKNWHSYLDFVE+ GDFDWAVKLYERCLIPCA+YPEFWMRYVEFME KGGRE+AMFALERAT+ FLKRVPVIHLFN+RFKEQIRDLSGAR AFL LDGDL
Subjt:  LKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDL

Query:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL
        DS FVENI+LKANMEKRMGKSTAA NVY+EALEM LMK KLDVLPSLYIHFSR KHM                 ITG  DAA+EVLIDGIRNVPLCKLLL
Subjt:  DSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLL

Query:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE
        E          ELINFVMV GAP LINLVDPIVANAISLKPDVS       +  I++L+ +A+DLCGTIHDVM+VWNRHIKLFPQSIRAM YED    TE
Subjt:  ELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTE

Query:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE
        AL++TKG KQT DSTV NQ  KDG  DLSTQLPLEENK+SL  NQNFQNDQS+NGNEP SCLLG     MKKS ID IHSGEAE   E RVQQ+SPKV E
Subjt:  ALRVTKGRKQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYE

Query:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV
        HY EGGN VEL  MPM+NSKEDEYGNALGQ+LKNLSIG+LSLSPKN  KID+LPKAS EGEAP ENSMSSESVCNTDEGAL+HNP+GVRS GSIQIS EV
Subjt:  HYREGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEV

Query:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER
        ASPSSSPSHDK IHTQAPSQFH  AT NR WHHK  +GNLHHDS  +FQ HSRRR HRTWQ SPR+YQ  RSGQTPDSQD+ SES+ASQ+P VERSSQE 
Subjt:  ASPSSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQER

Query:  SQIQSAQQQNFPTTSQPQIPSQ
        +QIQSAQQQNFPT  Q Q+PSQ
Subjt:  SQIQSAQQQNFPTTSQPQIPSQ

SwissProt top hitse value%identityAlignment
O74970 Pre-mRNA-processing factor 391.6e-4529.57Show/hide
Query:  KLHEDVPKCGLNFDEWTSLISEIE--------KKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSF
        K +  + K   +FD W  L+   E              I  +  VYD FL ++PL  GYW+KYA  +  +   +    ++E+ +    +SVD+W +YC+F
Subjt:  KLHEDVPKCGLNFDEWTSLISEIE--------KKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSF

Query:  SISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRK-----LTTSLRENIHSETGCNPS
         +    + ++VR L  +G + VG D+LSH  WDKY+EFE+ Q++ D++  +  + +  P  + + Y      VS  +     L   +  +I ++    P+
Subjt:  SISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRK-----LTTSLRENIHSETGCNPS

Query:  KSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVV-------HFERKIRRTYFHVKELDAGQLKNWHSYLD
        K             +    ++  +V +  L++   M+             ++Y+   Q+ +KV         FE +I+R YFHVKELD  QL NW  YLD
Subjt:  KSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVV-------HFERKIRRTYFHVKELDAGQLKNWHSYLD

Query:  FVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGR-EIAMFALERATEIFLK-RVPVIHLFNARFKEQIRDLSGARTAF----LQLDGDLDS
        F E+ GDF     LYERCLI CA Y EFW RY  +M  +           ERA+ IF     P I +  A F+E   +++ A+  +     QL G+L++
Subjt:  FVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGR-EIAMFALERATEIFLK-RVPVIHLFNARFKEQIRDLSGARTAF----LQLDGDLDS

Q1JPZ7 Pre-mRNA-processing factor 391.6e-3726.52Show/hide
Query:  NDLQLLDNSSTKAQP-IESEPAVGLDE---------SKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLC
        ++++L D     A+P  E++PA   +          SK+ ED P+   +F+ W  L+  +E++  + +      +D+F   +P C+GYW+KYA  + +  
Subjt:  NDLQLLDNSSTKAQP-IESEPAVGLDE---------SKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLC

Query:  SVDKVVDVFEQAVQSATYSVDIWVDYCSF-----SISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYY
         +    +V+ + +Q+   SVD+W+ Y +F       S  E  S +R      +   G D+ S  LW+ YI +E  Q +  ++  IY + L  PT+  S +
Subjt:  SVDKVVDVFEQAVQSATYSVDIWVDYCSF-----SISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYY

Query:  HGSIASVSFRKLTTSLRENIHSETGCNPSKSMELE-------ALPDGEIPICCTDTEVSSVIKDL----LDLSTGMKRYSALQ----KYVRAGEKLYDEA
                F+K    ++ N       NP   +  E        L +   P    D E  +  ++L     DL    KR + ++    K +   +++++  
Subjt:  HGSIASVSFRKLTTSLRENIHSETGCNPSKSMELE-------ALPDGEIPICCTDTEVSSVIKDL----LDLSTGMKRYSALQ----KYVRAGEKLYDEA

Query:  CQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVI
             K   FE  I+R YFHVK L+  QL NW  YLDF   NG  +  V L+ERCLI CA Y EFW++Y +++E+    E      ++A  + L + P +
Subjt:  CQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVI

Query:  HLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEAL
        HL  A F+EQ   +  AR+    ++  +    +  +  + ++E+R G    A  + ++A+
Subjt:  HLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEAL

Q4KLU2 Pre-mRNA-processing factor 397.6e-4328.17Show/hide
Query:  PAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSF
        P +  D  K  + V     +F+ WT L+  +E++  + +      +D+FL+ +P C+GYW+KYA  + +  ++ +  +V+ + +Q+ T SVD+W+ Y +F
Subjt:  PAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSF

Query:  SISAFEEPSD------VRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSL-----RENIHSE
         +    +P+D      +R      +   G D+ S  LW+ YI +E  Q     +  IY + L  PT+   +Y     S+ F++    +     RE + SE
Subjt:  SISAFEEPSD------VRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSL-----RENIHSE

Query:  TGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEE----KVVHFERKIRRTYFHVKELDAGQLKNWHS
              K +    L  G      T+ ++ S ++++ D +   KR + ++       +++ E   L E    K+ +FE +I+R YFHVK L+  QL NW  
Subjt:  TGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEE----KVVHFERKIRRTYFHVKELDAGQLKNWHS

Query:  YLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVE
        YL+F   NG  +  V L+ERC+I CA Y EFW++Y ++ME     E       RA  + L + P++HL  A F+EQ  +L  AR     ++  ++   + 
Subjt:  YLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVE

Query:  NIVLKANMEKRMGKSTAAFNVYKEAL
         +  + N+E+R G    A ++ +EA+
Subjt:  NIVLKANMEKRMGKSTAAFNVYKEAL

Q7KRW8 Pre-mRNA-processing factor 392.0e-3526.01Show/hide
Query:  DNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSA
        DN+  K +  +  P    D  K    V +   +F  WT L+  ++ +     E     YD+FLS +P C+GYWRKYA ++ R         VFE+ +++ 
Subjt:  DNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSA

Query:  TYSVDIWVDYCSFSISAF-EEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTK-------------------------
          SVD+W+ Y     S   ++ + VR    R +   G ++ S  LWD YI +E   +++  +  IY + L  PT+                         
Subjt:  TYSVDIWVDYCSFSISAF-EEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTK-------------------------

Query:  ------------------KLSYYHGSIASVSFRKLTT-----------------SLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIK----
                          K S  H   +S S +   +                   RE +    G +P  + E +           T++E S        
Subjt:  ------------------KLSYYHGSIASVSFRKLTT-----------------SLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIK----

Query:  --DLLDLST--GMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFW-
          D  DLST    +  S   + + A  K++            FE  I+R YFHVK L+  QLKNW  YLDF    GD +  + L+ERCLI CA Y EFW 
Subjt:  --DLLDLST--GMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFW-

Query:  --MRYVEFMETKGG-REIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMA
          +RY+E +E + G  ++      RA  I     P +HL  A F+E   +   A     ++D    +  ++    + N+E+R G       +YK  +E  
Subjt:  --MRYVEFMETKGG-REIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMA

Query:  LMKKKLDVLPSLYIHFSRF
           K   +  SL I ++RF
Subjt:  LMKKKLDVLPSLYIHFSRF

Q86UA1 Pre-mRNA-processing factor 398.1e-3726.16Show/hide
Query:  ESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDY
        E+E     +  K  + V     +F  W  L+  +E++  + +      +D F   +P C+GYW+KYA  + R  ++    +V+ + +Q+   SVD+W+ Y
Subjt:  ESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDY

Query:  CSFSISAFEEPSD------VRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSY----YHGSIASVSFRKLTT-----SL
         +F +    +P D      +R      +   G D+ S  LW+ YI +E  Q     +  IY + L  PT+  S+    +   + +   R L T      L
Subjt:  CSFSISAFEEPSD------VRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSY----YHGSIASVSFRKLTT-----SL

Query:  RENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKN
        R  + S  G +       + LP G   I    T+ + +I ++ ++           + +   +++++       K   FE  I+R YFHVK L+  QLKN
Subjt:  RENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKN

Query:  WHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQ
        W  YL+F   NG  +  V L+ERC+I CA Y EFW++Y ++ME     E       RA  I L + P++H+  A F+EQ  +++ AR      +  +   
Subjt:  WHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQ

Query:  FVENIVLKANMEKRMGKSTAAFNVYKEALEMA
         +  +  + ++E+R G    A ++ ++A++ A
Subjt:  FVENIVLKANMEKRMGKSTAAFNVYKEALEMA

Arabidopsis top hitse value%identityAlignment
AT1G04080.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-10534.11Show/hide
Query:  PSGTFGDSAGFFHLVAALLSPVLCQTKAI---AMANDLQLLDNSSTKAQPIESEPA----------VGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYP
        P G +  SA          +P++  + ++   ++ N    ++N S       + PA          +  +E +L   V    L F+ WT+LI E E+   
Subjt:  PSGTFGDSAGFFHLVAALLSPVLCQTKAI---AMANDLQLLDNSSTKAQPIESEPA----------VGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYP

Query:  DVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYI
        D I KI  VYD+FL+EFPLC+GYW+K+A H+AR+ ++DKVV+V+E+AV   TYSVDIW+ YC+F+I+ + +P  +RRL  R + +VG D+LS  LWDKYI
Subjt:  DVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYI

Query:  EFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLR-----------------ENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSS
        E+E  QQ W  +ALIY + L  P + L  Y  S   ++  +  + LR                 E+  SE+G    K+ E  +  DG        TE S 
Subjt:  EFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLR-----------------ENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSS

Query:  VIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMR
         ++     S        L+KYV   E +Y ++ + E K++ +E  IRR YFHV+ L+  +L+NWH+YLDF+E +GDF+  VKLYERC++ CANYPE+W+R
Subjt:  VIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMR

Query:  YVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKK
        YV  ME  G  ++A  AL RAT++F+K+ P IHLF AR KEQ  D++GAR A+  +  ++    +E ++  ANME R+G    AF++Y++ + +   K+ 
Subjt:  YVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKK

Query:  LDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLK
          +LP LY  +SRF ++                 ++   + A  ++++ + +V   K          PL++ LI+F  +   P+ I+ ++P+V   I  K
Subjt:  LDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLK

Query:  PDVSKVRITS---------LWKQAVDLCGTIHDVMKVWNRHIKLF
        PD     I S         ++ + + + G +  + K  ++H+KLF
Subjt:  PDVSKVRITS---------LWKQAVDLCGTIHDVMKVWNRHIKLF

AT1G04080.2 Tetratricopeptide repeat (TPR)-like superfamily protein4.4e-7033.26Show/hide
Query:  RLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLR-----------------ENIHSETGCNPS
        RL  R + +VG D+LS  LWDKYIE+E  QQ W  +ALIY + L  P + L  Y  S   ++  +  + LR                 E+  SE+G    
Subjt:  RLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLR-----------------ENIHSETGCNPS

Query:  KSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGD
        K+ E  +  DG        TE S  ++     S        L+KYV   E +Y ++ + E K++ +E  IRR YFHV+ L+  +L+NWH+YLDF+E +GD
Subjt:  KSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGD

Query:  FDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEK
        F+  VKLYERC++ CANYPE+W+RYV  ME  G  ++A  AL RAT++F+K+ P IHLF AR KEQ  D++GAR A+  +  ++    +E ++  ANME 
Subjt:  FDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIVLKANMEK

Query:  RMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLPLLQELINF
        R+G    AF++Y++ + +   K+   +LP LY  +SRF ++                 ++   + A  ++++ + +V   K          PL++ LI+F
Subjt:  RMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLPLLQELINF

Query:  VMVHGAPKLINLVDPIVANAISLKPDVSKVRITS---------LWKQAVDLCGTIHDVMKVWNRHIKLF
          +   P+ I+ ++P+V   I  KPD     I S         ++ + + + G +  + K  ++H+KLF
Subjt:  VMVHGAPKLINLVDPIVANAISLKPDVSKVRITS---------LWKQAVDLCGTIHDVMKVWNRHIKLF

AT1G04080.3 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-9831.57Show/hide
Query:  PSGTFGDSAGFFHLVAALLSPVLCQTKAI---AMANDLQLLDNSSTKAQPIESEPA----------VGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYP
        P G +  SA          +P++  + ++   ++ N    ++N S       + PA          +  +E +L   V    L F+ WT+LI E E+   
Subjt:  PSGTFGDSAGFFHLVAALLSPVLCQTKAI---AMANDLQLLDNSSTKAQPIESEPA----------VGLDESKLHEDVPKCGLNFDEWTSLISEIEKKYP

Query:  DVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYI
        D I KI  VYD+FL+EFPLC+GYW+K+A H+AR+ ++DKVV+V+E+AV   TYSVDIW+ YC+F+I+ + +P  +RRL  R + +VG D+LS  LWDKYI
Subjt:  DVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYI

Query:  EFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLR-----------------ENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSS
        E+E  QQ W  +ALIY + L  P + L  Y  S   ++  +  + LR                 E+  SE+G    K+ E  +  DG        TE S 
Subjt:  EFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLR-----------------ENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSS

Query:  VIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFD----------------------
         ++     S        L+KYV   E +Y ++ + E K++ +E  IRR YFHV+ L+  +L+NWH+YLDF+E +GDF+                      
Subjt:  VIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFD----------------------

Query:  W---------------------------------AVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRD
        W                                  VKLYERC++ CANYPE+W+RYV  ME  G  ++A  AL RAT++F+K+ P IHLF AR KEQ  D
Subjt:  W---------------------------------AVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRD

Query:  LSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEV
        ++GAR A+  +  ++    +E ++  ANME R+G    AF++Y++ + +   K+   +LP LY  +SRF ++                 ++   + A  +
Subjt:  LSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEV

Query:  LIDGIRNVPLCKLLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVSKVRITS---------LWKQAVDLCGTIHDVMKVWNRHIKLF
        +++ + +V   K          PL++ LI+F  +   P+ I+ ++P+V   I  KPD     I S         ++ + + + G +  + K  ++H+KLF
Subjt:  LIDGIRNVPLCKLLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVSKVRITS---------LWKQAVDLCGTIHDVMKVWNRHIKLF

AT5G46400.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-14636.74Show/hide
Query:  QPIESEPAVG-----LDESKLHEDVPKCGLNFDEWTSLISEIE-KKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSAT
        +P++  P +      LD  +L E      L+FDEWT LISEIE   +PD IEK+ LVYD+FL EFPLCHGYWRKYA HK +LC+++  V+VFE+AVQ+AT
Subjt:  QPIESEPAVG-----LDESKLHEDVPKCGLNFDEWTSLISEIE-KKYPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSAT

Query:  YSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLRENIHS
        YSV +W+DYC+F+++A+E+P DV RL  RG+SF+GKDY   +LWDKYIE+   QQQW SLA +Y++TL++P+KKL  Y+      +FRK+  SL+E I  
Subjt:  YSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQWDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLRENIHS

Query:  ETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLD
            N   S   + + +  +    TD E+S V+++L+  S+      AL  Y+  GE+ Y ++ QL EK+  FE +IRR YFHVK LD  QL NWH+YL 
Subjt:  ETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQLEEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLD

Query:  FVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIV
        F E  GDFDWA+ LYERCLIPCANY EFW RYV+F+E+KGGRE+A FAL RA++ F+K   VIHLFNARFKE + D S A  A  +   +L   FVEN+ 
Subjt:  FVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRDLSGARTAFLQLDGDLDSQFVENIV

Query:  LKANMEKRMGKSTAAFNVYKEALEMALM-KKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLP
         KANMEKR+G   AA   Y+EAL   L+ K+ L+    LY+ FSR K++                 IT S D A ++L++G  NVP CKLLLE       
Subjt:  LKANMEKRMGKSTAAFNVYKEALEMALM-KKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPLCKLLLELHIWLLP

Query:  LLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTEALRVTKGR
           EL+  +M+HG  + ++L+DPI+   +S + D S       K  I++L+ + +DL GTIHDV K   RHIKLFP S RA        G     + + R
Subjt:  LLQELINFVMVHGAPKLINLVDPIVANAISLKPDVS-------KVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTEALRVTKGR

Query:  KQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGN----EPT-SCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYEHYR
        ++T +    +  T  G   +    P +E K S L++   Q+  +   +    EP   CL   H +    + I++    E+++ + + ++ N         
Subjt:  KQTTDSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGN----EPT-SCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYEHYR

Query:  EGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEVASP
        EGG       +P++ S   E+G    Q+       + S +  + VK D +       ++P ++  S ES+  T  G   ++ R +              P
Subjt:  EGGNGVELAQMPMENSKEDEYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEVASP

Query:  SSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRT--RSGQTPDSQDYNSESVASQKPHVERSSQERS
           P        +   Q HM  T              H D+    Q  + +     +Q S  +       S   P SQ +    + S  P   ++ Q + 
Subjt:  SSSPSHDKSIHTQAPSQFHMHATRNRKWHHKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRT--RSGQTPDSQDYNSESVASQKPHVERSSQERS

Query:  QIQSAQQQN---FPTTSQPQIPSQVTVYLAAESSTRNRY
        Q  ++Q Q    +P T  PQ P Q       +  +   Y
Subjt:  QIQSAQQQN---FPTTSQPQIPSQVTVYLAAESSTRNRY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTCGATCGACAATCCATACCTTTTGCTTGGCGCAATTGGTCCCAATGGAGACCCTTCTCGTTATTACTGCCATCGGGTACTTTTGGTGATTCTGCTGGTTTCTT
CCATCTGGTCGCTGCTCTGTTGAGTCCAGTTCTATGCCAAACCAAAGCTATTGCCATGGCGAATGATCTTCAACTTCTCGATAATTCAAGCACAAAAGCTCAACCAATTG
AATCAGAGCCCGCAGTTGGCTTAGATGAGTCCAAACTTCACGAAGATGTTCCCAAATGTGGATTAAATTTTGATGAATGGACTTCGCTTATTTCAGAGATTGAGAAAAAG
TATCCTGATGTCATTGAGAAGATTTCTTTGGTGTATGATTCATTCTTGTCTGAGTTTCCTCTGTGCCATGGGTACTGGAGAAAGTATGCAGCTCATAAAGCAAGGTTGTG
CTCTGTGGACAAGGTTGTTGATGTCTTTGAACAAGCAGTGCAATCAGCAACATACTCTGTTGATATTTGGGTTGACTACTGTAGCTTCAGCATATCAGCTTTTGAAGAAC
CATCTGATGTTCGTAGATTGTTGAGGAGGGGAATATCCTTTGTCGGAAAGGACTATTTAAGCCATAGCTTGTGGGACAAGTACATTGAGTTTGAGCAATCTCAACAGCAG
TGGGATTCTTTAGCTCTGATTTACATCCAAACTCTTAGATTTCCTACCAAAAAGTTGTCTTATTACCACGGCAGTATTGCATCTGTCAGCTTTAGAAAATTGACTACTTC
TCTGAGAGAGAATATCCATTCTGAGACTGGGTGCAATCCTTCAAAGTCCATGGAATTAGAAGCTTTGCCTGATGGTGAAATCCCCATCTGTTGTACAGACACTGAAGTAT
CTTCTGTCATTAAAGACCTACTGGATCTGTCTACTGGCATGAAAAGGTATAGTGCACTGCAGAAGTACGTGCGTGCTGGGGAAAAACTCTATGATGAAGCATGTCAATTG
GAGGAAAAAGTTGTTCACTTTGAGCGTAAAATCAGGAGGACATATTTTCATGTAAAAGAACTTGATGCTGGTCAACTGAAGAATTGGCATTCTTATCTGGACTTTGTGGA
GATAAATGGAGATTTTGACTGGGCGGTTAAACTCTACGAGAGATGCTTAATTCCCTGTGCTAATTATCCTGAGTTTTGGATGCGCTATGTGGAGTTTATGGAAACTAAGG
GTGGAAGAGAGATAGCAATGTTTGCTCTCGAGCGTGCAACAGAAATATTTCTTAAGAGAGTTCCTGTTATCCACCTTTTTAATGCAAGGTTTAAGGAACAAATTAGAGAT
TTATCTGGTGCACGCACTGCTTTTCTTCAGCTTGATGGAGATTTAGATTCTCAATTTGTGGAAAATATTGTATTGAAGGCTAATATGGAGAAACGAATGGGAAAATCTAC
AGCAGCTTTTAATGTTTACAAAGAAGCACTGGAAATGGCTTTGATGAAGAAGAAATTGGATGTTCTACCATCTCTATATATTCATTTTTCTCGATTTAAACATATGCACT
CTCTTGTTATGCTCGTTAGAACTGCAGTTACTGACTTTGATTTACAGATTACAGGAAGTGTAGATGCTGCTATGGAAGTCTTAATAGATGGGATCAGAAATGTACCTCTC
TGCAAATTACTTCTTGAGCTCCATATTTGGCTCCTGCCCCTTCTACAGGAACTTATAAACTTCGTAATGGTGCATGGAGCGCCAAAGCTTATAAATTTAGTTGATCCCAT
CGTAGCTAACGCAATATCTCTCAAGCCAGACGTATCTAAAGTACGGATCACTTCTCTATGGAAGCAGGCTGTTGACCTGTGTGGAACCATCCATGATGTAATGAAGGTGT
GGAATCGTCATATTAAATTGTTTCCACAGTCTATCAGAGCAATGTCATATGAAGACCTTATTCCAGGGACAGAAGCCTTAAGAGTGACCAAGGGAAGAAAACAGACAACA
GATTCTACTGTACCCAACCAGTCAACCAAAGATGGTGATTTTGATCTATCAACTCAGCTTCCTTTAGAAGAGAATAAACGGTCTCTGTTAGAAAACCAAAACTTCCAGAA
TGACCAATCTGCCAATGGGAACGAACCAACATCCTGTTTACTGGGAAAGCACACGATCAGTATGAAAAAGTCTACTATTGATCAGATTCATTCAGGAGAAGCTGAAACTG
GTGTCGAGGTAAGAGTGCAGCAGAATTCTCCAAAAGTTTATGAGCATTATAGAGAAGGTGGAAATGGGGTTGAATTAGCACAAATGCCTATGGAAAACTCAAAAGAAGAT
GAGTACGGTAATGCTTTGGGACAGAGCTTGAAAAATCTTTCAATTGGGAATCTTTCTTTAAGCCCCAAGAACAAAGTGAAAATAGATTTACTCCCCAAAGCATCTCGCGA
AGGGGAAGCTCCCTTGGAAAACAGTATGTCTAGTGAAAGTGTCTGCAATACAGATGAAGGGGCTTTAATGCACAACCCACGGGGTGTCAGATCTTATGGTTCCATCCAGA
TTTCTAATGAAGTGGCTAGTCCATCATCATCTCCAAGCCACGATAAATCTATACACACCCAAGCACCTTCACAGTTTCACATGCATGCGACTCGAAATAGGAAGTGGCAC
CATAAACGTATTGCTGGTAACTTACATCATGACTCTCACCGCCGTTTTCAGGGACATTCACGGAGAAGGTCTCATCGAACATGGCAAGGTTCTCCTCGAGAGTACCAGCG
AACAAGATCTGGTCAAACACCAGATAGTCAAGATTATAACTCTGAATCTGTTGCTTCACAAAAACCACATGTTGAACGAAGCAGCCAAGAACGCAGTCAGATTCAATCTG
CACAGCAGCAGAACTTCCCCACTACTTCTCAGCCTCAAATTCCTTCTCAAGTCACGGTCTACTTGGCTGCTGAATCTTCAACGAGAAACAGATACAAGTTACGGTCTGAA
TTGTCAGAGCCTCTTGCTTCCTCGAGGGGCCAAGCTTGTCTGCCAAAGTCCCATTGGCGGCACCGAAGAATGAGATGCTTAGTGTTTAATCCAAAGCTTCGTAAGTTCTT
GCCTAATGCAGAGGGTTTTATGAATTGTCGACAAAGGCTGGATGACAACAGGGCTGTCAATGGAGGAGACGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTTCGATCGACAATCCATACCTTTTGCTTGGCGCAATTGGTCCCAATGGAGACCCTTCTCGTTATTACTGCCATCGGGTACTTTTGGTGATTCTGCTGGTTTCTT
CCATCTGGTCGCTGCTCTGTTGAGTCCAGTTCTATGCCAAACCAAAGCTATTGCCATGGCGAATGATCTTCAACTTCTCGATAATTCAAGCACAAAAGCTCAACCAATTG
AATCAGAGCCCGCAGTTGGCTTAGATGAGTCCAAACTTCACGAAGATGTTCCCAAATGTGGATTAAATTTTGATGAATGGACTTCGCTTATTTCAGAGATTGAGAAAAAG
TATCCTGATGTCATTGAGAAGATTTCTTTGGTGTATGATTCATTCTTGTCTGAGTTTCCTCTGTGCCATGGGTACTGGAGAAAGTATGCAGCTCATAAAGCAAGGTTGTG
CTCTGTGGACAAGGTTGTTGATGTCTTTGAACAAGCAGTGCAATCAGCAACATACTCTGTTGATATTTGGGTTGACTACTGTAGCTTCAGCATATCAGCTTTTGAAGAAC
CATCTGATGTTCGTAGATTGTTGAGGAGGGGAATATCCTTTGTCGGAAAGGACTATTTAAGCCATAGCTTGTGGGACAAGTACATTGAGTTTGAGCAATCTCAACAGCAG
TGGGATTCTTTAGCTCTGATTTACATCCAAACTCTTAGATTTCCTACCAAAAAGTTGTCTTATTACCACGGCAGTATTGCATCTGTCAGCTTTAGAAAATTGACTACTTC
TCTGAGAGAGAATATCCATTCTGAGACTGGGTGCAATCCTTCAAAGTCCATGGAATTAGAAGCTTTGCCTGATGGTGAAATCCCCATCTGTTGTACAGACACTGAAGTAT
CTTCTGTCATTAAAGACCTACTGGATCTGTCTACTGGCATGAAAAGGTATAGTGCACTGCAGAAGTACGTGCGTGCTGGGGAAAAACTCTATGATGAAGCATGTCAATTG
GAGGAAAAAGTTGTTCACTTTGAGCGTAAAATCAGGAGGACATATTTTCATGTAAAAGAACTTGATGCTGGTCAACTGAAGAATTGGCATTCTTATCTGGACTTTGTGGA
GATAAATGGAGATTTTGACTGGGCGGTTAAACTCTACGAGAGATGCTTAATTCCCTGTGCTAATTATCCTGAGTTTTGGATGCGCTATGTGGAGTTTATGGAAACTAAGG
GTGGAAGAGAGATAGCAATGTTTGCTCTCGAGCGTGCAACAGAAATATTTCTTAAGAGAGTTCCTGTTATCCACCTTTTTAATGCAAGGTTTAAGGAACAAATTAGAGAT
TTATCTGGTGCACGCACTGCTTTTCTTCAGCTTGATGGAGATTTAGATTCTCAATTTGTGGAAAATATTGTATTGAAGGCTAATATGGAGAAACGAATGGGAAAATCTAC
AGCAGCTTTTAATGTTTACAAAGAAGCACTGGAAATGGCTTTGATGAAGAAGAAATTGGATGTTCTACCATCTCTATATATTCATTTTTCTCGATTTAAACATATGCACT
CTCTTGTTATGCTCGTTAGAACTGCAGTTACTGACTTTGATTTACAGATTACAGGAAGTGTAGATGCTGCTATGGAAGTCTTAATAGATGGGATCAGAAATGTACCTCTC
TGCAAATTACTTCTTGAGCTCCATATTTGGCTCCTGCCCCTTCTACAGGAACTTATAAACTTCGTAATGGTGCATGGAGCGCCAAAGCTTATAAATTTAGTTGATCCCAT
CGTAGCTAACGCAATATCTCTCAAGCCAGACGTATCTAAAGTACGGATCACTTCTCTATGGAAGCAGGCTGTTGACCTGTGTGGAACCATCCATGATGTAATGAAGGTGT
GGAATCGTCATATTAAATTGTTTCCACAGTCTATCAGAGCAATGTCATATGAAGACCTTATTCCAGGGACAGAAGCCTTAAGAGTGACCAAGGGAAGAAAACAGACAACA
GATTCTACTGTACCCAACCAGTCAACCAAAGATGGTGATTTTGATCTATCAACTCAGCTTCCTTTAGAAGAGAATAAACGGTCTCTGTTAGAAAACCAAAACTTCCAGAA
TGACCAATCTGCCAATGGGAACGAACCAACATCCTGTTTACTGGGAAAGCACACGATCAGTATGAAAAAGTCTACTATTGATCAGATTCATTCAGGAGAAGCTGAAACTG
GTGTCGAGGTAAGAGTGCAGCAGAATTCTCCAAAAGTTTATGAGCATTATAGAGAAGGTGGAAATGGGGTTGAATTAGCACAAATGCCTATGGAAAACTCAAAAGAAGAT
GAGTACGGTAATGCTTTGGGACAGAGCTTGAAAAATCTTTCAATTGGGAATCTTTCTTTAAGCCCCAAGAACAAAGTGAAAATAGATTTACTCCCCAAAGCATCTCGCGA
AGGGGAAGCTCCCTTGGAAAACAGTATGTCTAGTGAAAGTGTCTGCAATACAGATGAAGGGGCTTTAATGCACAACCCACGGGGTGTCAGATCTTATGGTTCCATCCAGA
TTTCTAATGAAGTGGCTAGTCCATCATCATCTCCAAGCCACGATAAATCTATACACACCCAAGCACCTTCACAGTTTCACATGCATGCGACTCGAAATAGGAAGTGGCAC
CATAAACGTATTGCTGGTAACTTACATCATGACTCTCACCGCCGTTTTCAGGGACATTCACGGAGAAGGTCTCATCGAACATGGCAAGGTTCTCCTCGAGAGTACCAGCG
AACAAGATCTGGTCAAACACCAGATAGTCAAGATTATAACTCTGAATCTGTTGCTTCACAAAAACCACATGTTGAACGAAGCAGCCAAGAACGCAGTCAGATTCAATCTG
CACAGCAGCAGAACTTCCCCACTACTTCTCAGCCTCAAATTCCTTCTCAAGTCACGGTCTACTTGGCTGCTGAATCTTCAACGAGAAACAGATACAAGTTACGGTCTGAA
TTGTCAGAGCCTCTTGCTTCCTCGAGGGGCCAAGCTTGTCTGCCAAAGTCCCATTGGCGGCACCGAAGAATGAGATGCTTAGTGTTTAATCCAAAGCTTCGTAAGTTCTT
GCCTAATGCAGAGGGTTTTATGAATTGTCGACAAAGGCTGGATGACAACAGGGCTGTCAATGGAGGAGACGTGTGA
Protein sequenceShow/hide protein sequence
MRFDRQSIPFAWRNWSQWRPFSLLLPSGTFGDSAGFFHLVAALLSPVLCQTKAIAMANDLQLLDNSSTKAQPIESEPAVGLDESKLHEDVPKCGLNFDEWTSLISEIEKK
YPDVIEKISLVYDSFLSEFPLCHGYWRKYAAHKARLCSVDKVVDVFEQAVQSATYSVDIWVDYCSFSISAFEEPSDVRRLLRRGISFVGKDYLSHSLWDKYIEFEQSQQQ
WDSLALIYIQTLRFPTKKLSYYHGSIASVSFRKLTTSLRENIHSETGCNPSKSMELEALPDGEIPICCTDTEVSSVIKDLLDLSTGMKRYSALQKYVRAGEKLYDEACQL
EEKVVHFERKIRRTYFHVKELDAGQLKNWHSYLDFVEINGDFDWAVKLYERCLIPCANYPEFWMRYVEFMETKGGREIAMFALERATEIFLKRVPVIHLFNARFKEQIRD
LSGARTAFLQLDGDLDSQFVENIVLKANMEKRMGKSTAAFNVYKEALEMALMKKKLDVLPSLYIHFSRFKHMHSLVMLVRTAVTDFDLQITGSVDAAMEVLIDGIRNVPL
CKLLLELHIWLLPLLQELINFVMVHGAPKLINLVDPIVANAISLKPDVSKVRITSLWKQAVDLCGTIHDVMKVWNRHIKLFPQSIRAMSYEDLIPGTEALRVTKGRKQTT
DSTVPNQSTKDGDFDLSTQLPLEENKRSLLENQNFQNDQSANGNEPTSCLLGKHTISMKKSTIDQIHSGEAETGVEVRVQQNSPKVYEHYREGGNGVELAQMPMENSKED
EYGNALGQSLKNLSIGNLSLSPKNKVKIDLLPKASREGEAPLENSMSSESVCNTDEGALMHNPRGVRSYGSIQISNEVASPSSSPSHDKSIHTQAPSQFHMHATRNRKWH
HKRIAGNLHHDSHRRFQGHSRRRSHRTWQGSPREYQRTRSGQTPDSQDYNSESVASQKPHVERSSQERSQIQSAQQQNFPTTSQPQIPSQVTVYLAAESSTRNRYKLRSE
LSEPLASSRGQACLPKSHWRHRRMRCLVFNPKLRKFLPNAEGFMNCRQRLDDNRAVNGGDV