CuGenDBv2

Sed0023954 (gene) of Chayote v1 genome

Gene ID: Sed0023954
Organism: Sechium edule (Chayote v1)
Description: Protein Ycf2-like
Genome location: LG11:5051521..5053040
RNA-Seq Expression: Sed0023954
Synteny: Sed0023954
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985
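The genome location above is given as a sequence name followed by a start..end pair. The sketch below splits such a string and computes the span it covers. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG11:5051521..5053040")
print(seqid, span_length(start, end))  # LG11 1520
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, one base shorter.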


Homology
GenBank top hits: e value, % identity, alignment
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa] | e value: 4.7e-52 | % identity: 32.64
Query:  RSRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDR
        R+   KINL  KS+++ +I  +LG  +  + R+  FG  L+ S+   SSQLLLHLIQR CK K   +L F IGG+ L FGLREF LITGL C +I  I+ 
Subjt:  RSRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDR

Query:  ASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSN
          + GGGRL+  YFE ++ V R  LN+ F ++     DD +K++ LY LESFL+PK +   ++ DHI M DD E F+ YPWG VA++ LV  +     S 
Subjt:  ASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSN

Query:  GNCTVGMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQF--------REAEL
        G   + MGG  + +L WAYEV+  LS+ P F+                                  LEV  ++AT +E+ MP+FA F        +EAE 
Subjt:  GNCTVGMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQF--------REAEL

Query:  AAR-----ERISPLVCTRPTSSYVHMGFTSAWLENSEMARR---------LDMIEETQKELNRRLSILIDVVNNIARCMQLVNGTMYDNQIIHATQTLSV
          R     + I+ +   R   S   +      +E  E++++         L+ ++  + ++N R   L+  +N I   M+      +            V
Subjt:  AAR-----ERISPLVCTRPTSSYVHMGFTSAWLENSEMARR---------LDMIEETQKELNRRLSILIDVVNNIARCMQLVNGTMYDNQIIHATQTLSV

Query:  EHDTEIKNDGKKRQEVADNHVIHKVDQHRDRD
        E D E  +      + ++  V+ K D   D+D
Subjt:  EHDTEIKNDGKKRQEVADNHVIHKVDQHRDRD

KGN48800.2 hypothetical protein Csa_003918 [Cucumis sativus] | e value: 1.3e-38 | % identity: 32.77
Query:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG
        +INL  K  +++ I  +L      + + +CFG  LD  + K SSQL  HLI+RQC +K   EL F + G+   FG+++F LITGLNCG++  ID + ++ 
Subjt:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG

Query:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV
         G+  K YF   + ++R  L+  F    +    DV+K++ LY LE F+L K  R GI  ++  + DD + F+ YPWG ++Y+  V  ++ +  SN    +
Subjt:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV

Query:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL
        G+GG  YALL WAYE +  L+    F                                    +V  L+AT  E+EMPY   F   + +  + ISP+
Subjt:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL

XP_031743197.1 uncharacterized protein LOC101221625 isoform X9 [Cucumis sativus] | e value: 1.3e-38 | % identity: 32.77
Query:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG
        +INL  K  +++ I  +L      + + +CFG  LD  + K SSQL  HLI+RQC +K   EL F + G+   FG+++F LITGLNCG++  ID + ++ 
Subjt:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG

Query:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV
         G+  K YF   + ++R  L+  F    +    DV+K++ LY LE F+L K  R GI  ++  + DD + F+ YPWG ++Y+  V  ++ +  SN    +
Subjt:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV

Query:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL
        G+GG  YALL WAYE +  L+    F                                    +V  L+AT  E+EMPY   F   + +  + ISP+
Subjt:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL

XP_031743205.1 uncharacterized protein LOC101221625 isoform X17 [Cucumis sativus] | e value: 1.3e-38 | % identity: 32.77
Query:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG
        +INL  K  +++ I  +L      + + +CFG  LD  + K SSQL  HLI+RQC +K   EL F + G+   FG+++F LITGLNCG++  ID + ++ 
Subjt:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG

Query:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV
         G+  K YF   + ++R  L+  F    +    DV+K++ LY LE F+L K  R GI  ++  + DD + F+ YPWG ++Y+  V  ++ +  SN    +
Subjt:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV

Query:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL
        G+GG  YALL WAYE +  L+    F                                    +V  L+AT  E+EMPY   F   + +  + ISP+
Subjt:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL

XP_031743208.1 uncharacterized protein LOC101221625 isoform X20 [Cucumis sativus] | e value: 1.3e-38 | % identity: 32.77
Query:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG
        +INL  K  +++ I  +L      + + +CFG  LD  + K SSQL  HLI+RQC +K   EL F + G+   FG+++F LITGLNCG++  ID + ++ 
Subjt:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG

Query:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV
         G+  K YF   + ++R  L+  F    +    DV+K++ LY LE F+L K  R GI  ++  + DD + F+ YPWG ++Y+  V  ++ +  SN    +
Subjt:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV

Query:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL
        G+GG  YALL WAYE +  L+    F                                    +V  L+AT  E+EMPY   F   + +  + ISP+
Subjt:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL

TrEMBL top hits: e value, % identity, alignment
A0A0A0KI50 TF-B3 domain-containing protein | e value: 6.4e-39 | % identity: 32.77
Query:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG
        +INL  K  +++ I  +L      + + +CFG  LD  + K SSQL  HLI+RQC +K   EL F + G+   FG+++F LITGLNCG++  ID + ++ 
Subjt:  KINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG

Query:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV
         G+  K YF   + ++R  L+  F    +    DV+K++ LY LE F+L K  R GI  ++  + DD + F+ YPWG ++Y+  V  ++ +  SN    +
Subjt:  GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTV

Query:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL
        G+GG  YALL WAYE +  L+    F                                    +V  L+AT  E+EMPY   F   + +  + ISP+
Subjt:  GMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQFREAELAARERISPL

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X5 | e value: 1.2e-37 | % identity: 33.22
Query:  SRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRA
        S + +INL  K  +++ I  +L      + + +CFG  LD  V K SSQL  HLI+RQC +K   EL F + G+   FG+++F LITGLNCG++  ID +
Subjt:  SRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRA

Query:  SLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNG
         ++  G+  K YF   + ++RT L+  F    +    DV+K++ LY LE F+L K  R GI  ++  + DD E F+ YPWG ++Y+  +  ++ A  SN 
Subjt:  SLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNG

Query:  NCTVGMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQF
           +G+GG  +AL  WAYE +  L+    F+                                   +V  L+AT+ E+EM Y   F
Subjt:  NCTVGMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQF

A0A5A7THT9 Protein Ycf2-like | e value: 4.2e-38 | % identity: 43.94
Query:  CKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDR
        CK K   +L F IGG+ L FGLREF LITGL C +I+ I+   +KGGG L+  YFE ++ V R  LN+ F ++     DD +K++ LY LESFL+PK + 
Subjt:  CKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDR

Query:  VGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTVGMGGLFYALLTWAYEVLSALSSKPRFYLEVCALVATDEELEMPYFAQFREAE
        + ++ DHI M DD E F+ YPWG VA++ LV  +  A  S G   + MGG  + +L WAYE            LEV  ++AT +E+ M +FA F E E
Subjt:  VGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTVGMGGLFYALLTWAYEVLSALSSKPRFYLEVCALVATDEELEMPYFAQFREAE

A0A5A7U047 Protein Ycf2-like | e value: 2.3e-52 | % identity: 32.64
Query:  RSRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDR
        R+   KINL  KS+++ +I  +LG  +  + R+  FG  L+ S+   SSQLLLHLIQR CK K   +L F IGG+ L FGLREF LITGL C +I  I+ 
Subjt:  RSRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDR

Query:  ASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSN
          + GGGRL+  YFE ++ V R  LN+ F ++     DD +K++ LY LESFL+PK +   ++ DHI M DD E F+ YPWG VA++ LV  +     S 
Subjt:  ASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSN

Query:  GNCTVGMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQF--------REAEL
        G   + MGG  + +L WAYEV+  LS+ P F+                                  LEV  ++AT +E+ MP+FA F        +EAE 
Subjt:  GNCTVGMGGLFYALLTWAYEVLSALSSKPRFY----------------------------------LEVCALVATDEELEMPYFAQF--------REAEL

Query:  AAR-----ERISPLVCTRPTSSYVHMGFTSAWLENSEMARR---------LDMIEETQKELNRRLSILIDVVNNIARCMQLVNGTMYDNQIIHATQTLSV
          R     + I+ +   R   S   +      +E  E++++         L+ ++  + ++N R   L+  +N I   M+      +            V
Subjt:  AAR-----ERISPLVCTRPTSSYVHMGFTSAWLENSEMARR---------LDMIEETQKELNRRLSILIDVVNNIARCMQLVNGTMYDNQIIHATQTLSV

Query:  EHDTEIKNDGKKRQEVADNHVIHKVDQHRDRD
        E D E  +      + ++  V+ K D   D+D
Subjt:  EHDTEIKNDGKKRQEVADNHVIHKVDQHRDRD

A0A5D3CNI7 TF-B3 domain-containing protein | e value: 9.3e-38 | % identity: 37.23
Query:  SRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRA
        S + +INL  K  +++ I  +L      + + +CFG  LD  V K SSQL  HLI+RQC +K   EL F + G+   FG+++F LITGLNCG++  ID +
Subjt:  SRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRA

Query:  SLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNG
         ++  G+  K YF   + ++RT L+  F    +    DV+K++ LY LE F+L K  R GI  ++  + DD E F+ YPWG ++Y+  +  ++ A  SN 
Subjt:  SLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNG

Query:  NCTVGMGGLFYALLTWAYEVLSALSSKPRFY
           +G+GG  +AL  WAYE +  L+    F+
Subjt:  NCTVGMGGLFYALLTWAYEVLSALSSKPRFY

SwissProt top hits: e value, % identity, alignment
No hits found

Arabidopsis top hits: e value, % identity, alignment
AT1G31150.1 Domain of unknown function (DUF1985) | e value: 1.9e-11 | % identity: 25.41
Query:  KINLCCKSSIMAEILTSL-GPEIAPQLRDTCFGGLLDFSVKKTS-SQLLLH-LIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRAS
        ++N+  +   +  I   L G E   +++ + FG L +F V + S S  L+H L+ RQ   KK  EL F  GG  + F +REF ++TGL CG++   D   
Subjt:  KINLCCKSSIMAEILTSL-GPEIAPQLRDTCFGGLLDFSVKKTS-SQLLLH-LIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRAS

Query:  LKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISL-LYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAY
             +    +       +   +    ++ ++  +    K+ L L  +   ++   D+  +  D + M +D++ F +YPWG  A+
Subjt:  LKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISL-LYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAY

AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases | e value: 1.5e-08 | % identity: 29.41
Query:  LNKRAVVDDVLKI--SLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSN-GNCTVGMGGLFYALLTWAYEVLSALSS
        L KR V D  +++  + L  ++ FLLP      I +DH  M++DL+ F  YPWG ++++ ++++I+   V       V + GL YAL     E + A+  
Subjt:  LNKRAVVDDVLKI--SLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSN-GNCTVGMGGLFYALLTWAYEVLSALSS

Query:  KPRFYLEVCALVATDEELE
         P    ++  +V +D + E
Subjt:  KPRFYLEVCALVATDEELE

AT4G08430.1 Ulp1 protease family protein | e value: 3.0e-04 | % identity: 25
Query:  ELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKGGGRLRKTYFEYIEVVKR-----TMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVG
        E+   I  + + F L EF  ITGLNC    + D      G    K ++  + V        T L   F+++K   ++  + +  L CL+         VG
Subjt:  ELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKGGGRLRKTYFEYIEVVKR-----TMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVG

Query:  IEEDH---------IHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTVGMGGLFYALLTWAYEVLSALSSKPRF---YLEVCALVATDEELEMPY
        +   H              D  AF KYPWG VA+  L+ ++++      +  +   G   ALL W YE +  +     F    L    L+      +   
Subjt:  IEEDH---------IHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTVGMGGLFYALLTWAYEVLSALSSKPRF---YLEVCALVATDEELEMPY

Query:  FAQFREAELAARERIS
        F  F E E AA  ++S
Subjt:  FAQFREAELAARERIS

AT5G28810.1 Domain of unknown function (DUF1985) | e value: 8.7e-04 | % identity: 23.32
Query:  KYLTFGLREFCLITGLNCGQITKIDRASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLE
        K + F L EF  ITGLNC    + D                       T L   F+++K   ++  + +  L  L   +   +    +         D  
Subjt:  KYLTFGLREFCLITGLNCGQITKIDRASLKGGGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLE

Query:  AFNKYPWGSVAYQFLVSNIRYAGVSNGNCTVGMGGLFYALLTWAYEVLSALSSKPRF---YLEVCALVATDEELEMPYFAQFREAELAARERI
        AF KYPWG VA+  L+ +++       +  +   G   ALL W YE +  +     F    L    L+      +   F  F E E AA  ++
Subjt:  AFNKYPWGSVAYQFLVSNIRYAGVSNGNCTVGMGGLFYALLTWAYEVLSALSSKPRF---YLEVCALVATDEELEMPYFAQFREAELAARERI

AT5G45570.1 Ulp1 protease family protein | e value: 8.4e-07 | % identity: 23.04
Query:  CKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKK--TSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKGGGR
        C  S + +I   LG ++  +L+ T  G  + F+      ++Q +   +  Q +     E+   I  + + F L EF  ITGLNC    + D      G  
Subjt:  CKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKK--TSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKGGGR

Query:  LRKTYFEYIEVVKR-----TMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNC
          K ++  + V        T L   F+++K   ++  + +  L  L   +   +    +         D  AF KYPWG VA+  L  +++       + 
Subjt:  LRKTYFEYIEVVKR-----TMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNC

Query:  TVGMGGLFYALLTWAYE
         +   G    LL W YE
Subjt:  TVGMGGLFYALLTWAYE


Sequences
CDS sequence
ATGACTGCCTTGCTCCGATCAAGGTCGTTCAAGATAAATTTGTGCTGCAAGAGTAGTATAATGGCTGAAATACTGACAAGTCTTGGCCCGGAAATTGCACCACAATTAAG
GGATACTTGTTTTGGTGGTTTGCTAGATTTTTCAGTGAAGAAAACATCCTCACAACTCCTATTGCACCTGATACAACGTCAGTGCAAGGCGAAAAAGTACCCCGAGCTAA
CATTCAAAATAGGTGGCAAATACTTGACATTTGGACTCCGAGAGTTCTGCCTTATAACCGGATTGAACTGTGGCCAGATAACAAAAATAGATAGGGCATCCCTAAAGGGA
GGGGGGCGGCTACGCAAGACATACTTCGAATACATTGAGGTGGTGAAACGGACAATGTTGAATTTGACATTTAAGTTAAACAAACGGGCAGTTGTAGATGATGTGCTTAA
GATATCCCTCCTATACTGCTTAGAAAGTTTCCTACTACCTAAGTATGATCGGGTTGGTATCGAGGAAGACCACATCCATATGGCAGACGACTTGGAGGCATTCAACAAAT
ACCCGTGGGGTAGTGTTGCATATCAATTTCTGGTCTCCAACATACGGTATGCTGGGGTGTCGAATGGGAATTGTACTGTGGGGATGGGGGGCCTATTCTACGCCCTCTTG
ACTTGGGCATACGAGGTATTATCGGCATTGAGTTCTAAGCCAAGATTCTACCTTGAGGTGTGTGCCTTAGTTGCAACCGATGAAGAACTAGAAATGCCATATTTTGCACA
GTTTCGAGAAGCGGAGTTGGCCGCTCGTGAGAGAATCAGCCCACTAGTTTGTACCAGGCCAACAAGCTCGTACGTTCACATGGGATTTACTTCTGCGTGGCTAGAAAATT
CTGAAATGGCCAGACGTTTGGATATGATAGAGGAAACCCAAAAGGAATTGAATCGTCGACTTTCAATTTTGATTGATGTTGTAAACAACATTGCAAGATGCATGCAACTG
GTCAACGGGACTATGTATGACAACCAGATAATTCATGCAACGCAAACTCTGTCAGTAGAACACGACACTGAAATTAAAAATGATGGAAAGAAAAGACAAGAAGTAGCTGA
CAATCATGTCATCCATAAAGTGGACCAGCATCGTGATAGGGACAATGATGAAGGGCCTATTGGTGGGAAAGGTGCCTCAAGCTCTACATTTGTGGCTCCCCCGACATTTG
TGGTAGACATGGGCTCTAGTGCTGCAACCCTACCAGCCAGCAACCCTGTGAATGAAGGAGAAACCCACCGTCTGACCTCCCCCCAGAATTTGATATGCCGTCCTTTGATC
TAA
mRNA sequence
ATGACTGCCTTGCTCCGATCAAGGTCGTTCAAGATAAATTTGTGCTGCAAGAGTAGTATAATGGCTGAAATACTGACAAGTCTTGGCCCGGAAATTGCACCACAATTAAG
GGATACTTGTTTTGGTGGTTTGCTAGATTTTTCAGTGAAGAAAACATCCTCACAACTCCTATTGCACCTGATACAACGTCAGTGCAAGGCGAAAAAGTACCCCGAGCTAA
CATTCAAAATAGGTGGCAAATACTTGACATTTGGACTCCGAGAGTTCTGCCTTATAACCGGATTGAACTGTGGCCAGATAACAAAAATAGATAGGGCATCCCTAAAGGGA
GGGGGGCGGCTACGCAAGACATACTTCGAATACATTGAGGTGGTGAAACGGACAATGTTGAATTTGACATTTAAGTTAAACAAACGGGCAGTTGTAGATGATGTGCTTAA
GATATCCCTCCTATACTGCTTAGAAAGTTTCCTACTACCTAAGTATGATCGGGTTGGTATCGAGGAAGACCACATCCATATGGCAGACGACTTGGAGGCATTCAACAAAT
ACCCGTGGGGTAGTGTTGCATATCAATTTCTGGTCTCCAACATACGGTATGCTGGGGTGTCGAATGGGAATTGTACTGTGGGGATGGGGGGCCTATTCTACGCCCTCTTG
ACTTGGGCATACGAGGTATTATCGGCATTGAGTTCTAAGCCAAGATTCTACCTTGAGGTGTGTGCCTTAGTTGCAACCGATGAAGAACTAGAAATGCCATATTTTGCACA
GTTTCGAGAAGCGGAGTTGGCCGCTCGTGAGAGAATCAGCCCACTAGTTTGTACCAGGCCAACAAGCTCGTACGTTCACATGGGATTTACTTCTGCGTGGCTAGAAAATT
CTGAAATGGCCAGACGTTTGGATATGATAGAGGAAACCCAAAAGGAATTGAATCGTCGACTTTCAATTTTGATTGATGTTGTAAACAACATTGCAAGATGCATGCAACTG
GTCAACGGGACTATGTATGACAACCAGATAATTCATGCAACGCAAACTCTGTCAGTAGAACACGACACTGAAATTAAAAATGATGGAAAGAAAAGACAAGAAGTAGCTGA
CAATCATGTCATCCATAAAGTGGACCAGCATCGTGATAGGGACAATGATGAAGGGCCTATTGGTGGGAAAGGTGCCTCAAGCTCTACATTTGTGGCTCCCCCGACATTTG
TGGTAGACATGGGCTCTAGTGCTGCAACCCTACCAGCCAGCAACCCTGTGAATGAAGGAGAAACCCACCGTCTGACCTCCCCCCAGAATTTGATATGCCGTCCTTTGATC
TAA
Protein sequence
MTALLRSRSFKINLCCKSSIMAEILTSLGPEIAPQLRDTCFGGLLDFSVKKTSSQLLLHLIQRQCKAKKYPELTFKIGGKYLTFGLREFCLITGLNCGQITKIDRASLKG
GGRLRKTYFEYIEVVKRTMLNLTFKLNKRAVVDDVLKISLLYCLESFLLPKYDRVGIEEDHIHMADDLEAFNKYPWGSVAYQFLVSNIRYAGVSNGNCTVGMGGLFYALL
TWAYEVLSALSSKPRFYLEVCALVATDEELEMPYFAQFREAELAARERISPLVCTRPTSSYVHMGFTSAWLENSEMARRLDMIEETQKELNRRLSILIDVVNNIARCMQL
VNGTMYDNQIIHATQTLSVEHDTEIKNDGKKRQEVADNHVIHKVDQHRDRDNDEGPIGGKGASSSTFVAPPTFVVDMGSSAATLPASNPVNEGETHRLTSPQNLICRPLI
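
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. The sketch below shows one way to check this on short excerpts. It assumes the standard nuclear genetic code (the record does not say which translation table was used), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# ASSUMPTION: the standard table applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 nucleotides of the CDS listed above.
print(translate("ATGACTGCCTTGCTCCGATCAAGG"))  # MTALLRSR
# Last 12 nucleotides of the CDS, including the stop codon.
print(translate("CCTTTGATCTAA"))  # PLI*
```

Both excerpts agree with the start (MTALLRSR) and end (PLI) of the protein sequence listed above. To check the whole record, join the CDS lines into one string before calling `translate` and drop the trailing `*`.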