CuGenDBv2

MC09g1045 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g1045
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Macro domain-containing protein
Genome location: MC09:16271370..16276453
RNA-Seq Expression: MC09g1045
Synteny: MC09g1045
Gene Ontology terms: NA
InterPro domains: IPR002589 - Macro domain; IPR043472 - Macro domain-like
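The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str):
    """Split a 'CHROM:START..END' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("MC09:16271370..16276453")
print(chrom, start, end, length)  # MC09 16271370 16276453 5084
```

Under that assumption the gene spans 5084 bp on MC09, which is longer than the mRNA listed further down and is consistent with the locus containing introns.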


Homology
GenBank top hits (e-value, % identity)
KAG6600880.1 hypothetical protein SDJN03_06113, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 6.57e-129 | identity: 81.62%
Query:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA
        +  S+F RVS  Q +S T S N S S   +RS A+ILA +MSNG+ +GVVRFK+SPST  VIQKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIHNA
Subjt:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA

Query:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST
        AGPDLVQACY+V EVQPGIRCPTGEARITPGF LPASHVIHTVGPIY  SSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGV+RYP+DEAATIA+ST
Subjt:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST

Query:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
        +KEFS  LKEVHFVL+SSDIY+VWL KANELLKN
Subjt:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN

KAG7031514.1 hypothetical protein SDJN02_05554 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 2.67e-128 | identity: 81.2%
Query:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA
        +  S+F RVS  Q +S T S N S S   ++S A+ILA +MSNG+ +GVVRFK+SPST  VIQKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIHNA
Subjt:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA

Query:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST
        AGPDLVQACY+V EVQPGIRCPTGEARITPGF LPASHVIHTVGPIY  SSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGV+RYP+DEAATIA+ST
Subjt:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST

Query:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
        +KEFS  LKEVHFVL+SSDIY+VWL KANELLKN
Subjt:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN

XP_022146875.1 uncharacterized protein LOC111015968 [Momordica charantia] | e-value: 1.95e-167 | identity: 100%
Query:  MACLKIAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAI
        MACLKIAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAI
Subjt:  MACLKIAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAI

Query:  HNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA
        HNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA
Subjt:  HNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA

Query:  ISTIKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
        ISTIKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
Subjt:  ISTIKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN

XP_022943121.1 uncharacterized protein LOC111447945 [Cucurbita moschata] | e-value: 1.62e-129 | identity: 82.05%
Query:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA
        +  S+F RVS  Q +S T S N S S   +RS A+ILA +MSNG+G+GVVRFK+SPST  VIQKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIHNA
Subjt:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA

Query:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST
        AGPDLVQACY+V EVQPGIRCPTGEARITPGF LPASHVIHTVGPIY  SSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGV+RYP+DEAATIA+ST
Subjt:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST

Query:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
        +KEFS  LKEVHFVL+SSDIY+VWL KANELLKN
Subjt:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN

XP_038891619.1 macro domain-containing protein VPA0103, partial [Benincasa hispida] | e-value: 4.33e-129 | identity: 83.48%
Query:  VSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACY
        +SQ +S T S N S S HF+RS AR LA SM+N +G+GVVRFK+SPST  VIQKGDIT WFIDGSSDAIVNPAN+VMLGGGGADGAIHNAAGPDLVQACY
Subjt:  VSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACY

Query:  AVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKE
        +V EVQPGIRCPTGEARITPGF LPASHVIHTVGPIY  SSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAAT+A+ST+KEFS+ LKE
Subjt:  AVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKE

Query:  VHFVLFSSDIYDVWLNKANELLKN
        VHFVL++SDIY+VWL+KAN+LLKN
Subjt:  VHFVLFSSDIYDVWLNKANELLKN

TrEMBL top hits (e-value, % identity)
A0A1S3C4C8 macro domain-containing protein VPA0103 isoform X2 | e-value: 1.54e-125 | identity: 83.18%
Query:  KSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPE
        +S T S N S S HF RS  R  A SM+N + +GVV FK+SPSTD VIQKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLV+ACY+V E
Subjt:  KSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPE

Query:  VQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEVHFV
        VQPGIRCPTGEARITPGF LPASHVIHTVGPIY  S NPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA+STIKEFS+ LKEVHFV
Subjt:  VQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEVHFV

Query:  LFSSDIYDVWLNKANELLKN
        L++ DIY+VWL+KANELLKN
Subjt:  LFSSDIYDVWLNKANELLKN

A0A1S3C4G0 macro domain-containing protein VPA0103 isoform X1 | e-value: 5.01e-123 | identity: 81.33%
Query:  KSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPE
        +S T S N S S HF RS  R  A SM+N + +GVV FK+SPSTD VIQKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLV+ACY+V E
Subjt:  KSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPE

Query:  VQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEV---
        VQPGIRCPTGEARITPGF LPASHVIHTVGPIY  S NPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA+STIKEFS+ LKEV   
Subjt:  VQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEV---

Query:  --HFVLFSSDIYDVWLNKANELLKN
          HFVL++ DIY+VWL+KANELLKN
Subjt:  --HFVLFSSDIYDVWLNKANELLKN

A0A6J1CYJ5 uncharacterized protein LOC111015968 | e-value: 9.45e-168 | identity: 100%
Query:  MACLKIAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAI
        MACLKIAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAI
Subjt:  MACLKIAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAI

Query:  HNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA
        HNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA
Subjt:  HNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIA

Query:  ISTIKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
        ISTIKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
Subjt:  ISTIKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN

A0A6J1FQU9 uncharacterized protein LOC111447945 | e-value: 7.83e-130 | identity: 82.05%
Query:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA
        +  S+F RVS  Q +S T S N S S   +RS A+ILA +MSNG+G+GVVRFK+SPST  VIQKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIHNA
Subjt:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA

Query:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST
        AGPDLVQACY+V EVQPGIRCPTGEARITPGF LPASHVIHTVGPIY  SSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGV+RYP+DEAATIA+ST
Subjt:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST

Query:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
        +KEFS  LKEVHFVL+SSDIY+VWL KANELLKN
Subjt:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN

A0A6J1JJ63 uncharacterized protein LOC111484918 | e-value: 1.84e-128 | identity: 81.2%
Query:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA
        +  S+F RVS  Q +S T S N S S   +RS A+ILA +MSNG+G+GVVRFK+SPST  VIQKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIHNA
Subjt:  IAGSLFSRVS--QPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNA

Query:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST
        AGPDLVQACY+V EVQPGIRCPTGEARITPGF LPASHVIHTVGPIY  +SNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGV+RYP+DEAATIA+ST
Subjt:  AGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAIST

Query:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN
        +KEFS  LKEVHFVL+SSDIY+VWL  ANELLKN
Subjt:  IKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN

SwissProt top hits (e-value, % identity)
Q87JZ5 Macro domain-containing protein VPA0103 | e-value: 8.2e-38 | identity: 52.17%
Query:  KGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRN
        +GDIT   +    DAIVN AN  MLGGGG DGAIH AAGP L+ ACYAV +V  GIRCP G+ARIT    L A +VIH VGPIY K ++P+ +L SAY+ 
Subjt:  KGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRN

Query:  SLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEVHFVLFSSDIYDVW
        SL +A  N+ Q +A PAISCGV+ YP  EAA +A++  +    A  ++ F LFS ++  +W
Subjt:  SLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEVHFVLFSSDIYDVW

Q8KAE4 Macro domain-containing protein CT2219 | e-value: 3.9e-32 | identity: 48.48%
Query:  KGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQA-LLRSAYR
        K DIT   +    DAIVN AN  +LGGGG DGAIH AAGP L++AC  +        C TGEA+IT G+ LPA+ VIHTVGP+++  ++ +A LL S YR
Subjt:  KGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQA-LLRSAYR

Query:  NSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEF---SRALKEVHFVLFSSDIYDVW
        NSL +A E++ + IAFP+IS G++ YP ++AA IAI+T++E     R +++V F  FS    DV+
Subjt:  NSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEF---SRALKEVHFVLFSSDIYDVW

Q8P5Z8 Macro domain-containing protein XCC3184 | e-value: 1.3e-35 | identity: 50.64%
Query:  IQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSS-NPQALLRSA
        + +GDIT   +    D IVN ANE +LGGGG DGAIH AAGP L++AC A+PEV+PG+RCPTGE RIT GF L A H+ HTVGP++     N    L + 
Subjt:  IQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSS-NPQALLRSA

Query:  YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALK-EVHFVL
        Y  SL +A++  +  IAFPAISCG++ YP  +AA IA++  +++ R+ K   H VL
Subjt:  YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALK-EVHFVL

Q8PHB6 Macro domain-containing protein XAC3343 | e-value: 2.2e-35 | identity: 50%
Query:  IQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKS-SNPQALLRSA
        + +GDIT   +    D IVN ANE +LGGGG DGAIH AAGP L++AC A+P+V+PG+RCPTGE RIT GF L A H+ HTVGP++     N    L + 
Subjt:  IQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKS-SNPQALLRSA

Query:  YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALK-EVHFVL
        Y  SL +A++  +  IAFPAISCG++ YP  +AA IA++  +++ R+ K   H VL
Subjt:  YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALK-EVHFVL

Q8Y2K1 Macro domain-containing protein RSc0334 | e-value: 2.1e-33 | identity: 50%
Query:  DAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQ-ALLRSAYRNSLAVAKENNIQY
        DAIVN AN  +LGGGG DGAIH AAGP+L++AC A+        C TG+A+ITPGF LPA ++IHTVGPI+      + ALL + YRNSLA+AK+++++ 
Subjt:  DAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQ-ALLRSAYRNSLAVAKENNIQY

Query:  IAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEVHFVLFSS---DIYDVWLNKA
        IAFP IS GV+ +P   AA IA+ T++E    L ++ F  FS+    +Y+  LN+A
Subjt:  IAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEVHFVLFSS---DIYDVWLNKA

Arabidopsis top hits (e-value, % identity)
AT1G69340.1 appr-1-p processing enzyme family protein | e-value: 4.2e-13 | identity: 31.88%
Query:  FSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNG-AGNGVV-RFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDL
        +S V Q  SL          H     A  LA+S   G +GNG+V +F +    +  I       W ++   DA+VN  NE +     + G +H AAGP L
Subjt:  FSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNG-AGNGVV-RFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDL

Query:  VQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQA--LLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKE
         + C  +        C TG A++T  + LPA  VIHTVGP Y    +  A   L   YR+ L +  ++ +Q IA   I      YP + AA +AI T++ 
Subjt:  VQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQA--LLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKE

Query:  FSRALKE
        F    K+
Subjt:  FSRALKE

AT2G40600.1 appr-1-p processing enzyme family protein | e-value: 5.6e-74 | identity: 62.45%
Query:  SRVSQPKSLTFSPN----FSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPD
        S  S+   ++F+ N     +SSS    SR   +++SM++G    V  F +S S+   I KGDIT W +D SSDAIVNPANE MLGGGGADGAIH AAGP 
Subjt:  SRVSQPKSLTFSPN----FSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPD

Query:  LVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEF
        L  ACY VPEV+PG+RCPTGEARITPGF LPAS VIHTVGPIY    NPQ  L ++Y+NSL VAKENNI+YIAFPAISCG++ YP+DEAA I ISTIK+F
Subjt:  LVQACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEF

Query:  SRALKEVHFVLFSSDIYDVWLNKANELLK
        S   KEVHFVLF+ DI+ VW+NKA E+L+
Subjt:  SRALKEVHFVLFSSDIYDVWLNKANELLK


Sequences
CDS sequence
ATGGCTTGTTTGAAAATTGCAGGTTCGCTGTTCAGCAGAGTATCACAACCAAAATCCCTCACCTTCTCTCCAAATTTCTCGTCTTCTTCTCATTTCCAGCGATCAAGAGC
AAGAATTTTGGCCGCCTCAATGTCGAATGGAGCGGGCAATGGAGTGGTTCGCTTCAAAATCTCTCCCTCAACCGATTTCGTTATTCAGAAGGGGGATATCACGCACTGGT
TCATCGACGGTTCCTCTGATGCCATTGTTAATCCAGCAAATGAAGTAATGCTTGGAGGTGGTGGTGCTGATGGAGCCATACACAATGCTGCTGGTCCAGATCTCGTACAA
GCATGCTATGCTGTCCCAGAAGTCCAGCCTGGAATCCGTTGTCCAACTGGAGAAGCAAGGATTACTCCAGGTTTTGGGTTGCCCGCATCTCATGTTATACATACTGTTGG
ACCCATCTATTACAAGAGTAGTAACCCCCAGGCTTTACTGAGAAGTGCATATAGAAATTCCTTGGCTGTGGCAAAGGAGAATAACATTCAATATATTGCCTTTCCTGCCA
TATCCTGTGGTGTGTTTAGATATCCTTATGATGAAGCTGCCACAATAGCCATATCTACCATTAAAGAGTTCTCCAGGGCCTTGAAAGAGGTGCACTTTGTTCTTTTTTCA
TCTGATATTTATGATGTTTGGTTGAACAAGGCAAATGAATTGCTCAAGAACTAG
mRNA sequence
GATAAAGTGGATGATAAAACTGTTCCTAGACTAATAGTAATAAAATGATAAATTACACGTGATATTCATGAGTCCTACTAAATACATCAAAATCAAAACGTGGTACAACG
AGCTGGTAATTGATATATATCTCGGAACTCCTTCGAAAACAGAAATCGGACTTCAATCGAAAATAGAATTTGTAGTTTTTCTTTTAATATTTTCAGATATATGTTCAGCG
TCATATTTGAAAATTGAATGAATGGCTTGTTTGAAAATTGCAGGTTCGCTGTTCAGCAGAGTATCACAACCAAAATCCCTCACCTTCTCTCCAAATTTCTCGTCTTCTTC
TCATTTCCAGCGATCAAGAGCAAGAATTTTGGCCGCCTCAATGTCGAATGGAGCGGGCAATGGAGTGGTTCGCTTCAAAATCTCTCCCTCAACCGATTTCGTTATTCAGA
AGGGGGATATCACGCACTGGTTCATCGACGGTTCCTCTGATGCCATTGTTAATCCAGCAAATGAAGTAATGCTTGGAGGTGGTGGTGCTGATGGAGCCATACACAATGCT
GCTGGTCCAGATCTCGTACAAGCATGCTATGCTGTCCCAGAAGTCCAGCCTGGAATCCGTTGTCCAACTGGAGAAGCAAGGATTACTCCAGGTTTTGGGTTGCCCGCATC
TCATGTTATACATACTGTTGGACCCATCTATTACAAGAGTAGTAACCCCCAGGCTTTACTGAGAAGTGCATATAGAAATTCCTTGGCTGTGGCAAAGGAGAATAACATTC
AATATATTGCCTTTCCTGCCATATCCTGTGGTGTGTTTAGATATCCTTATGATGAAGCTGCCACAATAGCCATATCTACCATTAAAGAGTTCTCCAGGGCCTTGAAAGAG
GTGCACTTTGTTCTTTTTTCATCTGATATTTATGATGTTTGGTTGAACAAGGCAAATGAATTGCTCAAGAACTAGCACAGGTTGCAACGGTATCATTCACCTTTACTAAC
TAAAAGTTCTAGTGTTGTATGGATCTGAACATAGTTATGCTTTCTTATTAAGTACAAATTCTCCGAAGGGGTTATTTAAAATAAACTTGGGATCCTTTCTCCAAAGAGTT
TTCTTTCGAGTTTGGCAAATGAGCCCATTCGTAGAAAAACTATTGTTATGGCTATGTGGAACTTGACATATCAATGTTATGGCTATGTGGAACTTGACATATCGATGTTA
ATTAATATTTTGACTATTTGAG
Protein sequence
MACLKIAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVIQKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQ
ACYAVPEVQPGIRCPTGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIAISTIKEFSRALKEVHFVLFS
SDIYDVWLNKANELLKN
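
The CDS and protein records above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is one way to do that, assuming the standard nuclear code applies and that the CDS lines are joined into one string before use; `translate` is a hypothetical helper written for this illustration, and only the first and last few codons of the CDS are shown here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGCTTGTTTGAAAATTGCAGGTTCGCTG"))  # MACLKIAGSL
# Last three codons, ending in the TAG stop.
print(translate("AAGAACTAG"))  # KN
```

Both fragments agree with the start (MACLKIAGSL) and end (KN) of the protein sequence listed above. Pasting the full CDS into `translate` would allow the same comparison over the entire record.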