CuGenDBv2

MS001381 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001381
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: protein AF-9
Genome location: scaffold36:3731288..3732505
RNA-Seq Expression: MS001381
Synteny: MS001381
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6599121.1 Copper transporter 5, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.4e-82; % identity: 54.9)
Query:  PHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRA
        P +SGA IRSLVK L+TK+  N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HRA
Subjt:  PHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRA

Query:  AMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-ADN-
        AMKKA      + + QPQTP + +PAR  E +I+PRKI KS     +         N   NN+    N  N  Y    + SFP WS   +NSQS+  DN 
Subjt:  AMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-ADN-

Query:  LDIALPEQTLGLNLNFHDFSNLDANLFAN-------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV
        + IALPEQTLGLNLN  DF+NL+ NLF+N       G+ S SSSSSSPSLSIA DQEAP+  S      ESN                  GGG G GLHV
Subjt:  LDIALPEQTLGLNLNFHDFSNLDANLFAN-------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV

Query:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ
        AVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWMNNGNE CF+EQLLNDYCS+D+ 
Subjt:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ

Query:  FFHDPALP
             ALP
Subjt:  FFHDPALP

XP_008464887.1 PREDICTED: protein AF-9 [Cucumis melo] (e value: 9.6e-84; % identity: 54.36)
Query:  MSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRAAM
        +S  YI +++K  + K P+NP SSSSSSSS++ S  +  +S ++ SKMA       P Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HRAAM
Subjt:  MSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRAAM

Query:  KKAAAVAGEQPQQQPQTPPE--TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNF-AYDSNQ-KNFCY-PS---FPSNSFP-WSPFQINSQSV
        KKAAA A       PQ+P E  +P R QEG+IKPRKIPKS      + E        P++NF  Y++N  KN CY PS     + S P WS  + N+  V
Subjt:  KKAAAVAGEQPQQQPQTPPE--TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNF-AYDSNQ-KNFCY-PS---FPSNSFP-WSPFQINSQSV

Query:  ADNLDIALPEQTLGLNLNFHDFSNLDANLFANGTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHVAVGE
            DI LPEQTLGLNLN  DF NLDANLF+N +VS S S+S+   SI  DQE                                  GGGGGG+HVAVGE
Subjt:  ADNLDIALPEQTLGLNLNFHDFSNLDANLFANGTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHVAVGE

Query:  EEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQ--LLNDYCSHDYQFF
        EEMAE+R+IGEKHEMEWSDKM++VKSAWWLRFMKMGK +EEE++E+ + G G G     +  PFDQILEFPDWMNNGNE CFEE+  +LNDY     QFF
Subjt:  EEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQ--LLNDYCSHDYQFF

Query:  H
        H
Subjt:  H

XP_022999732.1 uncharacterized protein LOC111493992 isoform X1 [Cucurbita maxima] (e value: 2.3e-85; % identity: 55.64)
Query:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR
        +P +SGA IRSLVK L+TK+ +N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HR
Subjt:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR

Query:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-
        AAMKKA      + Q QPQTP + +PAR QE +I+PRKI K     SSS E+  +   P N+  N+  Y + +    Y    + SFP WS   +NSQS+ 
Subjt:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-

Query:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV
         DN + I LPEQTLGLNLN  DF+NL+ NLF+NG   +VS S S SSPSLSIA DQEAP+  S      ESN                  GGG GGGLHV
Subjt:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV

Query:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ
        AVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWMNNGNE CF+EQLLNDYCS+D+ 
Subjt:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ

Query:  FFHDPALP
             ALP
Subjt:  FFHDPALP

XP_022999733.1 uncharacterized protein LOC111493992 isoform X2 [Cucurbita maxima] (e value: 2.3e-85; % identity: 55.64)
Query:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR
        +P +SGA IRSLVK L+TK+ +N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HR
Subjt:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR

Query:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-
        AAMKKA      + Q QPQTP + +PAR QE +I+PRKI K     SSS E+  +   P N+  N+  Y + +    Y    + SFP WS   +NSQS+ 
Subjt:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-

Query:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV
         DN + I LPEQTLGLNLN  DF+NL+ NLF+NG   +VS S S SSPSLSIA DQEAP+  S      ESN                  GGG GGGLHV
Subjt:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV

Query:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ
        AVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWMNNGNE CF+EQLLNDYCS+D+ 
Subjt:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ

Query:  FFHDPALP
             ALP
Subjt:  FFHDPALP

XP_023522708.1 uncharacterized protein LOC111786710 [Cucurbita pepo subsp. pepo] (e value: 9.6e-84; % identity: 54.5)
Query:  PHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRA
        P +SGA IRSLVK L+TK+ +N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HRA
Subjt:  PHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRA

Query:  AMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-ADN-
        AMKKA      + + QPQTP + +PAR  E +I+PRKI KS     +         N   NN+    N  N  Y    + SFP WS   +NSQS+  DN 
Subjt:  AMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-ADN-

Query:  LDIALPEQTLGLNLNFHDFSNLDANLFAN---------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGL
        + I LPEQTLGLNLN  DF+NL+ NLF+N         G+ S SSSSSSPSLSI  DQEAP+  S      ESN                  GGG GGGL
Subjt:  LDIALPEQTLGLNLNFHDFSNLDANLFAN---------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGL

Query:  HVAVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHD
        HVAVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWMNNGNE CF+EQLLNDYCS+D
Subjt:  HVAVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHD

Query:  YQFFHDPALPW
        +      ALPW
Subjt:  YQFFHDPALPW

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CML6 protein AF-9 (e value: 4.6e-84; % identity: 54.36)
Query:  MSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRAAM
        +S  YI +++K  + K P+NP SSSSSSSS++ S  +  +S ++ SKMA       P Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HRAAM
Subjt:  MSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRAAM

Query:  KKAAAVAGEQPQQQPQTPPE--TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNF-AYDSNQ-KNFCY-PS---FPSNSFP-WSPFQINSQSV
        KKAAA A       PQ+P E  +P R QEG+IKPRKIPKS      + E        P++NF  Y++N  KN CY PS     + S P WS  + N+  V
Subjt:  KKAAAVAGEQPQQQPQTPPE--TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNF-AYDSNQ-KNFCY-PS---FPSNSFP-WSPFQINSQSV

Query:  ADNLDIALPEQTLGLNLNFHDFSNLDANLFANGTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHVAVGE
            DI LPEQTLGLNLN  DF NLDANLF+N +VS S S+S+   SI  DQE                                  GGGGGG+HVAVGE
Subjt:  ADNLDIALPEQTLGLNLNFHDFSNLDANLFANGTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHVAVGE

Query:  EEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQ--LLNDYCSHDYQFF
        EEMAE+R+IGEKHEMEWSDKM++VKSAWWLRFMKMGK +EEE++E+ + G G G     +  PFDQILEFPDWMNNGNE CFEE+  +LNDY     QFF
Subjt:  EEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQ--LLNDYCSHDYQFF

Query:  H
        H
Subjt:  H

A0A6J1G4K1 uncharacterized protein LOC111450745 isoform X1 (e value: 8.7e-83; % identity: 53.98)
Query:  DSDPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKF
        +S P +SGA IRSLVK L+TK+  N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+
Subjt:  DSDPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKF

Query:  HRAAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-A
        HRAAMKKA      + + QPQTP + +PAR  E +I+PRKI KS     +         N   NN+    N  N  Y    + SFP WS   +NSQS+  
Subjt:  HRAAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-A

Query:  DN-LDIALPEQTLGLNLNFHDFSNLDANLFAN-----------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGG
        DN + IALPEQTLGLNLN  DF+NL+ NLF+N           G+ S SSSSSSPSLSIA D EAP+  S      ESN                  GGG
Subjt:  DN-LDIALPEQTLGLNLNFHDFSNLDANLFAN-----------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGG

Query:  GGGGLHVAVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLND
         GGGLHVAVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWM+NGNE CF+EQLLND
Subjt:  GGGGLHVAVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLND

Query:  YCSHDYQFFHDPALP
        YCS+D+      ALP
Subjt:  YCSHDYQFFHDPALP

A0A6J1G4N1 uncharacterized protein LOC111450745 isoform X2 (e value: 8.7e-83; % identity: 53.98)
Query:  DSDPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKF
        +S P +SGA IRSLVK L+TK+  N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+
Subjt:  DSDPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKF

Query:  HRAAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-A
        HRAAMKKA      + + QPQTP + +PAR  E +I+PRKI KS     +         N   NN+    N  N  Y    + SFP WS   +NSQS+  
Subjt:  HRAAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-A

Query:  DN-LDIALPEQTLGLNLNFHDFSNLDANLFAN-----------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGG
        DN + IALPEQTLGLNLN  DF+NL+ NLF+N           G+ S SSSSSSPSLSIA D EAP+  S      ESN                  GGG
Subjt:  DN-LDIALPEQTLGLNLNFHDFSNLDANLFAN-----------GTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGG

Query:  GGGGLHVAVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLND
         GGGLHVAVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWM+NGNE CF+EQLLND
Subjt:  GGGGLHVAVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLND

Query:  YCSHDYQFFHDPALP
        YCS+D+      ALP
Subjt:  YCSHDYQFFHDPALP

A0A6J1KBM1 uncharacterized protein LOC111493992 isoform X1 (e value: 1.1e-85; % identity: 55.64)
Query:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR
        +P +SGA IRSLVK L+TK+ +N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HR
Subjt:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR

Query:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-
        AAMKKA      + Q QPQTP + +PAR QE +I+PRKI K     SSS E+  +   P N+  N+  Y + +    Y    + SFP WS   +NSQS+ 
Subjt:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-

Query:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV
         DN + I LPEQTLGLNLN  DF+NL+ NLF+NG   +VS S S SSPSLSIA DQEAP+  S      ESN                  GGG GGGLHV
Subjt:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV

Query:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ
        AVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWMNNGNE CF+EQLLNDYCS+D+ 
Subjt:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ

Query:  FFHDPALP
             ALP
Subjt:  FFHDPALP

A0A6J1KKK2 uncharacterized protein LOC111493992 isoform X2 (e value: 1.1e-85; % identity: 55.64)
Query:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR
        +P +SGA IRSLVK L+TK+ +N                        PSKMA   + P P   HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK+HR
Subjt:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR

Query:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-
        AAMKKA      + Q QPQTP + +PAR QE +I+PRKI K     SSS E+  +   P N+  N+  Y + +    Y    + SFP WS   +NSQS+ 
Subjt:  AAMKKAAAVAGEQPQQQPQTPPE-TPARCQEGRIKPRKIPKSYQAPSSSTEKIII---PANYPRNNFAYDSNQKNFCYPSFPSNSFP-WSPFQINSQSV-

Query:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV
         DN + I LPEQTLGLNLN  DF+NL+ NLF+NG   +VS S S SSPSLSIA DQEAP+  S      ESN                  GGG GGGLHV
Subjt:  ADN-LDIALPEQTLGLNLNFHDFSNLDANLFANG---TVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHV

Query:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ
        AVGEEEMAEIRSIG+KHEMEWSDKMNLVKSAWWLRFMK+GK EEE             G GF F   FDQILEFPDWMNNGNE CF+EQLLNDYCS+D+ 
Subjt:  AVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWWLRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQ

Query:  FFHDPALP
             ALP
Subjt:  FFHDPALP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G21280.1 hydroxyproline-rich glycoprotein family protein (e value: 3.5e-15; % identity: 30.65)
Query:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR
        +P +S  YIRSLVKQ      +  +++ +++++   +D              G+       Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK HR
Subjt:  DPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHR

Query:  AAMKKAAAVAGEQPQQQPQTPPETPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFPWSPFQINSQSVADNLDI
        A+M++A  +    P  QP  PP+                                   P N F+          P  P + F W+           +L+ 
Subjt:  AAMKKAAAVAGEQPQQQPQTPPETPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFPWSPFQINSQSVADNLDI

Query:  ALPEQTLGLNLNFHDFSNLDANLFANGTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHVAVGEEEMAEI
         LP Q LGLNLNF DF++    +  + T S+SSSSS+ S S +I    PH  S+ S    +   A S S        +G                     
Subjt:  ALPEQTLGLNLNFHDFSNLDANLFANGTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHVAVGEEEMAEI

Query:  RSIGEKHEMEWSDKMNLVKSAWWLRFM-KMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQFFHDPAL
                     + N+V SAWW   M K  + E + E EE         E   F   F  ++EFP W+N   E  F    L D+ S      H+P L
Subjt:  RSIGEKHEMEWSDKMNLVKSAWWLRFM-KMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQFFHDPAL


Sequences
CDS sequence
CATAAGCTCAAGGAAGATTCAGACCCTCACATGTCCGGCGCTTACATCCGTAGCCTGGTCAAACAATTGAGAACAAAGGACCCTATGAATCCCAATTCCTCTTCCTCTTC
TTCCTCCTCTGCTGCAGATTCCGATGCTGCACCACCGACCTCCGGCAACGCTCCCTCTAAAATGGCCGGAAATCACAGAGCTCCTCCGCCGATTCAACACCACAAGAAGC
AAGTCCGGAGACGCCTCCACACCACTCGCCCTTATCAGGAGCGCCTCTTGAATATGGCCGAGGCGCGCCGGGAGATTGTCACGGCGCTTAAGTTTCACCGTGCCGCTATG
AAAAAGGCCGCCGCCGTCGCCGGTGAGCAACCGCAGCAACAACCGCAGACGCCGCCGGAGACTCCGGCCAGATGCCAGGAAGGGAGGATCAAACCGAGAAAAATCCCCAA
ATCGTATCAAGCGCCATCGTCGAGCACTGAGAAAATTATAATTCCCGCTAATTATCCCCGAAATAATTTCGCCTATGATTCGAATCAGAAGAATTTCTGTTATCCGTCGT
TTCCTTCGAATTCGTTCCCCTGGTCGCCGTTCCAAATCAATTCGCAATCCGTTGCGGATAATCTCGACATTGCACTGCCAGAGCAAACTCTAGGGCTGAATCTGAACTTC
CACGATTTCAGCAATTTGGACGCGAATCTCTTCGCGAACGGCACCGTTTCGGCTTCGTCTTCATCGTCGTCCCCCAGCCTTTCGATTGCGATCGATCAGGAAGCTCCGCA
TTCGATATCGGCCGCATCGGATGCAATAGAGTCGAATATCGGAGCGAACAGTGGGAGCGGTGGCGGTGGCGGTGGCGGTGGCGATGGCGGAGGAGGAGGAGGAGGAGGAG
GGCTACACGTGGCGGTTGGGGAGGAGGAGATGGCGGAGATAAGATCGATAGGGGAGAAACACGAGATGGAATGGAGCGACAAGATGAATTTGGTGAAATCGGCGTGGTGG
TTGAGGTTCATGAAGATGGGGAAGGCGGAGGAGGAGGAGGAGGAGGAAGAAGGAGCCGGCGGGGGCGGCGGAGGAGGAGAAGGATTTCAGTTTCGCCATCCGTTCGATCA
AATCTTGGAGTTTCCAGATTGGATGAACAATGGAAACGAGAGGTGTTTCGAAGAACAATTGTTGAATGATTATTGCTCCCACGATTACCAGTTCTTCCATGATCCTGCCT
TACCATGG
mRNA sequence
CATAAGCTCAAGGAAGATTCAGACCCTCACATGTCCGGCGCTTACATCCGTAGCCTGGTCAAACAATTGAGAACAAAGGACCCTATGAATCCCAATTCCTCTTCCTCTTC
TTCCTCCTCTGCTGCAGATTCCGATGCTGCACCACCGACCTCCGGCAACGCTCCCTCTAAAATGGCCGGAAATCACAGAGCTCCTCCGCCGATTCAACACCACAAGAAGC
AAGTCCGGAGACGCCTCCACACCACTCGCCCTTATCAGGAGCGCCTCTTGAATATGGCCGAGGCGCGCCGGGAGATTGTCACGGCGCTTAAGTTTCACCGTGCCGCTATG
AAAAAGGCCGCCGCCGTCGCCGGTGAGCAACCGCAGCAACAACCGCAGACGCCGCCGGAGACTCCGGCCAGATGCCAGGAAGGGAGGATCAAACCGAGAAAAATCCCCAA
ATCGTATCAAGCGCCATCGTCGAGCACTGAGAAAATTATAATTCCCGCTAATTATCCCCGAAATAATTTCGCCTATGATTCGAATCAGAAGAATTTCTGTTATCCGTCGT
TTCCTTCGAATTCGTTCCCCTGGTCGCCGTTCCAAATCAATTCGCAATCCGTTGCGGATAATCTCGACATTGCACTGCCAGAGCAAACTCTAGGGCTGAATCTGAACTTC
CACGATTTCAGCAATTTGGACGCGAATCTCTTCGCGAACGGCACCGTTTCGGCTTCGTCTTCATCGTCGTCCCCCAGCCTTTCGATTGCGATCGATCAGGAAGCTCCGCA
TTCGATATCGGCCGCATCGGATGCAATAGAGTCGAATATCGGAGCGAACAGTGGGAGCGGTGGCGGTGGCGGTGGCGGTGGCGATGGCGGAGGAGGAGGAGGAGGAGGAG
GGCTACACGTGGCGGTTGGGGAGGAGGAGATGGCGGAGATAAGATCGATAGGGGAGAAACACGAGATGGAATGGAGCGACAAGATGAATTTGGTGAAATCGGCGTGGTGG
TTGAGGTTCATGAAGATGGGGAAGGCGGAGGAGGAGGAGGAGGAGGAAGAAGGAGCCGGCGGGGGCGGCGGAGGAGGAGAAGGATTTCAGTTTCGCCATCCGTTCGATCA
AATCTTGGAGTTTCCAGATTGGATGAACAATGGAAACGAGAGGTGTTTCGAAGAACAATTGTTGAATGATTATTGCTCCCACGATTACCAGTTCTTCCATGATCCTGCCT
TACCATGG
Protein sequence
HKLKEDSDPHMSGAYIRSLVKQLRTKDPMNPNSSSSSSSSAADSDAAPPTSGNAPSKMAGNHRAPPPIQHHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKFHRAAM
KKAAAVAGEQPQQQPQTPPETPARCQEGRIKPRKIPKSYQAPSSSTEKIIIPANYPRNNFAYDSNQKNFCYPSFPSNSFPWSPFQINSQSVADNLDIALPEQTLGLNLNF
HDFSNLDANLFANGTVSASSSSSSPSLSIAIDQEAPHSISAASDAIESNIGANSGSGGGGGGGGDGGGGGGGGGLHVAVGEEEMAEIRSIGEKHEMEWSDKMNLVKSAWW
LRFMKMGKAEEEEEEEEGAGGGGGGGEGFQFRHPFDQILEFPDWMNNGNERCFEEQLLNDYCSHDYQFFHDPALPW