CuGenDBv2

Sgr018597 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018597
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: ENTH domain-containing protein
Genome location: tig00153206:680784..687314
RNA-Seq Expression: Sgr018597
Synteny: Sgr018597
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
KAG7031470.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.8e-09; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

PON62750.1 AP180 N-terminal domain containing protein [Parasponia andersonii] (e value: 1.1e-09; %identity: 43.97)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        QGYSRTRELDSEE+LEHLP           C PEGAA+GNY          +  AL                               VLKES KI CA+N
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

XP_022942756.1 putative clathrin assembly protein At2g01600 isoform X1 [Cucurbita moschata] (e value: 1.8e-09; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

XP_022986104.1 putative clathrin assembly protein At2g01600 [Cucurbita maxima] (e value: 1.8e-09; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

XP_022986601.1 putative clathrin assembly protein At2g01600 isoform X1 [Cucurbita maxima] (e value: 1.8e-09; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KIT4 ENTH domain-containing protein (e value: 8.7e-10; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

A0A2P5CNX3 AP180 N-terminal domain containing protein (e value: 5.1e-10; %identity: 43.97)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        QGYSRTRELDSEE+LEHLP           C PEGAA+GNY          +  AL                               VLKES KI CA+N
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

A0A6J1JBM7 putative clathrin assembly protein At2g01600 isoform X1 (e value: 8.7e-10; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

A0A6J1JF55 putative clathrin assembly protein At2g01600 (e value: 8.7e-10; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

A0A6J1JGI2 putative clathrin assembly protein At2g01600 isoform X2 (e value: 8.7e-10; %identity: 44.83)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTRELDSEE+LEHLP           C PEGAAIGNY          +  AL                               VLKES KI CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        DGIINLVDK    PRH
Subjt:  DGIINLVDK----PRH

SwissProt top hits (e value, %identity, alignment)
P94017 Putative clathrin assembly protein At1g14910 (e value: 1.9e-06; %identity: 36.21)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYS+TR+LD E++LE LP           C PEGAA  N+           II                               S VLKES K+ CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        +GIINLV+K    PRH
Subjt:  DGIINLVDK----PRH

Q8LBH2 Putative clathrin assembly protein At2g01600 (e value: 2.3e-07; %identity: 37.61)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTR+LD EE+LE LP           C PEGAA  N+          +  AL                               VLKES K+ CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK
        DGIINL+DK
Subjt:  DGIINLVDK

Q8VYT2 Putative clathrin assembly protein At4g25940 (e value: 3.1e-04; %identity: 36.52)
Query:  NLGFFHWQGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESC
        N+ F   Q Y RTR L  EE+LE LP           C PEG+A  NY          +  AL                               VLKES 
Subjt:  NLGFFHWQGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESC

Query:  KIGCAINDGIINLVD
        KI CAINDGIINLVD
Subjt:  KIGCAINDGIINLVD

Q9LVD8 Putative clathrin assembly protein At5g57200 (e value: 4.0e-04; %identity: 36.54)
Query:  RTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAINDGII
        RTR L  E++LE LP           C PEGAA  NY          +  AL                               VLKES KI CAINDGII
Subjt:  RTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAINDGII

Query:  NLVD
        NLVD
Subjt:  NLVD

Arabidopsis top hits (e value, %identity, alignment)
AT1G14910.1 ENTH/ANTH/VHS superfamily protein (e value: 1.4e-07; %identity: 36.21)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYS+TR+LD E++LE LP           C PEGAA  N+           II                               S VLKES K+ CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK----PRH
        +GIINLV+K    PRH
Subjt:  DGIINLVDK----PRH

AT2G01600.1 ENTH/ANTH/VHS superfamily protein (e value: 1.6e-08; %identity: 37.61)
Query:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN
        +GYSRTR+LD EE+LE LP           C PEGAA  N+          +  AL                               VLKES K+ CAIN
Subjt:  QGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAIN

Query:  DGIINLVDK
        DGIINL+DK
Subjt:  DGIINLVDK

AT4G25940.1 ENTH/ANTH/VHS superfamily protein (e value: 2.2e-05; %identity: 36.52)
Query:  NLGFFHWQGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESC
        N+ F   Q Y RTR L  EE+LE LP           C PEG+A  NY          +  AL                               VLKES 
Subjt:  NLGFFHWQGYSRTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESC

Query:  KIGCAINDGIINLVD
        KI CAINDGIINLVD
Subjt:  KIGCAINDGIINLVD

AT5G57200.1 ENTH/ANTH/VHS superfamily protein (e value: 2.9e-05; %identity: 36.54)
Query:  RTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAINDGII
        RTR L  E++LE LP           C PEGAA  NY          +  AL                               VLKES KI CAINDGII
Subjt:  RTRELDSEEMLEHLPN--------SCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAINDGII

Query:  NLVD
        NLVD
Subjt:  NLVD


Sequences
CDS sequence
ATGCTTCATAGATTCCTGAAATTCATTAATCTTGGTTTCTTTCATTGGCAGGGTTACAGCAGAACTAGGGAACTGGACAGTGAAGAAATGTTGGAGCATTTGCCTAACAG
CTGTTGTATCCCCGAAGGAGCAGCTATTGGGAATTATGCATACGGTACGCCTTGGCACTGGGAAGCAGTTATTATAGCATTGGTTTTTGAGGAGGGAGTCAATTTTGTAG
AGAGAGAGGGAGAGAGACAGAGTTGTCTCTTCTCTAAATGGTTTGAAATCGATTGGGGGAGTGAGGTATTGAAAGAGAGCTGTAAAATCGGTTGTGCTATTAATGATGGA
ATTATAAATCTCGTTGACAAGCCCCGCCACCCTCTGCCTCATTTATGTCCCTCTTTCTCTCCACCCTTTTTCTGGGTTATCTCTGTCTGCAAGCAAAGCAATCGCGAGAA
TCTCTCCGGAGAGACATCGGAGACTGGTATCGCCGGCGATGAAGAACGAAGAACAAAGAAGAAGAGCCTCGGAGGCTGCTCTAGAGGGGGCAACCATTTTGACATCGAAG
CATCAAGCCAAGTTCATCTCTGCGGCACACGATGCAACTGGTTGCGCTTCGAAAAGCCTCCGACGAGGCTAACAAGGGCAAG
mRNA sequence
ATGCTTCATAGATTCCTGAAATTCATTAATCTTGGTTTCTTTCATTGGCAGGGTTACAGCAGAACTAGGGAACTGGACAGTGAAGAAATGTTGGAGCATTTGCCTAACAG
CTGTTGTATCCCCGAAGGAGCAGCTATTGGGAATTATGCATACGGTACGCCTTGGCACTGGGAAGCAGTTATTATAGCATTGGTTTTTGAGGAGGGAGTCAATTTTGTAG
AGAGAGAGGGAGAGAGACAGAGTTGTCTCTTCTCTAAATGGTTTGAAATCGATTGGGGGAGTGAGGTATTGAAAGAGAGCTGTAAAATCGGTTGTGCTATTAATGATGGA
ATTATAAATCTCGTTGACAAGCCCCGCCACCCTCTGCCTCATTTATGTCCCTCTTTCTCTCCACCCTTTTTCTGGGTTATCTCTGTCTGCAAGCAAAGCAATCGCGAGAA
TCTCTCCGGAGAGACATCGGAGACTGGTATCGCCGGCGATGAAGAACGAAGAACAAAGAAGAAGAGCCTCGGAGGCTGCTCTAGAGGGGGCAACCATTTTGACATCGAAG
CATCAAGCCAAGTTCATCTCTGCGGCACACGATGCAACTGGTTGCGCTTCGAAAAGCCTCCGACGAGGCTAACAAGGGCAAG
Protein sequence
MLHRFLKFINLGFFHWQGYSRTRELDSEEMLEHLPNSCCIPEGAAIGNYAYGTPWHWEAVIIALVFEEGVNFVEREGERQSCLFSKWFEIDWGSEVLKESCKIGCAINDG
IINLVDKPRHPLPHLCPSFSPPFFWVISVCKQSNRENLSGETSETGIAGDEERRTKKKSLGGCSRGGNHFDIEASSQVHLCGTRCNWLRFEKPPTRLTRAX