; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10008175 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10008175
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUBP1-associated proteins 1C-like isoform X3
Genome locationChr10:20513296..20516195
RNA-Seq ExpressionHG10008175
SyntenyHG10008175
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7838133.1 UBP1-associated proteins 1C-like isoform X2 [Senna tora]4.1e-0426.29Show/hide
Query:  NHIVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDG----KKTSLSFWPDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSIC
        N  +      Q   V  S+L    P P     + KP+ G     K  +   PD   +E KRKAE    AV    + ++  P   +      KE++ C++C
Subjt:  NHIVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDG----KKTSLSFWPDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSIC

Query:  KVKAT----------SKKHMAKEGRRGLQ----------------TQEPSEEEDQKEKLDDPQKSAEQKAAVENKRAKFWCKICEIGTHCMANMVTHNNG
        +V  T           KKH AKE    +Q                 Q+   + D KE +     + + KA    ++ KFWC+ C++G      M +H  G
Subjt:  KVKAT----------SKKHMAKEGRRGLQ----------------TQEPSEEEDQKEKLDDPQKSAEQKAAVENKRAKFWCKICEIGTHCMANMVTHNNG

Query:  KKHKARLLKLSND
        KKH  R+    ND
Subjt:  KKHKARLLKLSND

MBA0846606.1 hypothetical protein [Gossypium schwendimanii]5.4e-0427.5Show/hide
Query:  MNHIVNPSRGLQTVPVS--------SSLLLPVRPVPDPQNEEPKPFDGKKTSLSFWPDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKF
        M+H+ N +R     P S        S +LLP        NE     + KK  +   PD  +  G ++         E       +P P ISSK    +++
Subjt:  MNHIVNPSRGLQTVPVS--------SSLLLPVRPVPDPQNEEPKPFDGKKTSLSFWPDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKF

Query:  FCSICKVKATS----------KKHMAKEGR----------------------RGLQTQEPSEE-------------------EDQKEKLDDPQKSAEQKA
         C+IC+V  TS          KKH A E R                       GL  Q   +E                     +KE+ D+ + + E   
Subjt:  FCSICKVKATS----------KKHMAKEGR----------------------RGLQTQEPSEE-------------------EDQKEKLDDPQKSAEQKA

Query:  AVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKL
          + K+ KFWC IC +G H    M TH  GKKH ARLLKL
Subjt:  AVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKL

XP_008443847.1 PREDICTED: uncharacterized protein LOC103487343 [Cucumis melo]3.5e-3542.35Show/hide
Query:  MDFRFRANDNKSPATAASSSVRSSDQPIR------------------MNHIVN-----------------------------------------PSRGLQ
        M+FRFRA DNKSPATAAS    +SDQPI+                  M  IV                                           + G+ 
Subjt:  MDFRFRANDNKSPATAASSSVRSSDQPIR------------------MNHIVN-----------------------------------------PSRGLQ

Query:  TV--PVSSSLLLPVRPVPDPQN--EEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVA------EASDTDQTIPTPWISSKKLAKEKFFCSICKVK
         +  P SS+ LL V       N  EEPKP + +   L     PD GKF GKRKA G+  A A      E       IPTPWI SKKLAKE+F CS+C VK
Subjt:  TV--PVSSSLLLPVRPVPDPQN--EEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVA------EASDTDQTIPTPWISSKKLAKEKFFCSICKVK

Query:  ATS----------KKHMAKEGRRGLQT---QEPSE-EEDQKEKLDDPQKSAEQKAAVE-NKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDC
        ATS          KKH AKEGRR +Q    +EP++ EED KEKLD P K+ ++  A+E NKR KF C+IC +G   MA MV+HNNG+KHKARLLKLS  C
Subjt:  ATS----------KKHMAKEGRRGLQT---QEPSE-EEDQKEKLDDPQKSAEQKAAVE-NKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDC

Query:  KLEDHKE
        K ED K+
Subjt:  KLEDHKE

XP_022158136.1 uncharacterized protein LOC111024695 [Momordica charantia]4.1e-0428.51Show/hide
Query:  FRFRANDNKSPATAASSSVRSSDQPIRMNHIVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDGKKTSLSFW-------------PDLGKFEGKRKA
        + FR     +  +AA+  VRS +         +  RG + V V      P+ P P P    P   D KK                   PD   F  KRKA
Subjt:  FRFRANDNKSPATAASSSVRSSDQPIRMNHIVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDGKKTSLSFW-------------PDLGKFEGKRKA

Query:  EGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVKATS----------KKHMAKEGRRGLQTQEPSEEED--------QKEKLD-----DPQKSA
        E        A D D   P+     KK  K+++ C++C+V  +S          KKH  KE   GL+ Q+ S            ++ K+      + +  +
Subjt:  EGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVKATS----------KKHMAKEGRRGLQTQEPSEEED--------QKEKLD-----DPQKSA

Query:  EQKAAVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLL
         Q     NK  KFWCKIC++G      M  H NGK+HKAR L
Subjt:  EQKAAVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLL

XP_038880353.1 zinc finger protein 385B [Benincasa hispida]9.7e-6253.11Show/hide
Query:  MDFRFRANDNKSPATAASSSVRSSDQ--------------------------------------------------------------------PIRMNH
        MDFRFRANDNKSPATA S+SVRS DQ                                                                     IRMNH
Subjt:  MDFRFRANDNKSPATAASSSVRSSDQ--------------------------------------------------------------------PIRMNH

Query:  -IVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVK
         IV+P RGL TVP SSS  LPV PVP+PQNEEPKPFD KK  L     PD  KFEGKRKAEGEA  V    DTDQ  PT WISSKKLAKE+F CS+C V+
Subjt:  -IVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVK

Query:  ATS----------KKHMAKEGRRGLQTQEPSEEEDQKEKLDDPQKSAEQKAAVENK-RAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDCKLED
         TS          KKHMAKEG R LQTQEPS  ED KEKLD+ +K A++K+A++NK   KFWCKICEIGT CMA MV+HNNGKKHKARLLKLS   KL+D
Subjt:  ATS----------KKHMAKEGRRGLQTQEPSEEEDQKEKLDDPQKSAEQKAAVENK-RAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDCKLED

Query:  HKEKP
         K++P
Subjt:  HKEKP

TrEMBL top hitse value%identityAlignment
A0A059CKW4 Uncharacterized protein3.4e-0430.92Show/hide
Query:  PDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVKATS----------KKHMAKE-------GRRGLQTQEPSEEEDQKEKLD
        PD      KRKA   +    E S           S KK  KE++ C++C+V ATS          KKH  KE       G+ G      ++E    + L 
Subjt:  PDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVKATS----------KKHMAKE-------GRRGLQTQEPSEEEDQKEKLD

Query:  DPQ--KSAEQKAAVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKL
         P+  ++ E       K  +FWC +C+IG   +  M +H  GKKH ARL +L
Subjt:  DPQ--KSAEQKAAVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKL

A0A1S3B9S7 uncharacterized protein LOC1034873431.7e-3542.35Show/hide
Query:  MDFRFRANDNKSPATAASSSVRSSDQPIR------------------MNHIVN-----------------------------------------PSRGLQ
        M+FRFRA DNKSPATAAS    +SDQPI+                  M  IV                                           + G+ 
Subjt:  MDFRFRANDNKSPATAASSSVRSSDQPIR------------------MNHIVN-----------------------------------------PSRGLQ

Query:  TV--PVSSSLLLPVRPVPDPQN--EEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVA------EASDTDQTIPTPWISSKKLAKEKFFCSICKVK
         +  P SS+ LL V       N  EEPKP + +   L     PD GKF GKRKA G+  A A      E       IPTPWI SKKLAKE+F CS+C VK
Subjt:  TV--PVSSSLLLPVRPVPDPQN--EEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVA------EASDTDQTIPTPWISSKKLAKEKFFCSICKVK

Query:  ATS----------KKHMAKEGRRGLQT---QEPSE-EEDQKEKLDDPQKSAEQKAAVE-NKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDC
        ATS          KKH AKEGRR +Q    +EP++ EED KEKLD P K+ ++  A+E NKR KF C+IC +G   MA MV+HNNG+KHKARLLKLS  C
Subjt:  ATS----------KKHMAKEGRRGLQT---QEPSE-EEDQKEKLDDPQKSAEQKAAVE-NKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDC

Query:  KLEDHKE
        K ED K+
Subjt:  KLEDHKE

A0A5D3B800 UBP1-associated proteins 1C-like isoform X31.7e-3542.35Show/hide
Query:  MDFRFRANDNKSPATAASSSVRSSDQPIR------------------MNHIVN-----------------------------------------PSRGLQ
        M+FRFRA DNKSPATAAS    +SDQPI+                  M  IV                                           + G+ 
Subjt:  MDFRFRANDNKSPATAASSSVRSSDQPIR------------------MNHIVN-----------------------------------------PSRGLQ

Query:  TV--PVSSSLLLPVRPVPDPQN--EEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVA------EASDTDQTIPTPWISSKKLAKEKFFCSICKVK
         +  P SS+ LL V       N  EEPKP + +   L     PD GKF GKRKA G+  A A      E       IPTPWI SKKLAKE+F CS+C VK
Subjt:  TV--PVSSSLLLPVRPVPDPQN--EEPKPFDGKKTSLSFW--PDLGKFEGKRKAEGEATAVA------EASDTDQTIPTPWISSKKLAKEKFFCSICKVK

Query:  ATS----------KKHMAKEGRRGLQT---QEPSE-EEDQKEKLDDPQKSAEQKAAVE-NKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDC
        ATS          KKH AKEGRR +Q    +EP++ EED KEKLD P K+ ++  A+E NKR KF C+IC +G   MA MV+HNNG+KHKARLLKLS  C
Subjt:  ATS----------KKHMAKEGRRGLQT---QEPSE-EEDQKEKLDDPQKSAEQKAAVE-NKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDC

Query:  KLEDHKE
        K ED K+
Subjt:  KLEDHKE

A0A6J1E048 uncharacterized protein LOC1110246952.0e-0428.51Show/hide
Query:  FRFRANDNKSPATAASSSVRSSDQPIRMNHIVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDGKKTSLSFW-------------PDLGKFEGKRKA
        + FR     +  +AA+  VRS +         +  RG + V V      P+ P P P    P   D KK                   PD   F  KRKA
Subjt:  FRFRANDNKSPATAASSSVRSSDQPIRMNHIVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDGKKTSLSFW-------------PDLGKFEGKRKA

Query:  EGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVKATS----------KKHMAKEGRRGLQTQEPSEEED--------QKEKLD-----DPQKSA
        E        A D D   P+     KK  K+++ C++C+V  +S          KKH  KE   GL+ Q+ S            ++ K+      + +  +
Subjt:  EGEATAVAEASDTDQTIPTPWISSKKLAKEKFFCSICKVKATS----------KKHMAKEGRRGLQTQEPSEEED--------QKEKLD-----DPQKSA

Query:  EQKAAVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLL
         Q     NK  KFWCKIC++G      M  H NGK+HKAR L
Subjt:  EQKAAVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLL

A0A7J9KJQ3 Uncharacterized protein2.6e-0427.5Show/hide
Query:  MNHIVNPSRGLQTVPVS--------SSLLLPVRPVPDPQNEEPKPFDGKKTSLSFWPDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKF
        M+H+ N +R     P S        S +LLP        NE     + KK  +   PD  +  G ++         E       +P P ISSK    +++
Subjt:  MNHIVNPSRGLQTVPVS--------SSLLLPVRPVPDPQNEEPKPFDGKKTSLSFWPDLGKFEGKRKAEGEATAVAEASDTDQTIPTPWISSKKLAKEKF

Query:  FCSICKVKATS----------KKHMAKEGR----------------------RGLQTQEPSEE-------------------EDQKEKLDDPQKSAEQKA
         C+IC+V  TS          KKH A E R                       GL  Q   +E                     +KE+ D+ + + E   
Subjt:  FCSICKVKATS----------KKHMAKEGR----------------------RGLQTQEPSEE-------------------EDQKEKLDDPQKSAEQKA

Query:  AVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKL
          + K+ KFWC IC +G H    M TH  GKKH ARLLKL
Subjt:  AVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCAGGTTCCGAGCCAACGATAACAAATCGCCGGCCACCGCCGCTTCCAGCTCCGTACGCTCCTCCGATCAACCGATCAGGATGAATCACATCGTGAATCCGTC
TAGAGGTTTACAGACTGTTCCAGTTTCCAGTTCTTTGCTCTTACCTGTTCGTCCAGTTCCGGACCCTCAAAACGAAGAACCGAAGCCTTTCGATGGTAAAAAAACAAGTT
TATCGTTCTGGCCAGACTTAGGAAAATTCGAAGGGAAGAGGAAAGCTGAGGGAGAAGCAACAGCAGTAGCGGAGGCCAGTGATACTGATCAAACAATCCCAACTCCATGG
ATCAGTTCAAAGAAATTAGCCAAGGAAAAGTTTTTCTGTAGCATTTGCAAAGTTAAGGCCACAAGCAAGAAGCACATGGCCAAAGAAGGCCGGCGTGGCTTACAAACTCA
AGAACCGAGCGAAGAGGAAGATCAGAAAGAAAAATTAGACGATCCTCAGAAGAGCGCAGAACAAAAAGCAGCCGTTGAAAACAAGCGGGCTAAATTCTGGTGCAAGATTT
GCGAAATTGGAACCCATTGTATGGCTAATATGGTCACACATAACAATGGGAAGAAGCATAAGGCTCGTCTGTTGAAACTTTCTAATGACTGCAAATTGGAAGACCACAAG
GAGAAACCCTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCAGGTTCCGAGCCAACGATAACAAATCGCCGGCCACCGCCGCTTCCAGCTCCGTACGCTCCTCCGATCAACCGATCAGGATGAATCACATCGTGAATCCGTC
TAGAGGTTTACAGACTGTTCCAGTTTCCAGTTCTTTGCTCTTACCTGTTCGTCCAGTTCCGGACCCTCAAAACGAAGAACCGAAGCCTTTCGATGGTAAAAAAACAAGTT
TATCGTTCTGGCCAGACTTAGGAAAATTCGAAGGGAAGAGGAAAGCTGAGGGAGAAGCAACAGCAGTAGCGGAGGCCAGTGATACTGATCAAACAATCCCAACTCCATGG
ATCAGTTCAAAGAAATTAGCCAAGGAAAAGTTTTTCTGTAGCATTTGCAAAGTTAAGGCCACAAGCAAGAAGCACATGGCCAAAGAAGGCCGGCGTGGCTTACAAACTCA
AGAACCGAGCGAAGAGGAAGATCAGAAAGAAAAATTAGACGATCCTCAGAAGAGCGCAGAACAAAAAGCAGCCGTTGAAAACAAGCGGGCTAAATTCTGGTGCAAGATTT
GCGAAATTGGAACCCATTGTATGGCTAATATGGTCACACATAACAATGGGAAGAAGCATAAGGCTCGTCTGTTGAAACTTTCTAATGACTGCAAATTGGAAGACCACAAG
GAGAAACCCTTATGA
Protein sequenceShow/hide protein sequence
MDFRFRANDNKSPATAASSSVRSSDQPIRMNHIVNPSRGLQTVPVSSSLLLPVRPVPDPQNEEPKPFDGKKTSLSFWPDLGKFEGKRKAEGEATAVAEASDTDQTIPTPW
ISSKKLAKEKFFCSICKVKATSKKHMAKEGRRGLQTQEPSEEEDQKEKLDDPQKSAEQKAAVENKRAKFWCKICEIGTHCMANMVTHNNGKKHKARLLKLSNDCKLEDHK
EKPL