; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G014200 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G014200
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionC2H2-type domain-containing protein
Genome locationchr07:20719682..20720380
RNA-Seq ExpressionLsi07G014200
SyntenyLsi07G014200
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily
IPR044653 - C2H2-type zinc-finger protein AZF1/2/3-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN55980.1 hypothetical protein Csa_011357 [Cucumis sativus]1.0e-4253.53Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH
        MDH+ + LRIKIK+R+ NA A  VDD+D   QF S       +ET MEKGLLFQP TPVTA  +   S SP      VVDCS  L SWLLTGRRGRKS  
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH

Query:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI
          TS T         IP D L  KR+K ET SD +E S   K H KENKN   +EKEYKCN+C KVF++ +A+GGHKS H KANKTD AI KET +    
Subjt:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI

Query:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD
         TIGT+FE K T+S+ DSST     ++ ++DFDLN+LP ++
Subjt:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD

KGN56002.2 hypothetical protein Csa_011010 [Cucumis sativus]2.1e-1957.63Show/hide
Query:  KRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKETTANTTITTIGTTFESKITLSVDSS---TGR
        KR+KIET SD E+  K+K HS E KN  +EKEYKC++CSKVF++SRA+GGHKSSHYK  KT D  I KETT         T+    IT+S+DSS   T +
Subjt:  KRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKETTANTTITTIGTTFESKITLSVDSS---TGR

Query:  KRMMDFDLN-QLPSDDDD
        KR+MDFDLN QLP DD++
Subjt:  KRMMDFDLN-QLPSDDDD

TYK19398.1 TPRXL protein [Cucumis melo var. makuwa]3.7e-4855.6Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK
        MDH+ + LRIKIK+RN +  AA VD   D+D  KQFCS+ HM  DSET M+KGLLFQ  TP+    +   S SPS  + D+VDCS  L SW LTGRRGRK
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK

Query:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN
        SI ++ T            I  D L  KR+K ET SD EKS +    +KENKN  V+EK+YKCN+CSKVF+S +A+GGHKS HYKANKT IAI KE  AN
Subjt:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN

Query:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP
            TIGTT E KI +S V SST R      MMDFDLN+LP
Subjt:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP

XP_008449183.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103491133 [Cucumis melo]1.2e-1148Show/hide
Query:  LTGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITK
        L GRRGRK         +  T Y I  D L  KR KIET SD ++  K+ G           KEY C+ CSKVF++ RA+GGHKSSHYK NKT D  I K
Subjt:  LTGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITK

Query:  ETTANTTITTIGTTFESKITLSVDS
        ETT         TT E KIT+S+DS
Subjt:  ETTANTTITTIGTTFESKITLSVDS

XP_011652725.1 putative protein TPRXL [Cucumis sativus]2.8e-4049.42Show/hide
Query:  DGLRIKIKVRNGNAVAATVDDDDSNKQFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----
        D  +IK + RN      ++D +D NKQ CS+G HMR D ETTMEKGL  QPPT  T   SSS SS PS A+ +  VVDC  SL TSW LTGRRGRK    
Subjt:  DGLRIKIKVRNGNAVAATVDDDDSNKQFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----

Query:  SIHTSYTIPDDIL--------------------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKAN
        S  +S T  DD++                          + KR+KIET SD E+  K+K HS E KN  +EKEYKC++CSKVF++SRA+GGHKSSHYK  
Subjt:  SIHTSYTIPDDIL--------------------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKAN

Query:  KT-DIAITKETTANTTITTIGTTFESKITLSVDSS---TGRKRMMDFDLN-QLPSDDDD
        KT D  I KETT         T+    IT+S+DSS   T +KR+MDFDLN QLP DD++
Subjt:  KT-DIAITKETTANTTITTIGTTFESKITLSVDSS---TGRKRMMDFDLN-QLPSDDDD

TrEMBL top hitse value%identityAlignment
A0A0A0L1Z9 C2H2-type domain-containing protein5.0e-4353.53Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH
        MDH+ + LRIKIK+R+ NA A  VDD+D   QF S       +ET MEKGLLFQP TPVTA  +   S SP      VVDCS  L SWLLTGRRGRKS  
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH

Query:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI
          TS T         IP D L  KR+K ET SD +E S   K H KENKN   +EKEYKCN+C KVF++ +A+GGHKS H KANKTD AI KET +    
Subjt:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI

Query:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD
         TIGT+FE K T+S+ DSST     ++ ++DFDLN+LP ++
Subjt:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD

A0A0A0L4M3 C2H2-type domain-containing protein1.1e-4050.2Show/hide
Query:  IKIKVRNGNAVAATVDDDDSNKQFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----SIHT
        +KIK R  N    ++D +D NKQ CS+G HMR D ETTMEKGL  QPPT  T   SSS SS PS A+ +  VVDC  SL TSW LTGRRGRK    S  +
Subjt:  IKIKVRNGNAVAATVDDDDSNKQFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----SIHT

Query:  SYTIPDDIL--------------------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-D
        S T  DD++                          + KR+KIET SD E+  K+K HS E KN  +EKEYKC++CSKVF++SRA+GGHKSSHYK  KT D
Subjt:  SYTIPDDIL--------------------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-D

Query:  IAITKETTANTTITTIGTTFESKITLSVDSS---TGRKRMMDFDLN-QLPSDDDD
          I KETT         T+    IT+S+DSS   T +KR+MDFDLN QLP DD++
Subjt:  IAITKETTANTTITTIGTTFESKITLSVDSS---TGRKRMMDFDLN-QLPSDDDD

A0A1S3BLH2 LOW QUALITY PROTEIN: uncharacterized protein LOC1034911336.0e-1248Show/hide
Query:  LTGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITK
        L GRRGRK         +  T Y I  D L  KR KIET SD ++  K+ G           KEY C+ CSKVF++ RA+GGHKSSHYK NKT D  I K
Subjt:  LTGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITK

Query:  ETTANTTITTIGTTFESKITLSVDS
        ETT         TT E KIT+S+DS
Subjt:  ETTANTTITTIGTTFESKITLSVDS

A0A5D3D748 TPRXL protein1.8e-4855.6Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK
        MDH+ + LRIKIK+RN +  AA VD   D+D  KQFCS+ HM  DSET M+KGLLFQ  TP+    +   S SPS  + D+VDCS  L SW LTGRRGRK
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK

Query:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN
        SI ++ T            I  D L  KR+K ET SD EKS +    +KENKN  V+EK+YKCN+CSKVF+S +A+GGHKS HYKANKT IAI KE  AN
Subjt:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN

Query:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP
            TIGTT E KI +S V SST R      MMDFDLN+LP
Subjt:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP

A0A5D3D759 TPRXL protein1.5e-0748.48Show/hide
Query:  MEKGLLFQPPTPVTANCSSSFSSSPS-AASIDVVDCSISLTSWLLTGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKE
        MEKGLL QPPT       S  SSSPS AAS  VVDC   ++SW L GRRGRK         +  T Y I  D L  KR KIET SD ++  K+ G  +E
Subjt:  MEKGLLFQPPTPVTANCSSSFSSSPS-AASIDVVDCSISLTSWLLTGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G49930.1 C2H2 and C2HC zinc fingers superfamily protein1.6e-0432.04Show/hide
Query:  SLTSWLLTGRRGRKSIHTSYTIP--DDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKET
        +L SW    R  R  I      P  ++ L    + +   S D  SP    HS    ++  +K+YKC++C K F S +A+GGHK+SH K    D+  +  T
Subjt:  SLTSWLLTGRRGRKSIHTSYTIP--DDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKET

Query:  TAN
          N
Subjt:  TAN

AT5G04340.1 zinc finger of Arabidopsis thaliana 64.6e-0447.92Show/hide
Query:  YKCNICSKVFSSSRAMGGHKSSHYKA-NKTDIAITKETTANTTITTIG
        YKC++C K FSS +A+GGHK+SH K+ + T  A   E + ++ ITT G
Subjt:  YKCNICSKVFSSSRAMGGHKSSHYKA-NKTDIAITKETTANTTITTIG

AT5G56200.1 C2H2 type zinc finger transcription factor family2.9e-0635.29Show/hide
Query:  EYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTITTIGTTFESKITLSV-DSSTGRKRMMDFDLNQLPSDDDDDQ
        ++ CNIC K FS+ +A+GGHK  H+ A  + +A T   TA  T+    T   S++T +V +    ++R+++FDLN+LP ++++++
Subjt:  EYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTITTIGTTFESKITLSV-DSSTGRKRMMDFDLNQLPSDDDDDQ

AT5G67450.1 zinc-finger protein 14.6e-0435.38Show/hide
Query:  KEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTITTIGTTFESKITLSVDSSTG
        ++YKC +C K FSS +A+GGHK+SH K   T I    +  +N + +  G+     + ++V  +TG
Subjt:  KEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTITTIGTTFESKITLSVDSSTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCATGATTGCGACGGACTCAGGATCAAGATCAAGGTCAGAAACGGCAACGCCGTTGCCGCCACGGTGGACGACGACGACAGTAATAAGCAGTTTTGTTCCGTTGG
TCATATGAGGATTGATTCTGAAACAACAATGGAGAAGGGATTATTATTCCAACCCCCAACTCCGGTCACCGCCAACTGTTCTTCCTCTTTCTCTTCCTCTCCATCTGCCG
CTTCTATCGACGTTGTTGATTGTTCTATATCTCTAACCTCCTGGTTGCTTACTGGCCGGAGAGGGCGGAAGAGTATTCATACTTCTTACACTATCCCTGATGATATACTT
GAGGCAAAGAGAATGAAGATTGAGACTCGTTCCGATGACGAGAAGTCACCAAAAAGGAAAGGGCACAGTAAAGAAAATAAGAATGAGGTTATGGAGAAGGAATATAAATG
CAATATATGTAGCAAAGTGTTCTCAAGTTCAAGAGCAATGGGAGGCCATAAGTCCAGCCACTACAAAGCCAATAAAACAGACATTGCCATTACAAAGGAAACTACTGCCA
ATACTACTATTACTACCATTGGAACAACCTTCGAATCGAAGATAACGTTGTCGGTAGATTCATCGACGGGTCGGAAGAGGATGATGGATTTTGATCTCAACCAGCTGCCG
TCGGACGACGACGACGACCAAGACAGAAGCCATTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCATGATTGCGACGGACTCAGGATCAAGATCAAGGTCAGAAACGGCAACGCCGTTGCCGCCACGGTGGACGACGACGACAGTAATAAGCAGTTTTGTTCCGTTGG
TCATATGAGGATTGATTCTGAAACAACAATGGAGAAGGGATTATTATTCCAACCCCCAACTCCGGTCACCGCCAACTGTTCTTCCTCTTTCTCTTCCTCTCCATCTGCCG
CTTCTATCGACGTTGTTGATTGTTCTATATCTCTAACCTCCTGGTTGCTTACTGGCCGGAGAGGGCGGAAGAGTATTCATACTTCTTACACTATCCCTGATGATATACTT
GAGGCAAAGAGAATGAAGATTGAGACTCGTTCCGATGACGAGAAGTCACCAAAAAGGAAAGGGCACAGTAAAGAAAATAAGAATGAGGTTATGGAGAAGGAATATAAATG
CAATATATGTAGCAAAGTGTTCTCAAGTTCAAGAGCAATGGGAGGCCATAAGTCCAGCCACTACAAAGCCAATAAAACAGACATTGCCATTACAAAGGAAACTACTGCCA
ATACTACTATTACTACCATTGGAACAACCTTCGAATCGAAGATAACGTTGTCGGTAGATTCATCGACGGGTCGGAAGAGGATGATGGATTTTGATCTCAACCAGCTGCCG
TCGGACGACGACGACGACCAAGACAGAAGCCATTCTTGA
Protein sequenceShow/hide protein sequence
MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIHTSYTIPDDIL
EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTITTIGTTFESKITLSVDSSTGRKRMMDFDLNQLP
SDDDDDQDRSHS