CuGenDBv2

Tan0007798 (gene) of Snake gourd v1 genome

Gene ID: Tan0007798
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA repair REX1-B protein
Genome location: LG04:6486651..6489803
RNA-Seq Expression: Tan0007798
Synteny: Tan0007798
Gene Ontology terms: NA
InterPro domains: IPR039491 - Required for excision 1-B domain-containing protein
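
The genome location above uses a `SEQID:START..END` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split and the gene span computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG04:6486651..6489803")
# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)  # LG04 6486651 6489803 3153
```

Under that assumption the gene region covers 3153 bp on LG04, which is longer than the mRNA sequence listed further down.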


Homology
GenBank top hits (e value, % identity, alignment)
KAG7012978.1 hypothetical protein SDJN02_25732 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 6.4e-81; % identity: 78.85)
Query:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL
        MEDFD+KLSLAEQMEED +T  D+S SSNTL+LLRKFL+IQ+RRAEAYA+LK GFDEYMTS REI +QQ+C EITAEF+NCS QVIDIESNF SPDHDR+
Subjt:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL

Query:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY
        ELA+LLRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCRFKTPQEHECVHVH IT      EAE DA+YDNALKEAIKGVQ+AVTTINEHMEE+RY
Subjt:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY

Query:  EIAAFEDK
        EIAA EDK
Subjt:  EIAAFEDK

XP_004135056.1 uncharacterized protein LOC101210078 [Cucumis sativus] (e value: 1.9e-77; % identity: 77.18)
Query:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL
        MEDF++KLSL EQMEED KT   DSSNTLSLLR+FL+IQ+RRAEAY+KLK GFDEYMTS REI +QQ+CSEIT EF+NCS QVIDIESNFRS DH+RLEL
Subjt:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL

Query:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI
        AN LRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCRF  P+EHECVHVH IT      EAE DA+YDNALKEAIKGVQDAVTTINEHMEE+RYEI
Subjt:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI

Query:  AAFEDK
         A EDK
Subjt:  AAFEDK

XP_022945518.1 uncharacterized protein C19orf60 homolog [Cucurbita moschata] (e value: 1.9e-80; % identity: 78.37)
Query:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL
        MEDFD+KLSLAEQMEED +T  D+S SSNTL+LLRKFL+IQ+RRAEAYA+LK GFDEYMTS REI +QQ+C EITAEF+NCS QVIDIESNF SPDHDR+
Subjt:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL

Query:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY
        ELA+LLRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCRFKTP+EHECVHVH IT      EAE DA+YDNALKEAIKGVQ+AVTTINEHMEE+RY
Subjt:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY

Query:  EIAAFEDK
        EIAA EDK
Subjt:  EIAAFEDK

XP_023541138.1 uncharacterized protein C19orf60 homolog [Cucurbita pepo subsp. pepo] (e value: 1.4e-80; % identity: 78.37)
Query:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL
        MEDFD+KLSLAEQMEED +T  D+S SSNTL+LLRKFL+IQ+RRAEAYA+LK GFDEYMTS REI +QQ+C EITAEF+NCS QVIDIESNF SPDHDR+
Subjt:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL

Query:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY
        ELA+LLRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCRFKTP+EHEC+HVH IT      EAE DA+YDNALKEAIKGVQDAVTTINEHMEE+RY
Subjt:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY

Query:  EIAAFEDK
        EIAA EDK
Subjt:  EIAAFEDK

XP_038892275.1 uncharacterized protein LOC120081463 [Benincasa hispida] (e value: 1.0e-78; % identity: 75.73)
Query:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL
        MEDF++KLSL EQMEED++TDASDSSNTLSLLR+FL+IQ+RRAEAYA+LK GFDEYMTS REI +QQ+CS+IT EF+NCS QVIDIESNFRS DH+RLEL
Subjt:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL

Query:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI
        ANLLRS+QE+E++K HLT QIQLLK+ GRPS  +VSH NCRF  P+EHECVHVH +T      EAE +A+YDNALKEAIKGVQD VTTINE+MEE+RYEI
Subjt:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI

Query:  AAFEDK
        AA EDK
Subjt:  AAFEDK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KUN3 Uncharacterized protein (e value: 9.3e-78; % identity: 77.18)
Query:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL
        MEDF++KLSL EQMEED KT   DSSNTLSLLR+FL+IQ+RRAEAY+KLK GFDEYMTS REI +QQ+CSEIT EF+NCS QVIDIESNFRS DH+RLEL
Subjt:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL

Query:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI
        AN LRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCRF  P+EHECVHVH IT      EAE DA+YDNALKEAIKGVQDAVTTINEHMEE+RYEI
Subjt:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI

Query:  AAFEDK
         A EDK
Subjt:  AAFEDK

A0A5D3CFT2 Uncharacterized protein (e value: 3.0e-76; % identity: 75.73)
Query:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL
        MEDF++KLSL EQMEED  T   DSSNTLSLLR+FL+IQ+RRAEAY+KLK GFDEYMTS REI +QQ+CSEIT EF+NCS QVIDIESNFR+ DH+RLEL
Subjt:  MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLEL

Query:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI
        ANLLRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCR   P+EHECVHVH IT      EAE DA+YDNALKEAIKGVQDAVTTIN+HMEE+RYEI
Subjt:  ANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEI

Query:  AAFEDK
         A EDK
Subjt:  AAFEDK

A0A6J1DAT6 uncharacterized protein LOC111018757 (e value: 2.7e-77; % identity: 75.48)
Query:  MEDFDEKLSLAE--QMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL
        MEDFD KLSLAE  +MEED++  A+D SNTL LLRKFL+IQ+RRAEAYAKLK GFDEYMTS RE+ +QQ+CSEIT EFN CS QVIDIES+FR+PDH RL
Subjt:  MEDFDEKLSLAE--QMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL

Query:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY
        ELA+LLRS+Q QE++K HLT QIQLLK+ GRPS  LVSH NCRFKTP+EHECVHVH IT      EAE DA+YDN+LKEAIKGVQDAVTTINEHMEE+RY
Subjt:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY

Query:  EIAAFEDK
        EIAA ED+
Subjt:  EIAAFEDK

A0A6J1G147 uncharacterized protein C19orf60 homolog (e value: 9.0e-81; % identity: 78.37)
Query:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL
        MEDFD+KLSLAEQMEED +T  D+S SSNTL+LLRKFL+IQ+RRAEAYA+LK GFDEYMTS REI +QQ+C EITAEF+NCS QVIDIESNF SPDHDR+
Subjt:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL

Query:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY
        ELA+LLRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCRFKTP+EHECVHVH IT      EAE DA+YDNALKEAIKGVQ+AVTTINEHMEE+RY
Subjt:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY

Query:  EIAAFEDK
        EIAA EDK
Subjt:  EIAAFEDK

A0A6J1HYI6 uncharacterized protein C19orf60 homolog (e value: 9.0e-81; % identity: 78.37)
Query:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL
        MEDFD+KLSLAEQMEED +T  D+S SSNTL+LLRKFL+IQ+RRAEAYA+LK GFDEYMTS REI +QQ+C EITAEF+NCS QVIDIESNF SPDHDR+
Subjt:  MEDFDEKLSLAEQMEEDEKT--DASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRL

Query:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY
        ELA+LLRS+QEQE++K HLT QIQLLK+ GRPS  LVSH NCRFKTP+EHECVHVH IT      EAE DA+YDNALKEAIKGVQ+AVTTINEHMEE+RY
Subjt:  ELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRY

Query:  EIAAFEDK
        EIAA EDK
Subjt:  EIAAFEDK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G04910.1 unknown protein (e value: 4.1e-54; % identity: 52.07)
Query:  EDFDEKLSLAEQMEEDEKTDASDS--------------SNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIE
        EDF +K+SL + +  D + +   S              S TL LLR  L+IQ+RRA+AYA LKSGF EY+ S  E  +Q++CSEIT EF+ CS QV ++E
Subjt:  EDFDEKLSLAEQMEEDEKTDASDS--------------SNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIE

Query:  SNFRSPDHDRLELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVT
        S F +P+  R +LA LL  +Q QE++K HLTV IQ+LK+ GRPS  +++H NC+FK P +HECVH+H IT      EAE DA++DNALKEAI+GVQDAVT
Subjt:  SNFRSPDHDRLELANLLRSLQEQERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTIT------EAEEDAKYDNALKEAIKGVQDAVT

Query:  TINEHMEELRYEIAAFE
        +INE++E++RYEI A E
Subjt:  TINEHMEELRYEIAAFE
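
The % identity figures in the hit tables above appear consistent with the usual BLAST-style definition: identical positions divided by the number of alignment columns, gaps included. For instance, 78.85 would correspond to 164 identical positions over 208 columns. The sketch below applies that definition to the middle line of an alignment, where a letter marks an identical residue and `+` or a space marks a non-identical column. The function name `percent_identity` is hypothetical, and the inputs are assumed to be alignment blocks with the `Query:` prefix and leading padding already removed.

```python
from typing import Sequence


def percent_identity(query_blocks: Sequence[str],
                     midline_blocks: Sequence[str]) -> float:
    """Percent identity = identical columns / total alignment columns."""
    # Use the query rows for the column count: they include gap characters,
    # whereas trailing spaces on a midline may have been trimmed.
    columns = sum(len(block) for block in query_blocks)
    if columns == 0:
        raise ValueError("empty alignment")
    # In the midline, only letters mark identical residues.
    identical = sum(ch.isalpha() for block in midline_blocks for ch in block)
    return 100.0 * identical / columns


# Final block shared by several alignments above: 7 of 8 columns identical.
print(percent_identity(["EIAAFEDK"], ["EIAA EDK"]))  # 87.5
```

Passing all blocks of one hit, in order, gives the value for the full alignment.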


Sequences
CDS sequence
ATGGAAGATTTTGACGAGAAGCTCTCTCTGGCTGAGCAAATGGAGGAAGACGAAAAAACCGATGCCAGTGATTCCTCTAACACTCTCAGTTTGCTCCGCAAGTTTCTTAA
AATCCAGCGGCGGAGAGCGGAAGCTTATGCCAAGCTTAAGAGTGGTTTCGACGAGTATATGACTTCAAGAAGGGAGATACCTTTCCAACAGATCTGCAGTGAGATCACTG
CGGAATTCAATAATTGCTCCGCACAAGTCATTGACATTGAGTCGAATTTCCGGAGTCCAGATCATGATAGGCTCGAGTTAGCCAACTTGCTTAGATCTCTTCAAGAACAA
GAAAGGAAGAAATTTCACCTGACAGTCCAAATTCAGCTCTTGAAGCAAATGGGCCGCCCATCAGGGTGCTTGGTGAGCCACAACAATTGCAGATTCAAGACACCTCAAGA
GCACGAGTGTGTGCATGTCCATACGATAACTGAAGCAGAAGAAGATGCCAAATATGACAATGCTCTCAAGGAAGCAATCAAAGGTGTGCAAGATGCTGTGACGACCATAA
ATGAGCACATGGAGGAACTGAGATATGAGATTGCAGCTTTTGAAGATAAGTAA
mRNA sequence
AAAAAATTTTTGAAACTTAAAATCTTACTAAATTCCACAAACATTTTTTTTAATTAAAAAAACAAGATTTTTCGCCTTTTTAACATCCAAATGCTTGAAATTGAAATCAT
CCGAAACCGTAACTGAAATCGTCTGAATCAAGGGTTCCAAACAGAAGATAAAGCTGCAGGGAGAAATATTGCCAAGAGAAGGCTCCCGCGCCCTAAAGAATGGAAGATTT
TGACGAGAAGCTCTCTCTGGCTGAGCAAATGGAGGAAGACGAAAAAACCGATGCCAGTGATTCCTCTAACACTCTCAGTTTGCTCCGCAAGTTTCTTAAAATCCAGCGGC
GGAGAGCGGAAGCTTATGCCAAGCTTAAGAGTGGTTTCGACGAGTATATGACTTCAAGAAGGGAGATACCTTTCCAACAGATCTGCAGTGAGATCACTGCGGAATTCAAT
AATTGCTCCGCACAAGTCATTGACATTGAGTCGAATTTCCGGAGTCCAGATCATGATAGGCTCGAGTTAGCCAACTTGCTTAGATCTCTTCAAGAACAAGAAAGGAAGAA
ATTTCACCTGACAGTCCAAATTCAGCTCTTGAAGCAAATGGGCCGCCCATCAGGGTGCTTGGTGAGCCACAACAATTGCAGATTCAAGACACCTCAAGAGCACGAGTGTG
TGCATGTCCATACGATAACTGAAGCAGAAGAAGATGCCAAATATGACAATGCTCTCAAGGAAGCAATCAAAGGTGTGCAAGATGCTGTGACGACCATAAATGAGCACATG
GAGGAACTGAGATATGAGATTGCAGCTTTTGAAGATAAGTAATCTTCCAACATATTGATAATATCTTGCTCTGCAATCGAATTGGTTCTTGAATAACTTGCATATGGCGA
CGTAAGGCCAAACCACAAACTTTACGTTCTACACTTTTTGCTTGATATTTCACCATCTTTTATGAGATATATCAAATGAATTCACAATTTTTGTGTCTCCTTTAGATTGA
GAAATAAAGCAGGGACAGTAATTAGGAGAAGGTCAAGGATACTCTTGCAAGCCTTTTTTGCCTCGGCTTTTGCTGCTGCTAACGGTAATAGGCTTTTGTCTCTACTACTT
TTCCAAAATATCCTATATCAACCACATCGTAACCAAAGCTTACTATTGAACTTTATTGTTATTCAACTTGAATCTCTCAAATTTTCTAATACTTTATTGACATATCTATA
TCGATATGTTATTTTATTTCATGAATACCATACTATTTTTATATGCTGTCAAGACAAACAACCAATTGAGGTAGTATGAAACATTGTGTTATTATA
Protein sequence
MEDFDEKLSLAEQMEEDEKTDASDSSNTLSLLRKFLKIQRRRAEAYAKLKSGFDEYMTSRREIPFQQICSEITAEFNNCSAQVIDIESNFRSPDHDRLELANLLRSLQEQ
ERKKFHLTVQIQLLKQMGRPSGCLVSHNNCRFKTPQEHECVHVHTITEAEEDAKYDNALKEAIKGVQDAVTTINEHMEELRYEIAAFEDK
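
The protein sequence should be the conceptual translation of the CDS sequence listed earlier. As a minimal sketch, assuming the standard nuclear genetic code applies (the record does not say which code was used), the relationship can be checked with a small translator. The names `CODON_TABLE` and `translate` are illustrative only; the two inputs are the first 21 and last 21 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGAAGATTTTGACGAGAAG"))  # MEDFDEK
print(translate("GCAGCTTTTGAAGATAAGTAA"))  # AAFEDK
```

Both results match the start (`MEDFDEK`) and end (`AAFEDK`) of the protein sequence. Passing the full 603-nucleotide CDS should yield 200 residues followed by the `TAA` stop.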