; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019843 (gene) of Snake gourd v1 genome

Gene IDTan0019843
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1677)
Genome locationLG02:94030840..94033627
RNA-Seq ExpressionTan0019843
SyntenyTan0019843
Gene Ontology termsNA
InterPro domainsIPR012876 - Protein of unknown function DUF1677, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065049.1 DUF1677 domain-containing protein [Cucumis melo var. makuwa]4.7e-5877.36Show/hide
Query:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI
        +GLQRTISDIS ELTKEA+  AG  + LP ISEVEAAACECCGLSEDCTAEYI RV+DKFMGKLICGLCAEA      KG KREEALKEHMSACAKFNRI
Subjt:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI

Query:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV
        GR YPVLYQAEAIK+ILKK+ T       A  AHR GRIGRTSSCIPA+TRD+CDPT+V
Subjt:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV

XP_008445002.1 PREDICTED: uncharacterized protein LOC103488172 isoform X1 [Cucumis melo]3.1e-6281.25Show/hide
Query:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGME-KGRKREEALKEHMSACAKFNR
        +GLQRTISDIS ELTKEA+A AG  + LP ISEVEAAACECCGLSEDCTAEYI RV+DKFMGKLICGLCAEAVNE ME KG KREEALKEHMSACAKFNR
Subjt:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGME-KGRKREEALKEHMSACAKFNR

Query:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV
        IGR YPVLYQAEAIK+ILKK+ T       A  AHR GRIGRTSSCIPA+TRD+CDPTMV
Subjt:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV

XP_016899975.1 PREDICTED: uncharacterized protein LOC103488172 isoform X2 [Cucumis melo]1.3e-6381.76Show/hide
Query:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI
        +GLQRTISDIS ELTKEA+A AG  + LP ISEVEAAACECCGLSEDCTAEYI RV+DKFMGKLICGLCAEAVNE MEKG KREEALKEHMSACAKFNRI
Subjt:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI

Query:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV
        GR YPVLYQAEAIK+ILKK+ T       A  AHR GRIGRTSSCIPA+TRD+CDPTMV
Subjt:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV

XP_038885709.1 uncharacterized protein LOC120076007 isoform X1 [Benincasa hispida]4.5e-6181.37Show/hide
Query:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGME-KGRKREEALKEHMSACAKFNRI
        +GLQRTISDISSELTKE +A  G L LP ISEVEAAACECCGLSEDCTAEYI  VRDKFMGKLICGLCAEAVNE ME K   REEALKEHM+ACAKFN+I
Subjt:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGME-KGRKREEALKEHMSACAKFNRI

Query:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCD-PTMVK
        GRAYPVLYQAEAIK+ILKK+S        A +AHRNGRIGRTSSCIPAITRDVCD PTMVK
Subjt:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCD-PTMVK

XP_038885710.1 uncharacterized protein LOC120076007 isoform X2 [Benincasa hispida]1.8e-6281.88Show/hide
Query:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRIG
        +GLQRTISDISSELTKE +A  G L LP ISEVEAAACECCGLSEDCTAEYI  VRDKFMGKLICGLCAEAVNE MEK   REEALKEHM+ACAKFN+IG
Subjt:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRIG

Query:  RAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCD-PTMVK
        RAYPVLYQAEAIK+ILKK+S        A +AHRNGRIGRTSSCIPAITRDVCD PTMVK
Subjt:  RAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCD-PTMVK

TrEMBL top hitse value%identityAlignment
A0A0A0LLX3 Uncharacterized protein1.6e-5675.78Show/hide
Query:  QGLQRTISDISSELTKEAVAF-AGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEK-GRKREEALKEHMSACAKFNR
        +GLQRTISDIS EL+KE +   A  + LP ISEVEAAACECCGLSEDCTAEYI  V+DKFMGKLICGLCAEAVNE MEK G KREEALKEHMSACAKFNR
Subjt:  QGLQRTISDISSELTKEAVAF-AGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEK-GRKREEALKEHMSACAKFNR

Query:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMVK
        IGR YPVLYQAEAIK+ILKK+           + HR GRIGR+SSCIPA+ RDVCDPTMVK
Subjt:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMVK

A0A1S3BBP3 uncharacterized protein LOC103488172 isoform X11.5e-6281.25Show/hide
Query:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGME-KGRKREEALKEHMSACAKFNR
        +GLQRTISDIS ELTKEA+A AG  + LP ISEVEAAACECCGLSEDCTAEYI RV+DKFMGKLICGLCAEAVNE ME KG KREEALKEHMSACAKFNR
Subjt:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGME-KGRKREEALKEHMSACAKFNR

Query:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV
        IGR YPVLYQAEAIK+ILKK+ T       A  AHR GRIGRTSSCIPA+TRD+CDPTMV
Subjt:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV

A0A1S4DW91 uncharacterized protein LOC103488172 isoform X26.1e-6481.76Show/hide
Query:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI
        +GLQRTISDIS ELTKEA+A AG  + LP ISEVEAAACECCGLSEDCTAEYI RV+DKFMGKLICGLCAEAVNE MEKG KREEALKEHMSACAKFNRI
Subjt:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI

Query:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV
        GR YPVLYQAEAIK+ILKK+ T       A  AHR GRIGRTSSCIPA+TRD+CDPTMV
Subjt:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV

A0A5A7VHU7 DUF1677 domain-containing protein2.3e-5877.36Show/hide
Query:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI
        +GLQRTISDIS ELTKEA+  AG  + LP ISEVEAAACECCGLSEDCTAEYI RV+DKFMGKLICGLCAEA      KG KREEALKEHMSACAKFNRI
Subjt:  QGLQRTISDISSELTKEAVAFAG-KLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRI

Query:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV
        GR YPVLYQAEAIK+ILKK+ T       A  AHR GRIGRTSSCIPA+TRD+CDPT+V
Subjt:  GRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV

A0A6J1K942 uncharacterized protein LOC1114917779.5e-5775.93Show/hide
Query:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEK-GRKREEALKEHMSACAKFNRI
        +GLQRTISDISSELTK  +  A    LP ISEVEAA CECCGLSE+CTAEYI+RVR +FMGK+ICGLCAEAVNE M K G ++EE+LKEHMSACAKFNR 
Subjt:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEK-GRKREEALKEHMSACAKFNRI

Query:  GRAYPVLYQAEAIKKILKKSSTR--ETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMVK
        GR YPVLYQAEAIK+ILKKSSTR    S     A HRNGRIGRTSSCIPAIT+DVCDPT+VK
Subjt:  GRAYPVLYQAEAIKKILKKSSTR--ETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72510.1 Protein of unknown function (DUF1677)1.2e-1436.97Show/hide
Query:  ISDISSELTKEAVAFAGKLSLPVIS--EVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRK---REEALKEHMSACAKFNRIG-
        +SD +  ++   V     +S+  IS  E ++  C+CCGL+E+CT  YI  VR+++MGK ICGLC+EAV   + + ++    EEA+  HM+ C KF     
Subjt:  ISDISSELTKEAVAFAGKLSLPVIS--EVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRK---REEALKEHMSACAKFNRIG-

Query:  RAYPVLYQAEAIKKILKKS
           P  +   A+++IL+KS
Subjt:  RAYPVLYQAEAIKKILKKS

AT1G79770.1 Protein of unknown function (DUF1677)1.5e-3351.95Show/hide
Query:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKR--EEALKEHMSACAKFNR
        + L R++SDIS +  +E      +  L  I EVE A CECCG+ E+CT EYI RVR+KF GK ICGLC+EAV E  +K  +   E ALKEHMSAC +FN+
Subjt:  QGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKR--EEALKEHMSACAKFNR

Query:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDV
        +GR YP L+QA+A++ +L++ STR  S     A      I RTSSCIPAITRD+
Subjt:  IGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDV

AT3G22540.1 Protein of unknown function (DUF1677)1.4e-1538Show/hide
Query:  EVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKR--EEALKEHMSACAKFNRIGRAYPVLYQAEAIKKILKKSSTRETSTAVA
        E+E+  CECCGL EDCT +YIS V+  F  K +CGLC+EAV + + + +    +EA+K H+S C KF +     P ++ A+ ++++L++ S   T+++ +
Subjt:  EVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKR--EEALKEHMSACAKFNRIGRAYPVLYQAEAIKKILKKSSTRETSTAVA

AT4G14819.1 Protein of unknown function (DUF1677)2.5e-1746.67Show/hide
Query:  EVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKR-EEALKEHMSACAKFNRIGRAYPVLYQAEAIKKILKKSS
        E+E+  CECCGL EDCT  YIS+V+  F GK +CGLC+EAV++   +  K  EEA+  HMS C KFN    A P    A+ ++++L++ S
Subjt:  EVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKR-EEALKEHMSACAKFNRIGRAYPVLYQAEAIKKILKKSS

AT5G25840.1 Protein of unknown function (DUF1677)4.7e-4054.34Show/hide
Query:  LQRTISDISSE------LTKEAVAFAGK---LSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEK--------GRKREEA
        LQRTISDIS +      LTKE    A     LSL  ISEVE A CECCG+SE+CT EYI RVR KF GKLICGLC +AV   MEK         ++REEA
Subjt:  LQRTISDISSE------LTKEAVAFAGK---LSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEK--------GRKREEA

Query:  LKEHMSACAKFNRIGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV
        +K HMSAC++FNR+GR+YPVLYQAEA+K++LKK S +     V A     G + R+SSC+PA+ +++ D T+V
Subjt:  LKEHMSACAKFNRIGRAYPVLYQAEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAGGGCCTTCAACGAACCATATCCGACATCTCCTCCGAGCTAACCAAAGAAGCTGTTGCATTCGCCGGAAAGCTGTCGCTACCGGTCATTTCTGAGGTGGAGGC
CGCTGCCTGCGAGTGCTGTGGCCTCTCGGAGGACTGCACGGCGGAGTACATCAGCCGTGTGCGAGACAAGTTCATGGGGAAGCTGATTTGTGGCCTCTGCGCTGAGGCTG
TCAATGAGGGGATGGAGAAGGGGCGAAAAAGGGAGGAGGCTTTGAAGGAGCACATGAGCGCTTGTGCGAAGTTCAACAGGATCGGTAGGGCGTATCCTGTGCTGTATCAG
GCAGAGGCCATTAAAAAGATTCTGAAGAAAAGCTCCACCAGAGAAACCTCCACCGCCGTCGCCGCCGCCGCCCACCGGAACGGCAGGATTGGCCGGACCTCCAGTTGCAT
TCCGGCCATCACCAGAGACGTCTGTGATCCAACCATGGTCAAATGA
mRNA sequenceShow/hide mRNA sequence
CAGATCAGAAACAAAGGTGTGGAAGAACAAAAGCCATCAATGGATCAGGGCCTTCAACGAACCATATCCGACATCTCCTCCGAGCTAACCAAAGAAGCTGTTGCATTCGC
CGGAAAGCTGTCGCTACCGGTCATTTCTGAGGTGGAGGCCGCTGCCTGCGAGTGCTGTGGCCTCTCGGAGGACTGCACGGCGGAGTACATCAGCCGTGTGCGAGACAAGT
TCATGGGGAAGCTGATTTGTGGCCTCTGCGCTGAGGCTGTCAATGAGGGGATGGAGAAGGGGCGAAAAAGGGAGGAGGCTTTGAAGGAGCACATGAGCGCTTGTGCGAAG
TTCAACAGGATCGGTAGGGCGTATCCTGTGCTGTATCAGGCAGAGGCCATTAAAAAGATTCTGAAGAAAAGCTCCACCAGAGAAACCTCCACCGCCGTCGCCGCCGCCGC
CCACCGGAACGGCAGGATTGGCCGGACCTCCAGTTGCATTCCGGCCATCACCAGAGACGTCTGTGATCCAACCATGGTCAAATGAAGTGATTATTATGCCACCCGTACGT
TTAGAGACACTCTTATCCTTTCATTTTAAAATCGTTTTTTAATGTGGGTATATATGTGAATTTTTAGATGTATTAATATTTATGTATATTAAAGTGAGGACGAGGACGTA
TATGAATGGAGGGACAAAAGTACGTCGAATTTATATTCAATTCAAGGGTTTTTAGAGAGTATGAAAATTGTTGGATCCACATCCAACGATTATTATGAGATAGCTAACAC
TG
Protein sequenceShow/hide protein sequence
MDQGLQRTISDISSELTKEAVAFAGKLSLPVISEVEAAACECCGLSEDCTAEYISRVRDKFMGKLICGLCAEAVNEGMEKGRKREEALKEHMSACAKFNRIGRAYPVLYQ
AEAIKKILKKSSTRETSTAVAAAAHRNGRIGRTSSCIPAITRDVCDPTMVK