; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004629 (gene) of Snake gourd v1 genome

Gene IDTan0004629
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationLG02:8530426..8531097
RNA-Seq ExpressionTan0004629
SyntenyTan0004629
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa]4.5e-7569.86Show/hide
Query:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR
        T+  LDLQLSLRPPS       +P  +     N+L + +ITR  GT RSSLRRCNSR  +  E I PPY WSTN RA+V TLN L++++IL ITGDVRCR
Subjt:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR

Query:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK
        +CQ EY IEYD+VSKF+EIA FVE+NKN FRDRAP SWMNPNYP CRFCG ENG RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYTNNHRTGAK
Subjt:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK

Query:  NRLLFLTYI
        NRLL+LT I
Subjt:  NRLLFLTYI

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo]4.8e-8571.43Show/hide
Query:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR
        T+  LDLQLSLRPPS       +P  +     N+L + +ITR  GT RSSLRRCNSR  +  E I PPY WSTN RA+V TLN L++++IL ITGDVRCR
Subjt:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR

Query:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK
        +CQ EY IEYD+VSKF+EIA FVE+NKN FRDRAP SWMNPNYP CRFCG ENG RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYTNNHRTGAK
Subjt:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK

Query:  NRLLFLTYITLCHQVDPSGRFRRM
        NRLL+LTYITLCHQVDPSGRF R+
Subjt:  NRLLFLTYITLCHQVDPSGRFRRM

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus]6.2e-8570.31Show/hide
Query:  NQSNETHDDLDLQLSLRP--------PSAAPTGLV--NSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITG
        NQ NE H+ LDL+LSLRP        PSAAP G    N++ +M++TR+ GT RSS +RCNSR  +  E I PPY WSTN RA+V TLN LK+N+IL ITG
Subjt:  NQSNETHDDLDLQLSLRP--------PSAAPTGLV--NSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITG

Query:  DVRCRRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNH
        DV+CR+CQ EY IEYD+ SKF+EIA FVE+NKNSFRDRAP SWMNPNYP CRFCG ENG RPVIP++ R+INWLFLLLG+MLG LNLNHLKYFCS T NH
Subjt:  DVRCRRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNH

Query:  RTGAKNRLLFLTYITLCHQVDPSGRFRRM
        RTGAKNRLL+LTYITLCHQVDPSGRF R+
Subjt:  RTGAKNRLLFLTYITLCHQVDPSGRFRRM

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]1.3e-6657.66Show/hide
Query:  LDLQLSLRPPSAAPTGL----------------VNSLASMKITRTYGTPRSSLR-RCNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRC
        +DL+LSL  PSAA                    ++ L+S++     G  ++SLR R ++     PI PPY WST+  AVVHTL+YL +N+IL+ITG+V+C
Subjt:  LDLQLSLRPPSAAPTGL----------------VNSLASMKITRTYGTPRSSLR-RCNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRC

Query:  RRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGA
        ++C+R Y+IEYDVVSKF+EI  FVE N  SFRDRAP  WM PNYP CRFCG E GV+PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYT NHRTG+
Subjt:  RRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGA

Query:  KNRLLFLTYITLCHQVDPSGRF
        K+RL++LTYITLC Q+DPSGRF
Subjt:  KNRLLFLTYITLCHQVDPSGRF

XP_022972401.1 uncharacterized protein LOC111470968 [Cucurbita maxima]1.0e-6662.5Show/hide
Query:  SAAPTGLVNSLASMKITRTYGTPRSSLRR--CNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEIARFVE
        +AA    ++ L+S++     G  ++SLRR  CNS     PI PPY WST+  AVVHTL+YL  N+IL+ITGDV+C++C+R Y+IEY+VVSKF+EI  FVE
Subjt:  SAAPTGLVNSLASMKITRTYGTPRSSLRR--CNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEIARFVE

Query:  KNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDPSGRFRRM
         N  SFRDRAP  WM PNYP CRFCG E GV+PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYT NHRTG+K+RL++LTYITLC Q+DPSGRF R+
Subjt:  KNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDPSGRFRRM

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein3.0e-8570.31Show/hide
Query:  NQSNETHDDLDLQLSLRP--------PSAAPTGLV--NSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITG
        NQ NE H+ LDL+LSLRP        PSAAP G    N++ +M++TR+ GT RSS +RCNSR  +  E I PPY WSTN RA+V TLN LK+N+IL ITG
Subjt:  NQSNETHDDLDLQLSLRP--------PSAAPTGLV--NSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITG

Query:  DVRCRRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNH
        DV+CR+CQ EY IEYD+ SKF+EIA FVE+NKNSFRDRAP SWMNPNYP CRFCG ENG RPVIP++ R+INWLFLLLG+MLG LNLNHLKYFCS T NH
Subjt:  DVRCRRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNH

Query:  RTGAKNRLLFLTYITLCHQVDPSGRFRRM
        RTGAKNRLL+LTYITLCHQVDPSGRF R+
Subjt:  RTGAKNRLLFLTYITLCHQVDPSGRFRRM

A0A1S3BHR1 uncharacterized protein LOC1034897702.3e-8571.43Show/hide
Query:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR
        T+  LDLQLSLRPPS       +P  +     N+L + +ITR  GT RSSLRRCNSR  +  E I PPY WSTN RA+V TLN L++++IL ITGDVRCR
Subjt:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR

Query:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK
        +CQ EY IEYD+VSKF+EIA FVE+NKN FRDRAP SWMNPNYP CRFCG ENG RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYTNNHRTGAK
Subjt:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK

Query:  NRLLFLTYITLCHQVDPSGRFRRM
        NRLL+LTYITLCHQVDPSGRF R+
Subjt:  NRLLFLTYITLCHQVDPSGRFRRM

A0A5A7T547 Uncharacterized protein2.2e-7569.86Show/hide
Query:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR
        T+  LDLQLSLRPPS       +P  +     N+L + +ITR  GT RSSLRRCNSR  +  E I PPY WSTN RA+V TLN L++++IL ITGDVRCR
Subjt:  THDDLDLQLSLRPPSA------APTGL----VNSLASMKITRTYGTPRSSLRRCNSR--KRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCR

Query:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK
        +CQ EY IEYD+VSKF+EIA FVE+NKN FRDRAP SWMNPNYP CRFCG ENG RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYTNNHRTGAK
Subjt:  RCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAK

Query:  NRLLFLTYI
        NRLL+LT I
Subjt:  NRLLFLTYI

A0A6J1GLD4 uncharacterized protein LOC1114553886.3e-6757.66Show/hide
Query:  LDLQLSLRPPSAAPTGL----------------VNSLASMKITRTYGTPRSSLR-RCNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRC
        +DL+LSL  PSAA                    ++ L+S++     G  ++SLR R ++     PI PPY WST+  AVVHTL+YL +N+IL+ITG+V+C
Subjt:  LDLQLSLRPPSAAPTGL----------------VNSLASMKITRTYGTPRSSLR-RCNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRC

Query:  RRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGA
        ++C+R Y+IEYDVVSKF+EI  FVE N  SFRDRAP  WM PNYP CRFCG E GV+PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYT NHRTG+
Subjt:  RRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGA

Query:  KNRLLFLTYITLCHQVDPSGRF
        K+RL++LTYITLC Q+DPSGRF
Subjt:  KNRLLFLTYITLCHQVDPSGRF

A0A6J1I5V9 uncharacterized protein LOC1114709684.8e-6762.5Show/hide
Query:  SAAPTGLVNSLASMKITRTYGTPRSSLRR--CNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEIARFVE
        +AA    ++ L+S++     G  ++SLRR  CNS     PI PPY WST+  AVVHTL+YL  N+IL+ITGDV+C++C+R Y+IEY+VVSKF+EI  FVE
Subjt:  SAAPTGLVNSLASMKITRTYGTPRSSLRR--CNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEIARFVE

Query:  KNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDPSGRFRRM
         N  SFRDRAP  WM PNYP CRFCG E GV+PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYT NHRTG+K+RL++LTYITLC Q+DPSGRF R+
Subjt:  KNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDPSGRFRRM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein2.5e-4445.61Show/hide
Query:  GTPRSSLRRCNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMC
        G+ R    R    K+ + I PP+ W+TN R  + +L YL++N+I +ITG+V+CR C++ Y++ Y++  +F E+ +F    K   RDRA   W  P    C
Subjt:  GTPRSSLRRCNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMC

Query:  RFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDP
          CG E  V+PVI E K +INWLFLLLGQ LGF  L  LK FC ++ NHRTGAK+R+L+LTY+ LC  + P
Subjt:  RFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)7.2e-3939.11Show/hide
Query:  PTGLVNSLASMKI-TRTYGTPRSSLRRCNSRKRI---------EPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEI
        P   +N +A++ + T   G P     R NS++ +           I PPY W+T     + +   L +N I  I+G V C+ C R   +EY++  KF E+
Subjt:  PTGLVNSLASMKI-TRTYGTPRSSLRRCNSRKRI---------EPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEI

Query:  ARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDPSG
          +++ NK   R RAP SW  P    CR C  E  ++PV+ E K  INWLFLLLGQMLG   L+ L+YFC   + HRTG+K+R++++TY++LC Q+DP G
Subjt:  ARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDPSG

Query:  RF
         F
Subjt:  RF

AT2G16190.2 FUNCTIONS IN: molecular_function unknown2.9e-2436.75Show/hide
Query:  PTGLVNSLASMKI-TRTYGTPRSSLRRCNSRKRI---------EPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEI
        P   +N +A++ + T   G P     R NS++ +           I PPY W+T     + +   L +N I  I+G V C+ C R   +EY++  KF E+
Subjt:  PTGLVNSLASMKI-TRTYGTPRSSLRRCNSRKRI---------EPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYDVVSKFDEI

Query:  ARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHL
          +++ NK   R RAP SW  P    CR C  E  ++PV+ E K  INWLFLLLGQMLG   L+ L
Subjt:  ARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATCACTCCATAAACCAAAGCAATGAAACTCATGACGACCTCGACCTGCAACTCTCCCTCCGGCCGCCGTCCGCCGCCCCCACCGGCCTTGTGAATTCACTAGC
CTCAATGAAAATCACTCGCACTTATGGAACTCCTCGATCATCTCTCCGTCGCTGCAATTCCCGAAAGCGAATAGAACCGATCCCGCCGCCATATCAATGGTCGACGAACT
GCCGAGCGGTGGTCCACACCCTAAACTACCTCAAAGCAAATGAAATCCTCAGCATCACCGGCGACGTCCGGTGCCGACGGTGCCAGAGAGAGTACAAGATCGAATACGAT
GTCGTTTCGAAGTTCGACGAGATTGCGAGGTTCGTAGAGAAGAACAAGAATTCTTTTCGCGACCGAGCACCGAGCTCGTGGATGAACCCTAATTATCCAATGTGTAGATT
TTGTGGACTTGAAAATGGAGTGAGGCCGGTGATTCCAGAGGAAAAGAGAAGGATCAATTGGCTGTTCTTGCTTTTGGGACAAATGCTAGGGTTTCTGAATCTCAATCATC
TAAAATACTTCTGCAGTTACACGAACAATCATCGAACTGGTGCTAAGAATCGTCTTCTTTTTCTCACTTATATCACTTTGTGCCACCAAGTTGATCCTTCTGGCCGTTTC
CGTCGAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATCACTCCATAAACCAAAGCAATGAAACTCATGACGACCTCGACCTGCAACTCTCCCTCCGGCCGCCGTCCGCCGCCCCCACCGGCCTTGTGAATTCACTAGC
CTCAATGAAAATCACTCGCACTTATGGAACTCCTCGATCATCTCTCCGTCGCTGCAATTCCCGAAAGCGAATAGAACCGATCCCGCCGCCATATCAATGGTCGACGAACT
GCCGAGCGGTGGTCCACACCCTAAACTACCTCAAAGCAAATGAAATCCTCAGCATCACCGGCGACGTCCGGTGCCGACGGTGCCAGAGAGAGTACAAGATCGAATACGAT
GTCGTTTCGAAGTTCGACGAGATTGCGAGGTTCGTAGAGAAGAACAAGAATTCTTTTCGCGACCGAGCACCGAGCTCGTGGATGAACCCTAATTATCCAATGTGTAGATT
TTGTGGACTTGAAAATGGAGTGAGGCCGGTGATTCCAGAGGAAAAGAGAAGGATCAATTGGCTGTTCTTGCTTTTGGGACAAATGCTAGGGTTTCTGAATCTCAATCATC
TAAAATACTTCTGCAGTTACACGAACAATCATCGAACTGGTGCTAAGAATCGTCTTCTTTTTCTCACTTATATCACTTTGTGCCACCAAGTTGATCCTTCTGGCCGTTTC
CGTCGAATGTGA
Protein sequenceShow/hide protein sequence
MENHSINQSNETHDDLDLQLSLRPPSAAPTGLVNSLASMKITRTYGTPRSSLRRCNSRKRIEPIPPPYQWSTNCRAVVHTLNYLKANEILSITGDVRCRRCQREYKIEYD
VVSKFDEIARFVEKNKNSFRDRAPSSWMNPNYPMCRFCGLENGVRPVIPEEKRRINWLFLLLGQMLGFLNLNHLKYFCSYTNNHRTGAKNRLLFLTYITLCHQVDPSGRF
RRM