; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001365 (gene) of Snake gourd v1 genome

Gene IDTan0001365
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationLG02:8548588..8549292
RNA-Seq ExpressionTan0001365
SyntenyTan0001365
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia]1.0e-6163.07Show/hide
Query:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ
        RRR R + +T  IEPPYPWS +QRA IH L+YL+S NIVT+ GDV+C +C+  Y +EY++M KF+E+A +IE  RD +HDRAP  W  P+LPNC  C ++
Subjt:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ

Query:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH
        NCVEP+IP+ +      RINWLFLLLGQ++GRLKL QLKYFC HT NHRTGAKDRLI++TYL+LCKQLQP NRLF+
Subjt:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma]5.9e-6263.07Show/hide
Query:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ
        RRR R + +T  IEPPYPWS +QRA IH L+YL+S NIVT+ GDV+C +C+  Y +EY++M KF+E+A +IE  RD +HDRAP  W  P+LPNC  C ++
Subjt:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ

Query:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH
        NCVEP+IP+ +      RINWLFLLLGQ++GRLKL QLKYFC HT NHRTGAKDRLI++TYL+LCKQLQP NRLF+
Subjt:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH

XP_022135938.1 probable serine/threonine-protein kinase samkC [Momordica charantia]1.2e-6555.79Show/hide
Query:  PLPLRPQPRPPPPLPTPALIP-------HPSSPKLSRTKKDIQHPN--------PQSRRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTG
        P PL+PQP+P PP+P P+           PS  + S  +    HP+        P   RR R KP   AIEPPYPWST  RAV+H+L YL+   I+T+TG
Subjt:  PLPLRPQPRPPPPLPTPALIP-------HPSSPKLSRTKKDIQHPN--------PQSRRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTG

Query:  DVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIP----ETDHKRINWLFLLLGQMLGRLKLGQLKYFCT
        DVKCS+C+++Y++EYD++ KF+E+AS+IE N+DTLHDRAPSSW  P LPNC  C Q++C+ P+IP    + D+K INWLFLLLGQM+G L L  LKYFCT
Subjt:  DVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIP----ETDHKRINWLFLLLGQMLGRLKLGQLKYFCT

Query:  HTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH
        +T NHRT AKDRL+Y+TYLSLCKQLQP   LFH
Subjt:  HTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH

XP_022953023.1 mucin-16-like [Cucurbita moschata]1.0e-6163.07Show/hide
Query:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ
        RRR R + +T  IEPPYPWS +QRA IH L+YL+S NIVT+ GDV+C +C+  Y +EY++M KF+E+A +IE  RD +HDRAP  W  P+LPNC  C ++
Subjt:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ

Query:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH
        NCVEP+IP+ +      RINWLFLLLGQ++GRLKL QLKYFC HT NHRTGAKDRLI++TYL+LCKQLQP NRLF+
Subjt:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]5.4e-6353.54Show/hide
Query:  NLELSLR-PPLPLRPQPRPPPPLPTPALIPHPSSP---------------KLSRTKKDI--QHPN--------------PQSR-RRKRKKPNTMAIEPPY
        NLELSLR P  P    P PPPP P P   P P SP               K     ++I  QH N              PQ R RR+R + +   IEPPY
Subjt:  NLELSLR-PPLPLRPQPRPPPPLPTPALIPHPSSP---------------KLSRTKKDI--QHPN--------------PQSR-RRKRKKPNTMAIEPPY

Query:  PWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRINW
        PWST +RAVIHEL YL+S NIVT+ G+VKC +C++KY MEYD+M KFNE+A +IE  +D++HDRAP  W +P+LPNC LCN++ CVEP+I E D+ +INW
Subjt:  PWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRINW

Query:  LFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLF
        LFLLLG+ LG LKL QLKYFC  T  HRTGAK+RL+Y+ YL+LC QLQP N LF
Subjt:  LFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLF

TrEMBL top hitse value%identityAlignment
A0A0A0LAK2 Uncharacterized protein2.0e-5548.61Show/hide
Query:  LNLELSLRPPLPLRPQPRPPP----PLP---------------TPALIP-------------HPSSPKLSRTKKDIQHPNPQSRR--RKRKKPNTMAIEP
        L+L LSL PP P  P P+  P    PLP               +P L+P             + +  ++  TK+    PN +  R  R RK+ +T  IEP
Subjt:  LNLELSLRPPLPLRPQPRPPP----PLP---------------TPALIP-------------HPSSPKLSRTKKDIQHPNPQSRR--RKRKKPNTMAIEP

Query:  PYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRI
        PYPWST+Q A+IH+L+YL++ NI T+ G+VKC RCK K  +EYD+M KF E+  +IE  +  +HDRAP+ W  P L NC  CN++ CVEP+IP  +  +I
Subjt:  PYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRI

Query:  NWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQP
        NWLFLLLG  LGRLKL QLK+FCT TK HRTGAKDRL+Y TY  LCKQLQP
Subjt:  NWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQP

A0A6J1C462 uncharacterized protein LOC1110077684.3e-5846.86Show/hide
Query:  LNLELSLRPPL--------------------PLRPQPR----PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSR------------------------
        L++ELSLRPP                     PL PQP       P   T   I H S+   +     ++ P  ++R                        
Subjt:  LNLELSLRPPL--------------------PLRPQPR----PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSR------------------------

Query:  RRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQN
        RR R +     I+PPYPWST+ +AV+H+L+YLR   I+T+TGDV+C RC+++Y +EYD+M KF E+AS+IE N+ TLHDRAP SW  P   +C LC ++N
Subjt:  RRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQN

Query:  CVEPLIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH
        CV P IPE D+K INWLFLLLGQM+GRLKL  LKYFC +T NHRTGAK+RL+Y+TYL+LCKQLQP   LFH
Subjt:  CVEPLIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH

A0A6J1C690 probable serine/threonine-protein kinase samkC5.6e-6655.79Show/hide
Query:  PLPLRPQPRPPPPLPTPALIP-------HPSSPKLSRTKKDIQHPN--------PQSRRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTG
        P PL+PQP+P PP+P P+           PS  + S  +    HP+        P   RR R KP   AIEPPYPWST  RAV+H+L YL+   I+T+TG
Subjt:  PLPLRPQPRPPPPLPTPALIP-------HPSSPKLSRTKKDIQHPN--------PQSRRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTG

Query:  DVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIP----ETDHKRINWLFLLLGQMLGRLKLGQLKYFCT
        DVKCS+C+++Y++EYD++ KF+E+AS+IE N+DTLHDRAPSSW  P LPNC  C Q++C+ P+IP    + D+K INWLFLLLGQM+G L L  LKYFCT
Subjt:  DVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIP----ETDHKRINWLFLLLGQMLGRLKLGQLKYFCT

Query:  HTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH
        +T NHRT AKDRL+Y+TYLSLCKQLQP   LFH
Subjt:  HTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH

A0A6J1GM83 mucin-16-like4.9e-6263.07Show/hide
Query:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ
        RRR R + +T  IEPPYPWS +QRA IH L+YL+S NIVT+ GDV+C +C+  Y +EY++M KF+E+A +IE  RD +HDRAP  W  P+LPNC  C ++
Subjt:  RRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQ

Query:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH
        NCVEP+IP+ +      RINWLFLLLGQ++GRLKL QLKYFC HT NHRTGAKDRLI++TYL+LCKQLQP NRLF+
Subjt:  NCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLFH

A0A6J1I8I0 uncharacterized protein KIAA0754-like5.1e-5954.29Show/hide
Query:  PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSRRRK-RKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNE
        P  P+  P  +    +   +  +    H   + RRR+ R + +T  IEPPYPWS +QRA IH L+YL+S NIV + GDV+C +C+  Y +EY++M KF+E
Subjt:  PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSRRRK-RKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNE

Query:  LASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCK
        +A +IE  RD +HDRAP  W  P+LPNC  C ++NCVEP+IP+ +     +RINWLFLLLGQ++GRLKL QLKYFC HT NHRTGAKDRLI++TYL+LCK
Subjt:  LASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETD----HKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCK

Query:  QLQPFNRLFH
        QLQP NRLF+
Subjt:  QLQPFNRLFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein5.8e-3937.22Show/hide
Query:  LRPPLPLRPQPRPPPPL---PTPA-------LIPHPSSPK----LSRTKKDIQHPNP------------------------------------QSRRRKR
        ++ P+P+ P P  P P+   PTPA       ++P P  P     L  +    Q PNP                                    +SR    
Subjt:  LRPPLPLRPQPRPPPPL---PTPA-------LIPHPSSPK----LSRTKKDIQHPNP------------------------------------QSRRRKR

Query:  KKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEP
        KK +T  I PP+PW+T +R  I  L+YL S  I T+TG+V+C  C++ Y++ Y++ E+F E+  +    +  + DRA   W  P    C LC ++  V+P
Subjt:  KKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEP

Query:  LIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLF
        +I E    +INWLFLLLGQ LG   L QLK FC H+KNHRTGAKDR++Y+TY+ LCK LQP + LF
Subjt:  LIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSLCKQLQPFNRLF

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.9e-3741.44Show/hide
Query:  LSLRPPLPLRPQPR--PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSRRRKR-------KKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGD
        +S+R PLP +P     PPP L   A +   ++P+  R       P  Q+RR  +       +      I PPYPW+TK+   I     L S NI  ++G 
Subjt:  LSLRPPLPLRPQPR--PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSRRRKR-------KKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGD

Query:  VKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNH
        V C  C     +EY++ EKF+EL  YI++N++ +  RAP SW  P L  C  C  +  ++P++ E   + INWLFLLLGQMLG   L QL+YFC     H
Subjt:  VKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNH

Query:  RTGAKDRLIYVTYLSLCKQLQP
        RTG+KDR++Y+TYLSLCKQL P
Subjt:  RTGAKDRLIYVTYLSLCKQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown2.2e-2236.63Show/hide
Query:  LSLRPPLPLRPQPR--PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSRRRKR-------KKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGD
        +S+R PLP +P     PPP L   A +   ++P+  R       P  Q+RR  +       +      I PPYPW+TK+   I     L S NI  ++G 
Subjt:  LSLRPPLPLRPQPR--PPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSRRRKR-------KKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGD

Query:  VKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNH
        V C  C     +EY++ EKF+EL  YI++N++ +  RAP SW  P L  C  C  +  ++P++ E   + INWLFLLLGQMLG   L QL    +  K+H
Subjt:  VKCSRCKEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNH

Query:  RT
         T
Subjt:  RT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATACTCACCAAAGCCATGAGGGTCTCAATCTCGAACTCTCTCTCCGTCCACCGTTGCCTCTTCGGCCTCAGCCTCGGCCTCCGCCTCCACTGCCAACACCAGCATT
GATTCCTCATCCATCCTCTCCCAAGCTCTCTCGGACGAAGAAGGACATCCAACACCCTAATCCACAATCACGACGACGAAAGAGAAAGAAACCGAACACGATGGCGATCG
AGCCACCGTATCCATGGTCGACGAAGCAGCGAGCGGTAATCCACGAACTCGACTACCTTCGATCGAAGAACATCGTGACGGTGACCGGGGACGTGAAATGCAGCCGGTGC
AAGGAAAAGTACAGGATGGAGTACGATGTGATGGAGAAGTTCAATGAGTTAGCGAGTTATATAGAGATAAATAGGGATACTTTGCATGATAGAGCGCCGAGTTCATGGAT
GCAACCTGTTTTACCGAATTGCATGTTGTGTAATCAACAAAACTGCGTGGAGCCGCTGATTCCTGAAACTGATCATAAGAGAATCAATTGGTTGTTTTTGCTTTTGGGTC
AAATGCTTGGACGTTTGAAACTCGGACAACTCAAATACTTCTGCACTCACACTAAAAATCATCGAACTGGGGCCAAGGATCGCCTTATTTATGTCACTTATCTTTCTTTG
TGTAAGCAACTTCAACCCTTCAACAGGCTCTTCCATCATCGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATACTCACCAAAGCCATGAGGGTCTCAATCTCGAACTCTCTCTCCGTCCACCGTTGCCTCTTCGGCCTCAGCCTCGGCCTCCGCCTCCACTGCCAACACCAGCATT
GATTCCTCATCCATCCTCTCCCAAGCTCTCTCGGACGAAGAAGGACATCCAACACCCTAATCCACAATCACGACGACGAAAGAGAAAGAAACCGAACACGATGGCGATCG
AGCCACCGTATCCATGGTCGACGAAGCAGCGAGCGGTAATCCACGAACTCGACTACCTTCGATCGAAGAACATCGTGACGGTGACCGGGGACGTGAAATGCAGCCGGTGC
AAGGAAAAGTACAGGATGGAGTACGATGTGATGGAGAAGTTCAATGAGTTAGCGAGTTATATAGAGATAAATAGGGATACTTTGCATGATAGAGCGCCGAGTTCATGGAT
GCAACCTGTTTTACCGAATTGCATGTTGTGTAATCAACAAAACTGCGTGGAGCCGCTGATTCCTGAAACTGATCATAAGAGAATCAATTGGTTGTTTTTGCTTTTGGGTC
AAATGCTTGGACGTTTGAAACTCGGACAACTCAAATACTTCTGCACTCACACTAAAAATCATCGAACTGGGGCCAAGGATCGCCTTATTTATGTCACTTATCTTTCTTTG
TGTAAGCAACTTCAACCCTTCAACAGGCTCTTCCATCATCGCTGA
Protein sequenceShow/hide protein sequence
MDTHQSHEGLNLELSLRPPLPLRPQPRPPPPLPTPALIPHPSSPKLSRTKKDIQHPNPQSRRRKRKKPNTMAIEPPYPWSTKQRAVIHELDYLRSKNIVTVTGDVKCSRC
KEKYRMEYDVMEKFNELASYIEINRDTLHDRAPSSWMQPVLPNCMLCNQQNCVEPLIPETDHKRINWLFLLLGQMLGRLKLGQLKYFCTHTKNHRTGAKDRLIYVTYLSL
CKQLQPFNRLFHHR