CuGenDBv2

Tan0018861 (gene) of Snake gourd v1 genome

Gene ID: Tan0018861
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG05:77530557..77535195
RNA-Seq Expression: Tan0018861
Synteny: Tan0018861
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573362.1 hypothetical protein SDJN03_27249, partial [Cucurbita argyrosperma subsp. sororia]; e value 5.8e-51; % identity 55
Query:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA
        MNS+A L RGIAPT P     LPL+FRRNVSSS+ L  + NK  +  DAR+  NNG +I C  F+    S S   GFP  + K+A+QSLLC   D AAE 
Subjt:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA

Query:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL
        ETKDISA K++AL+ LLA NP+ AE IM  V + Y+  +   KYDATLA+V ILIH G  +SLG AI HLN IE   N PSD KR LYRAVIYTLLE + 
Subjt:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL

Query:  DAKNFWKNYTGTIGSGPPRR
        +AK  WK ++  IG GP  R
Subjt:  DAKNFWKNYTGTIGSGPPRR

XP_022955331.1 uncharacterized protein LOC111457324 [Cucurbita moschata]; e value 4.9e-50; % identity 55.45
Query:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA
        MNS+A L RGIAPT P     LPL+FRRNVSSS+ L  + N+  +  DAR+  NNG +I C  F+    S S   GFP  N K A+QSLLC   D AAE 
Subjt:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA

Query:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL
        ETKDISA K+KALQ LLA NP+ AE+IM NV + Y++ +   KY+A LA V ILIH G  +SLG AI HLN IEE  N PSD KR LYRAVIYTLL+ + 
Subjt:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL

Query:  DAKNFWKNYTGTIGSGPPRR
        +AK  WK ++  IG GP  R
Subjt:  DAKNFWKNYTGTIGSGPPRR

XP_022994895.1 uncharacterized protein LOC111490483 [Cucurbita maxima]; e value 7.8e-56; % identity 59.45
Query:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA
        MNS+A L RGIAPTLP     LPL+FRRN+SSS+ L  + N+  +  DAR+  NNG +I C ++++  SSGS   GFPF N K+A+QSLLCFS D AAE 
Subjt:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA

Query:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL
        ETKDISAQK+KALQ LLA NPAEAERIM  V +KY+     IKY+ATLA+V ILIH G  +SL  A+ +LN IE  +  PSD KR LYRAVIYTLLE ++
Subjt:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL

Query:  DAKNFWKNYTGTIGSGP
        DAK+ WK ++  IGSGP
Subjt:  DAKNFWKNYTGTIGSGP

XP_023542455.1 uncharacterized protein LOC111802357 [Cucurbita pepo subsp. pepo]; e value 8.3e-50; % identity 54.84
Query:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA
        MNS+A L RGIAPT P     LPL+FRRNVSSS+ L  + N+  +  DAR+  NNG +I C  F+    S S   GFP  N K+A+QSLLC   D AAE 
Subjt:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA

Query:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL
        ETKDISA K+KALQ LLA NP+ AE+IM  + ++Y+  +   KY+ATLA+V ILIH G  +SL  AI +LN +E   N PSDAKR LYRAVIYTLLE  +
Subjt:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL

Query:  DAKNFWKNYTGTIGSGP
        DAK  WK ++  IG GP
Subjt:  DAKNFWKNYTGTIGSGP

XP_038895250.1 uncharacterized protein LOC120083532 [Benincasa hispida]; e value 5.8e-51; % identity 53.91
Query:  MNSSAFLRRGIAPTLP-LPAAG-----LPLTFRRNVSSSIRLFTIENKQHDNDDAR--KFNNG--SQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLC
        M+S+ FLRRG +P  P LP        LPL+FR NV SSI L   ++ ++ N   R  K NNG    I C K     S+  GGG FP  NAKMALQSL C
Subjt:  MNSSAFLRRGIAPTLP-LPAAG-----LPLTFRRNVSSSIRLFTIENKQHDNDDAR--KFNNG--SQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLC

Query:  FSFDTAAEAETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQR-ASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRA
        FSFD A E  TKDI+ +KK+ALQ LLAQNP +AE+IM ++Y+KYQ+  +G I+YDATLA +++LIH+GT +SL  AI  LN IE   + PSDA+ +LYRA
Subjt:  FSFDTAAEAETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQR-ASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRA

Query:  VIYTLLEENLDAKNFWKNYTGTIGSGPPRR
        +IYTLL+E  DAK FW  YTG IG+GP  R
Subjt:  VIYTLLEENLDAKNFWKNYTGTIGSGPPRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXK7 Uncharacterized protein; e value 3.7e-27; % identity 39.15
Query:  MNSSAFLRRGIA--PTLPLPAAGLPLTFR----------RNVSSSIRLFTIENKQHDNDDAR-----KFNNGSQIRCAKFSKTRSSGSGGG-GFPFGNAK
        M+S+A L RG +  P  PLP        R           N  SS+ L  +  K +DN ++R     K NN   I+C   S +  SGS  G  FP  NA+
Subjt:  MNSSAFLRRGIA--PTLPLPAAGLPLTFR----------RNVSSSIRLFTIENKQHDNDDAR-----KFNNGSQIRCAKFSKTRSSGSGGG-GFPFGNAK

Query:  MALQSLLCFSFDTAAEAETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASG-TIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSD
        +AL+SLLCFS+ +     T  ++ QK+KAL  LL QNP EAE+I+  V  KY+  S   I+Y+A +A+++ILIH+GT +SL  A+   + I   +  PSD
Subjt:  MALQSLLCFSFDTAAEAETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASG-TIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSD

Query:  AKRTLYRAVIYTLLEENLDAKNFWKNYTGTIGSGP
        A   LY AVI TL+ +N +  + WK Y   I S P
Subjt:  AKRTLYRAVIYTLLEENLDAKNFWKNYTGTIGSGP

A0A1S3BA19 uncharacterized protein LOC103487676; e value 8.2e-19; % identity 38.17
Query:  MNSSAFLRRGIA-------PTLPLPAAG---LP-LTFRRNVSSSIRLFTIENKQHDND--------DARKFNNG-SQIRCAKFSKTRSSGSGGGGFPFGN
        M+S+A L RG +       PT     AG   LP LTFR NVSS   +  +  K +DN         + +  NNG   I+C + S+          FP  N
Subjt:  MNSSAFLRRGIA-------PTLPLPAAG---LP-LTFRRNVSSSIRLFTIENKQHDND--------DARKFNNG-SQIRCAKFSKTRSSGSGGGGFPFGN

Query:  AKMALQSLLCFSFDTAAEAETKDISAQKKKALQTLLAQNPAE----AERIMTNVYKKYQR-ASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEET
        A++ALQSLLCFS     + +T  I+  K  ALQ LL + P E    A  IM  VY+KY+   +  I+Y+A +A ++ILIH+GT +S   A   L  +E+ 
Subjt:  AKMALQSLLCFSFDTAAEAETKDISAQKKKALQTLLAQNPAE----AERIMTNVYKKYQR-ASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEET

Query:  QNLPSDAKRTLYRAVIYTLLEENLDAKNFWKNYTGTIGSGP
        Q  PSDA   LY+AVI TLL      K  W NYT      P
Subjt:  QNLPSDAKRTLYRAVIYTLLEENLDAKNFWKNYTGTIGSGP

A0A6J1CHE7 uncharacterized protein LOC111010906; e value 1.0e-32; % identity 46.64
Query:  SAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARKFNNGSQIRCAKFSKTRSSGSG-GGGFPFGNAKMALQSLLCFSFDT---AAEA
        + FLRRGIAPT PLP A       RNVS         N +H N   R     + IRC + ++ R SGS   GGFP  N   AL++LLCF+  T   AA A
Subjt:  SAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARKFNNGSQIRCAKFSKTRSSGSG-GGGFPFGNAKMALQSLLCFSFDT---AAEA

Query:  E-TKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEE---TQNLPSDAKRTLYRAVIYTLL
        E    I+ +KK+ALQ L+A+N  EAE IM  VY++Y+  +   +YDA LALVE LIH GT +S   A+GHLN +E     +NLPSDAK  LY A++ TLL
Subjt:  E-TKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEE---TQNLPSDAKRTLYRAVIYTLL

Query:  EENLDAKNFWKNYTGTIGSGPPR
        ++  +AK  W  YT  IG+GP R
Subjt:  EENLDAKNFWKNYTGTIGSGPPR

A0A6J1GTH1 uncharacterized protein LOC111457324; e value 2.4e-50; % identity 55.45
Query:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA
        MNS+A L RGIAPT P     LPL+FRRNVSSS+ L  + N+  +  DAR+  NNG +I C  F+    S S   GFP  N K A+QSLLC   D AAE 
Subjt:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA

Query:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL
        ETKDISA K+KALQ LLA NP+ AE+IM NV + Y++ +   KY+A LA V ILIH G  +SLG AI HLN IEE  N PSD KR LYRAVIYTLL+ + 
Subjt:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL

Query:  DAKNFWKNYTGTIGSGPPRR
        +AK  WK ++  IG GP  R
Subjt:  DAKNFWKNYTGTIGSGPPRR

A0A6J1K2L3 uncharacterized protein LOC111490483; e value 3.8e-56; % identity 59.45
Query:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA
        MNS+A L RGIAPTLP     LPL+FRRN+SSS+ L  + N+  +  DAR+  NNG +I C ++++  SSGS   GFPF N K+A+QSLLCFS D AAE 
Subjt:  MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARK-FNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEA

Query:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL
        ETKDISAQK+KALQ LLA NPAEAERIM  V +KY+     IKY+ATLA+V ILIH G  +SL  A+ +LN IE  +  PSD KR LYRAVIYTLLE ++
Subjt:  ETKDISAQKKKALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENL

Query:  DAKNFWKNYTGTIGSGP
        DAK+ WK ++  IGSGP
Subjt:  DAKNFWKNYTGTIGSGP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G34540.2 unknown protein; e value 2.1e-06; % identity 28.5
Query:  SIRLFTIENKQHDNDDARKFNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAE-----------------AETKDISAQKKKALQTL
        SI  F I N  +    A   ++   I   K +   S+ SG   FP   AK AL+SL   S   A+                      DI + K +A++ +
Subjt:  SIRLFTIENKQHDNDDARKFNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAE-----------------AETKDISAQKKKALQTL

Query:  LAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENLDAKNFWKNYTGTIGSG
              EA +++ +   +Y R      ++  +ALVEILI +   Q   A    LN  +E   + SD +  LY+A+IYT+L+++ +AK  WK +  +IG G
Subjt:  LAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENLDAKNFWKNYTGTIGSG


Sequences
CDS sequence
ATGAACTCATCCGCCTTCCTCCGCCGTGGAATCGCCCCAACGCTGCCACTCCCGGCGGCGGGACTTCCATTGACATTCCGTCGCAATGTGTCGTCTTCGATTCGACTCTT
CACAATCGAGAACAAGCAGCACGACAACGACGACGCTCGGAAGTTTAATAATGGCAGCCAAATACGGTGCGCCAAGTTTTCCAAAACACGGTCGTCCGGTTCTGGTGGCG
GAGGCTTTCCTTTTGGAAATGCTAAAATGGCGTTGCAGTCGCTGTTGTGCTTCTCCTTCGATACGGCGGCCGAGGCTGAAACCAAAGATATAAGTGCTCAAAAGAAAAAA
GCATTACAAACATTGTTAGCTCAGAATCCTGCAGAAGCAGAGAGGATAATGACGAACGTGTACAAGAAATACCAGAGAGCCTCTGGGACGATTAAATATGATGCTACTTT
GGCTCTCGTTGAAATTCTCATCCACATAGGAACGACTCAAAGCTTGGGTGCTGCTATAGGCCATCTGAACGTCATTGAAGAGACCCAGAATCTTCCAAGTGATGCAAAGC
GTACCCTTTACAGAGCTGTTATATATACCTTATTAGAAGAAAATCTTGACGCTAAAAATTTTTGGAAAAATTACACCGGCACTATAGGCAGTGGCCCTCCAAGGCGTTGA
mRNA sequence
CACCATTGTCATGAACTCATCCGCCTTCCTCCGCCGTGGAATCGCCCCAACGCTGCCACTCCCGGCGGCGGGACTTCCATTGACATTCCGTCGCAATGTGTCGTCTTCGA
TTCGACTCTTCACAATCGAGAACAAGCAGCACGACAACGACGACGCTCGGAAGTTTAATAATGGCAGCCAAATACGGTGCGCCAAGTTTTCCAAAACACGGTCGTCCGGT
TCTGGTGGCGGAGGCTTTCCTTTTGGAAATGCTAAAATGGCGTTGCAGTCGCTGTTGTGCTTCTCCTTCGATACGGCGGCCGAGGCTGAAACCAAAGATATAAGTGCTCA
AAAGAAAAAAGCATTACAAACATTGTTAGCTCAGAATCCTGCAGAAGCAGAGAGGATAATGACGAACGTGTACAAGAAATACCAGAGAGCCTCTGGGACGATTAAATATG
ATGCTACTTTGGCTCTCGTTGAAATTCTCATCCACATAGGAACGACTCAAAGCTTGGGTGCTGCTATAGGCCATCTGAACGTCATTGAAGAGACCCAGAATCTTCCAAGT
GATGCAAAGCGTACCCTTTACAGAGCTGTTATATATACCTTATTAGAAGAAAATCTTGACGCTAAAAATTTTTGGAAAAATTACACCGGCACTATAGGCAGTGGCCCTCC
AAGGCGTTGATAACATTATTGTACTATCCTTTTATATGGGGGGGGGAAAATATATTATATTTTTCACTCTCTTTCATTATTATTTATTATTTGTGGTACTTATTTATGTA
CCCTAAAAGTTTTTCATTTGAGTCCCTGTACTTAAGTTAAATTTGGGATCATTTCTCCAAATTTATCTTTGTAATTCAAATACTTGGTGTAAGAAC
Protein sequence
MNSSAFLRRGIAPTLPLPAAGLPLTFRRNVSSSIRLFTIENKQHDNDDARKFNNGSQIRCAKFSKTRSSGSGGGGFPFGNAKMALQSLLCFSFDTAAEAETKDISAQKKK
ALQTLLAQNPAEAERIMTNVYKKYQRASGTIKYDATLALVEILIHIGTTQSLGAAIGHLNVIEETQNLPSDAKRTLYRAVIYTLLEENLDAKNFWKNYTGTIGSGPPRR
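
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name translate and the decision to stop at the first stop codon are illustrative assumptions, and it assumes the record uses the standard nuclear code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper), stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS listed above
print(translate("ATGAACTCATCCGCCTTCCTCCGCCGTGGAATCGCC"))  # MNSSAFLRRGIA
```

To check the whole record, join the CDS lines into one string, pass it to translate, and compare the result with the joined protein sequence lines.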