; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001111 (gene) of Snake gourd v1 genome

Gene IDTan0001111
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPolyprotein
Genome locationLG09:59392096..59393279
RNA-Seq ExpressionTan0001111
SyntenyTan0001111
Gene Ontology termsNA
InterPro domainsIPR028919 - Viral movement protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051217.1 polyprotein [Cucumis melo var. makuwa]6.5e-6772.83Show/hide
Query:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV
        C NGS+AF LI +NP +   Q K+     N+GMIQIGVKTLT KI SNASIILCVFDTRND FEDSILG+VE+ LSDGP++FN+FPNITM  FHPKL E 
Subjt:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV

Query:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK
        L LI +V+GFEQLP GT PISLMWRTCYKLQGS  P AL+ESPQGKTVFFQTDFE+S VAVQKVS+WD+V+ K
Subjt:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK

KAG6588194.1 hypothetical protein SDJN03_16759, partial [Cucurbita argyrosperma subsp. sororia]1.2e-7660.56Show/hide
Query:  MATFLKPCWSSNP--GGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNV
        MATFLKPCWSS    GG++HALD+EE+IKKGNN+LNWKIPK+PT+KIYK  PF+F SDPSIKTKQVE++CGNG++A +LI E+PF+  L+ K+ Y  FNV
Subjt:  MATFLKPCWSSNP--GGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNV

Query:  GMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQ
        GMIQIG KTLT KIP NA+I LCVFDTR +KFEDSILGMVE+ L                                         TKPI L WRTCYKLQ
Subjt:  GMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQ

Query:  GSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLNLFLSEK
         S L +AL+ES  G+TVFFQTDFE+S+VAV KVS WDDVL KVL+LF SEK
Subjt:  GSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLNLFLSEK

KGN66494.1 hypothetical protein Csa_006902 [Cucumis sativus]1.8e-9672.02Show/hide
Query:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGM
        M+TF   CWSSN GG  H+LDSEEYIKKG N+L WKIPKIPTTKIYK  PF FFSDP IKTK+  + C NGS+ F LIS NP +   Q K+ Y + N+GM
Subjt:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGM

Query:  IQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGS
        IQIGVKTLT KIPSNASIILCVFDTRND FEDSILG+VES L DGP++FN+FPNITMP FHPKL E   LI +V+GFEQLP GT PISLMWRTCYKLQ S
Subjt:  IQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGS

Query:  DLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLN
         LP AL+ESPQGKTVFFQT+FE+S VA QKVS+WD+V+ KV N
Subjt:  DLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLN

TYK18873.1 polyprotein [Cucumis melo var. makuwa]7.7e-6873.41Show/hide
Query:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV
        C NGS+AF LI +NP +   Q K+     N+GMIQIGVKTLT KIPSNASIILCVFDTRND FEDSILG+VE+ LSDGP++FN+FPNITM  FHPKL E 
Subjt:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV

Query:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK
        L LI +V+GFEQLP GT PISLMWRTCYKLQGS  P AL+ESPQGKTVFFQTDFE+S VAVQKVS+WD+V+ K
Subjt:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK

XP_038880673.1 uncharacterized protein LOC120072292 [Benincasa hispida]2.7e-8162.6Show/hide
Query:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGM
        MAT  KPC SSN GG  HALDSEEYIKKGNN+L WKIPK+PTTKIYKR PF+FFSDPSIKT++ ++SC NGS+AF LI++NP M      + Y   NVGM
Subjt:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGM

Query:  IQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGS
        IQIGVK +T KIPSNASIILCVFD+RN+ FED+ILG+VESNL                                 GFEQLP GT+PISLMWRTCYKLQ S
Subjt:  IQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGS

Query:  DLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLNLFL
         LP+ALLESPQGKTV+FQTD ++S VAVQKVSKWD+V+ K+ N+ +
Subjt:  DLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLNLFL

TrEMBL top hitse value%identityAlignment
A0A0A0LXM3 Uncharacterized protein1.1e-5446.96Show/hide
Query:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIY-KRKPFSFF-----SDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRK-KIY
        M+ F K C S N  G+ H+L+ EEY++KG +++ WK+P++P  KIY +R+   FF     +DPSI+T + +IS GN   +F L  + P     +R+   +
Subjt:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIY-KRKPFSFF-----SDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRK-KIY

Query:  CKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRT
           N+G++QIGVKTLT KIP NASIILC+ D R +K EDS+L +VES L DGP YFNVFPNI +  F   ++ VL + V+VKG +++P G+ PI +  RT
Subjt:  CKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRT

Query:  CYKLQGSDL-PDALLESPQGKTVFFQTDF--EDSNVAVQKVSKWDDV
        CYKL  +D   +AL+ESP GKTVFFQ +   +D +  VQKV+ W+ V
Subjt:  CYKLQGSDL-PDALLESPQGKTVFFQTDF--EDSNVAVQKVSKWDDV

A0A0A0LZS0 Uncharacterized protein8.5e-9772.02Show/hide
Query:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGM
        M+TF   CWSSN GG  H+LDSEEYIKKG N+L WKIPKIPTTKIYK  PF FFSDP IKTK+  + C NGS+ F LIS NP +   Q K+ Y + N+GM
Subjt:  MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGM

Query:  IQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGS
        IQIGVKTLT KIPSNASIILCVFDTRND FEDSILG+VES L DGP++FN+FPNITMP FHPKL E   LI +V+GFEQLP GT PISLMWRTCYKLQ S
Subjt:  IQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGS

Query:  DLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLN
         LP AL+ESPQGKTVFFQT+FE+S VA QKVS+WD+V+ KV N
Subjt:  DLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGKVLN

A0A2G9GSM0 Uncharacterized protein4.5e-2937.05Show/hide
Query:  NVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKF
        N+ NWK+P+  T++IY+ K F+F SD  IK            E F  +S     +  Q K+ Y   ++GMIQIG+K LT ++  N S ++ + D R+++F
Subjt:  NVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKF

Query:  EDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLES-----PQGKTVFFQTDFEDSN
        EDS+LG+VES L DGP+YF  FPN T+    P + + L L +  +GF+ L  GT  ++L++R CYK+  + +P   + S      +G+T  F TD E SN
Subjt:  EDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLES-----PQGKTVFFQTDFEDSN

Query:  VAVQKVSKWDDVLGKVLNLFLSEK
        + V K   W+ V  K+L ++  E+
Subjt:  VAVQKVSKWDDVLGKVLNLFLSEK

A0A5A7U9X3 Polyprotein3.2e-6772.83Show/hide
Query:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV
        C NGS+AF LI +NP +   Q K+     N+GMIQIGVKTLT KI SNASIILCVFDTRND FEDSILG+VE+ LSDGP++FN+FPNITM  FHPKL E 
Subjt:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV

Query:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK
        L LI +V+GFEQLP GT PISLMWRTCYKLQGS  P AL+ESPQGKTVFFQTDFE+S VAVQKVS+WD+V+ K
Subjt:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK

A0A5D3D5V1 Polyprotein3.7e-6873.41Show/hide
Query:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV
        C NGS+AF LI +NP +   Q K+     N+GMIQIGVKTLT KIPSNASIILCVFDTRND FEDSILG+VE+ LSDGP++FN+FPNITM  FHPKL E 
Subjt:  CGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTNKIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEV

Query:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK
        L LI +V+GFEQLP GT PISLMWRTCYKLQGS  P AL+ESPQGKTVFFQTDFE+S VAVQKVS+WD+V+ K
Subjt:  LGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTDFEDSNVAVQKVSKWDDVLGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACTTTTCTCAAACCCTGTTGGAGTTCAAACCCTGGAGGAGAAAAGCATGCTTTAGACAGTGAAGAATACATAAAAAAAGGTAACAATGTATTGAATTGGAAAAT
CCCAAAAATTCCCACAACCAAAATCTACAAAAGAAAGCCATTCAGTTTCTTCTCTGATCCATCCATCAAAACCAAACAAGTTGAAATCTCTTGTGGGAATGGCAGCGAAG
CTTTCAACTTAATATCTGAAAACCCCTTCATGGTTGGCCTCCAAAGGAAAAAAATCTACTGCAAGTTCAATGTGGGGATGATCCAAATTGGAGTCAAAACTTTAACCAAC
AAAATCCCCTCAAATGCCTCAATTATTCTCTGTGTTTTCGACACTCGAAATGACAAGTTTGAGGATTCAATTTTGGGGATGGTGGAATCGAATTTAAGTGATGGCCCTTT
GTATTTCAATGTTTTTCCAAACATTACAATGCCTTCGTTTCATCCTAAGTTGTCGGAGGTCTTAGGTTTGATTGTCATTGTTAAGGGTTTTGAGCAGCTTCCCTCAGGGA
CAAAACCAATTAGTTTGATGTGGAGGACTTGTTACAAGCTGCAAGGTAGTGATTTGCCAGATGCCTTGCTTGAAAGCCCACAAGGTAAAACTGTGTTTTTTCAGACAGAT
TTTGAGGATTCTAATGTTGCTGTTCAGAAGGTTTCAAAATGGGATGATGTTCTTGGCAAAGTATTGAACTTATTTCTCTCTGAGAAATAA
mRNA sequenceShow/hide mRNA sequence
TACTTTTCAACCTTCTCTCTCTCAATAATACACACAACATTCAAGTCTATAAAGCCCTTATCTTCTTCCATGGCAGAACCAAGCCCATCAACAAAACCCAAAACTTAAAC
ACCATTTTAACCATTTCCCATATCAAAAACTGTCTGCCAAAAAAAAAAAACAAAAAAATGGCAACTTTTCTCAAACCCTGTTGGAGTTCAAACCCTGGAGGAGAAAAGCA
TGCTTTAGACAGTGAAGAATACATAAAAAAAGGTAACAATGTATTGAATTGGAAAATCCCAAAAATTCCCACAACCAAAATCTACAAAAGAAAGCCATTCAGTTTCTTCT
CTGATCCATCCATCAAAACCAAACAAGTTGAAATCTCTTGTGGGAATGGCAGCGAAGCTTTCAACTTAATATCTGAAAACCCCTTCATGGTTGGCCTCCAAAGGAAAAAA
ATCTACTGCAAGTTCAATGTGGGGATGATCCAAATTGGAGTCAAAACTTTAACCAACAAAATCCCCTCAAATGCCTCAATTATTCTCTGTGTTTTCGACACTCGAAATGA
CAAGTTTGAGGATTCAATTTTGGGGATGGTGGAATCGAATTTAAGTGATGGCCCTTTGTATTTCAATGTTTTTCCAAACATTACAATGCCTTCGTTTCATCCTAAGTTGT
CGGAGGTCTTAGGTTTGATTGTCATTGTTAAGGGTTTTGAGCAGCTTCCCTCAGGGACAAAACCAATTAGTTTGATGTGGAGGACTTGTTACAAGCTGCAAGGTAGTGAT
TTGCCAGATGCCTTGCTTGAAAGCCCACAAGGTAAAACTGTGTTTTTTCAGACAGATTTTGAGGATTCTAATGTTGCTGTTCAGAAGGTTTCAAAATGGGATGATGTTCT
TGGCAAAGTATTGAACTTATTTCTCTCTGAGAAATAAGGGGAAGGAAGGAGATGGGTTTTTTTTTTTTTCTGATTTATTAATGAAGCAATAACTCAGTTCTTTCTATGTT
TTGATCAAATTTAGAAGAGGGTGTGTGATCATTTATGGATTTGATCAACTTAGTGAATTACATTTAAATTTGTGTTTAGCATATATACTTTTTTTTTTTCCTTGAGAGGA
GTGTGTTTGGCATAAAGTTATTGTGCTTTGTACTTTGTAATATGGCTTCTATTTATTGCATAAATCTAATAGCTATTTTTGTGT
Protein sequenceShow/hide protein sequence
MATFLKPCWSSNPGGEKHALDSEEYIKKGNNVLNWKIPKIPTTKIYKRKPFSFFSDPSIKTKQVEISCGNGSEAFNLISENPFMVGLQRKKIYCKFNVGMIQIGVKTLTN
KIPSNASIILCVFDTRNDKFEDSILGMVESNLSDGPLYFNVFPNITMPSFHPKLSEVLGLIVIVKGFEQLPSGTKPISLMWRTCYKLQGSDLPDALLESPQGKTVFFQTD
FEDSNVAVQKVSKWDDVLGKVLNLFLSEK