; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006278 (gene) of Snake gourd v1 genome

Gene IDTan0006278
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNA polymerase II elongation factor
Genome locationLG01:19430749..19431643
RNA-Seq ExpressionTan0006278
SyntenyTan0006278
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015562.1 hypothetical protein SDJN02_23198, partial [Cucurbita argyrosperma subsp. argyrosperma]4.8e-9885.71Show/hide
Query:  ETKDSG--AHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDP
        ETKDSG  AH+VEIP E    NQN+MISVI+QHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTL+FTSSDP
Subjt:  ETKDSG--AHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDP

Query:  KVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISL
         VC  HKWWVP V+MGATSGVFV+VVQLK+WLYWKA  QLQREK+ENRALTRCVQELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTW SRNF+TI L
Subjt:  KVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISL

Query:  VCFSGLVFAASKFILCG
        +CFSG+VF ASKFILCG
Subjt:  VCFSGLVFAASKFILCG

XP_004149573.1 uncharacterized protein LOC101219596 [Cucumis sativus]2.8e-9884.93Show/hide
Query:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD
        MAETKDS  AHIVEIPVEQEN  QNLMISVIQ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTL++TSSD
Subjt:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD

Query:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS
        P VC   KWWVPAV +GATSGVFV+VVQLK+W+YWKA GQLQ+EK ENRALTRCVQELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTWFSRNFITIS
Subjt:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS

Query:  LVCFSGLVFAASKFILCGF
        L+ FS ++F  SKFILCGF
Subjt:  LVCFSGLVFAASKFILCGF

XP_008449068.1 PREDICTED: uncharacterized protein LOC103491049 [Cucumis melo]2.1e-9885.39Show/hide
Query:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD
        MA+TKDS  AHIVEIPVEQEN  QNLMISVIQ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTL++TSSD
Subjt:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD

Query:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS
        P VC   KWWVPAV +GATSGVFV+VVQLK+W+YWKA GQLQREK+ENRALTRC QELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTWFSRNFI IS
Subjt:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS

Query:  LVCFSGLVFAASKFILCGF
        L+ FS +VFAASKFILCGF
Subjt:  LVCFSGLVFAASKFILCGF

XP_023553388.1 uncharacterized protein LOC111810817 [Cucurbita pepo subsp. pepo]1.8e-9784.72Show/hide
Query:  ETKDSG-AHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDPK
        ET+DSG AH+VEIP E    NQN+MISVI+QHPLRQISESSGHLLLLKLWQREEHLFGLR+GRRETK+ESLKQQIFQLCCYFFLFHALSLTL+FTSSDP 
Subjt:  ETKDSG-AHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDPK

Query:  VCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISLV
        VC  HKWWVP V+MGATSGVFV+VVQLK+WLYWKA  QLQREK+ENRALTRCVQELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTW SRNF+TI L+
Subjt:  VCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISLV

Query:  CFSGLVFAASKFILCG
        CFSG+VF ASKFILCG
Subjt:  CFSGLVFAASKFILCG

XP_038905570.1 uncharacterized protein LOC120091553 [Benincasa hispida]1.0e-10087.27Show/hide
Query:  MAETKDSGAHIVEIPVEQEN--QNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSS
        MAETKDSGAHIVEIPVEQEN  QNQN MISVIQ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTL++TSS
Subjt:  MAETKDSGAHIVEIPVEQEN--QNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSS

Query:  DPKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITI
        DP VC   KWWVPAV+MGATSGVFV+VVQLK+WLYWKA GQLQREK ENRALTRCVQELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTWFSRNFITI
Subjt:  DPKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITI

Query:  SLVCFSGLVFAASKFILCGF
         L+ FS +VFAASKFILC F
Subjt:  SLVCFSGLVFAASKFILCGF

TrEMBL top hitse value%identityAlignment
A0A0A0L2U7 Uncharacterized protein1.4e-9884.93Show/hide
Query:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD
        MAETKDS  AHIVEIPVEQEN  QNLMISVIQ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTL++TSSD
Subjt:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD

Query:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS
        P VC   KWWVPAV +GATSGVFV+VVQLK+W+YWKA GQLQ+EK ENRALTRCVQELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTWFSRNFITIS
Subjt:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS

Query:  LVCFSGLVFAASKFILCGF
        L+ FS ++F  SKFILCGF
Subjt:  LVCFSGLVFAASKFILCGF

A0A1S3BL81 uncharacterized protein LOC1034910491.0e-9885.39Show/hide
Query:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD
        MA+TKDS  AHIVEIPVEQEN  QNLMISVIQ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTL++TSSD
Subjt:  MAETKDS-GAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSD

Query:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS
        P VC   KWWVPAV +GATSGVFV+VVQLK+W+YWKA GQLQREK+ENRALTRC QELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTWFSRNFI IS
Subjt:  PKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITIS

Query:  LVCFSGLVFAASKFILCGF
        L+ FS +VFAASKFILCGF
Subjt:  LVCFSGLVFAASKFILCGF

A0A2P6QLA8 Uncharacterized protein2.4e-7162.9Show/hide
Query:  MAETKDSGAHIVEIPVEQENQ----NQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFT
        MAE KD   H+VEIPV++E+Q    +   +IS IQ HPL +ISES GHLLLLKLW+REE LFG RI RRET+++ ++++IFQLCC+F +FHA  LT++FT
Subjt:  MAETKDSGAHIVEIPVEQENQ----NQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFT

Query:  SS-DPKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNF
        SS + +     KWW+P++   +TS VFV++VQ+ +W YWK  GQLQREK ENRAL RC+QELRMKG SF+LSKEPQSGKR+KSSSVEIKW P+TW S+N 
Subjt:  SS-DPKVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNF

Query:  ITISLVCFSGLVFAASKFILC
        ITI LVCF+GLVF ASK +LC
Subjt:  ITISLVCFSGLVFAASKFILC

A0A6J1EWB5 uncharacterized protein LOC1114387097.5e-9785.25Show/hide
Query:  ETKDSG--AHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDP
        ETKDSG  AH+VEIP E    NQN+MISVI+QHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTL+FTSSDP
Subjt:  ETKDSG--AHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDP

Query:  KVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISL
         VC  HKWWVP V+MGATSGVFV+VVQLK+WLYWKA  QLQREK+ENRALTRCVQELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLT  SRNF+TI L
Subjt:  KVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISL

Query:  VCFSGLVFAASKFILCG
        +CFSG+VF ASKFILCG
Subjt:  VCFSGLVFAASKFILCG

A0A6J1HN07 uncharacterized protein LOC1114651142.0e-9785.71Show/hide
Query:  ETKDSGA--HIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDP
        ETKDSGA  HIVEIP E    NQN+MISVI+QHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTL+FTSSDP
Subjt:  ETKDSGA--HIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDP

Query:  KVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISL
         VC  HKWWVP V+MGATSGVFV+VVQLK+WLYWKA  QLQREK+ENRALTR VQELRMKG  FNLSKEPQ G RMKSSSVEIKWGPLTW SRNF+TI L
Subjt:  KVCHNHKWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISL

Query:  VCFSGLVFAASKFILCG
        +CFSG+VF ASKFILCG
Subjt:  VCFSGLVFAASKFILCG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G12870.1 unknown protein1.2e-2536.65Show/hide
Query:  QHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMF-----TSSDPKVCHNHKWWVPAVSMGATSGVFVMVV
        +HPL QI+++  H LLLK W +EE L   R+  +E++++S++++I QL  +FFLFH++SL L+F     +SS        + W+P++    +S   +  V
Subjt:  QHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMF-----TSSDPKVCHNHKWWVPAVSMGATSGVFVMVV

Query:  QLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPL-TWFSRNFITISLVCFSGLVFAASKFILC
        + K  +    E  L+REK + + L +CV+EL+ KG  F+L KE  + +R KS  VE K  P+  W +R+F+T+     S LV A  + ILC
Subjt:  QLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPL-TWFSRNFITISLVCFSGLVFAASKFILC

AT5G56120.1 unknown protein2.6e-6556.72Show/hide
Query:  MAETKDSGAHIVEIPVEQEN----QNQNL---------MISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFH
        MA TKDS  H+VEIPV++E+    Q Q L         ++ VIQQHPL +ISES GHLLLLKLWQREE LF  R+  +E++LES+K++IFQLCC+F +FH
Subjt:  MAETKDSGAHIVEIPVEQEN----QNQNL---------MISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFH

Query:  ALSLTLMFTSS---DPKVCHNH----KWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSS
            TL+++SS   D  V  ++    KWW+P+    ATS V V +VQ K++++WK    + RE+N+NR LTRCV ELRMKG SF+LSKEP SGKRMKSSS
Subjt:  ALSLTLMFTSS---DPKVCHNH----KWWVPAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSS

Query:  VEIKWGPLTWFSRNFITISLVCFSGLVFAASKFILCGF
        VEIKW P+TWFS+  ITI L+C +GL F  SKFILCGF
Subjt:  VEIKWGPLTWFSRNFITISLVCFSGLVFAASKFILCGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAAACCAAAGATTCCGGTGCCCACATAGTCGAAATCCCAGTAGAACAAGAGAATCAGAATCAAAACCTTATGATCTCTGTAATCCAACAACACCCATTGAGGCA
AATTTCTGAAAGCTCCGGGCATCTATTGCTCTTAAAACTCTGGCAACGAGAGGAGCATCTGTTCGGCCTCCGAATCGGGCGGCGAGAGACCAAACTGGAGTCTCTGAAGC
AACAAATCTTCCAACTCTGCTGCTACTTCTTCCTCTTCCACGCCCTCTCTCTGACCCTGATGTTCACTTCGTCGGATCCCAAGGTGTGCCACAACCACAAATGGTGGGTT
CCGGCGGTGTCGATGGGGGCGACGTCGGGGGTGTTTGTGATGGTGGTGCAGCTGAAAGTGTGGCTGTATTGGAAGGCAGAAGGGCAGTTGCAGAGGGAGAAGAATGAAAA
CAGAGCACTTACAAGATGTGTTCAAGAGCTGAGGATGAAAGGGTGTAGTTTTAATTTGTCTAAAGAGCCTCAGAGTGGGAAGAGGATGAAGAGCTCTAGTGTTGAGATTA
AATGGGGGCCTCTCACTTGGTTCTCTAGGAATTTCATTACCATTTCTCTTGTTTGCTTTTCAGGGCTTGTTTTTGCTGCTTCCAAGTTCATTCTTTGTGGGTTTTAG
mRNA sequenceShow/hide mRNA sequence
TTTAAATCTCGCACCAAACAAAAAACCCCCTTCAAAACCTCAACAAAATTCAATCAATCCCATTTTCCAACAAAATACCAATCTCCATTTCTTCAATCACTCATGGCTGA
AACCAAAGATTCCGGTGCCCACATAGTCGAAATCCCAGTAGAACAAGAGAATCAGAATCAAAACCTTATGATCTCTGTAATCCAACAACACCCATTGAGGCAAATTTCTG
AAAGCTCCGGGCATCTATTGCTCTTAAAACTCTGGCAACGAGAGGAGCATCTGTTCGGCCTCCGAATCGGGCGGCGAGAGACCAAACTGGAGTCTCTGAAGCAACAAATC
TTCCAACTCTGCTGCTACTTCTTCCTCTTCCACGCCCTCTCTCTGACCCTGATGTTCACTTCGTCGGATCCCAAGGTGTGCCACAACCACAAATGGTGGGTTCCGGCGGT
GTCGATGGGGGCGACGTCGGGGGTGTTTGTGATGGTGGTGCAGCTGAAAGTGTGGCTGTATTGGAAGGCAGAAGGGCAGTTGCAGAGGGAGAAGAATGAAAACAGAGCAC
TTACAAGATGTGTTCAAGAGCTGAGGATGAAAGGGTGTAGTTTTAATTTGTCTAAAGAGCCTCAGAGTGGGAAGAGGATGAAGAGCTCTAGTGTTGAGATTAAATGGGGG
CCTCTCACTTGGTTCTCTAGGAATTTCATTACCATTTCTCTTGTTTGCTTTTCAGGGCTTGTTTTTGCTGCTTCCAAGTTCATTCTTTGTGGGTTTTAGAGGATGATGAA
ACAGGGGGAGCCTCTGAATTCTGAAAGCAAGTCAAAGTAGTACACAACAAGACTAGTGTTGTTGAGTGTGTTTAATTAAGAAGCAACACTGCTTGTGTGCACACTCTGTT
TTCTTTTTCTTACCC
Protein sequenceShow/hide protein sequence
MAETKDSGAHIVEIPVEQENQNQNLMISVIQQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLMFTSSDPKVCHNHKWWV
PAVSMGATSGVFVMVVQLKVWLYWKAEGQLQREKNENRALTRCVQELRMKGCSFNLSKEPQSGKRMKSSSVEIKWGPLTWFSRNFITISLVCFSGLVFAASKFILCGF