CuGenDBv2

Tan0019686 (gene) of Snake gourd v1 genome

Gene ID: Tan0019686
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon gag protein
Genome location: LG01:29292661..29300182
RNA-Seq Expression: Tan0019686
Synteny: Tan0019686
Gene Ontology terms: NA
InterPro domains: NA
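
The genome location string can be split into a linkage group and a coordinate pair when the record is processed programmatically. The following is a minimal sketch, not part of CuGenDBv2: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG01:29292661..29300182")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 7522 bp on LG01.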


Homology
GenBank top hits | e value | % identity | Alignment
KAA0059024.1 retrotransposon gag protein [Cucumis melo var. makuwa] | 4.2e-29 | 57.89
Query:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-
        SKE  +  ++ QRT +FDRIKPS  R SVFQR+SM   EEE+Q PT + TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MK L+ + F + N 
Subjt:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-

Query:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
        ++KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

KAA0063719.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 3.2e-29 | 59.09
Query:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-H
        KE ++  E+KQRT VFDRIKP   R SVFQR+SM   EEE+Q PT    R S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MKFL+ + F + N  
Subjt:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-H

Query:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
        +KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa] | 9.5e-29 | 59.09
Query:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELF-DKVNH
        +E ++  E+KQRTFVFDRIKP   R SVFQR+SM   EEE Q PT   TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+K+MK L+ + F +K   
Subjt:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELF-DKVNH

Query:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
        +KIHS +PSR KRK S+ INT+GSL   P F+
Subjt:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

TYK09793.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 1.2e-28 | 58.65
Query:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-
        SKE  +  E+ QRT VFDRIKPS  R SVFQR+SM   EEE+Q PT + TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MK  + + F + N 
Subjt:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-

Query:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
         +KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

TYK09793.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 8.1e-04 | 29.13
Query:  THSCSNELRSQEDEVVTLSGVSTN----------NKFLIKFNPLFDSNY-------------DIVSVMITETNTMEERMTRMQEQINDLMKAIKEKDYQI
        THS S E+  Q  EV     V+ N             +IK NP  D +Y             +I+SVM+T  +T E RMT +++++N LMK ++E+DY+I
Subjt:  THSCSNELRSQEDEVVTLSGVSTN----------NKFLIKFNPLFDSNY-------------DIVSVMITETNTMEERMTRMQEQINDLMKAIKEKDYQI

Query:  AYLKSQIENQDVVESKEAEDEKEVKQRTFVFDRIK----PSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQ
        A+LK+ IE+ D  ES      K   +   V    +     S    SV Q   M+A   + Q   P  T            S  ++P     D L + N  
Subjt:  AYLKSQIENQDVVESKEAEDEKEVKQRTFVFDRIK----PSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQ

Query:  QEKKMK
        Q  K +
Subjt:  QEKKMK

TYK14888.1 gag protease polyprotein [Cucumis melo var. makuwa] | 1.2e-28 | 58.65
Query:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-
        SKE  +  E+ QRT VFDRIKPS  R SVFQR+SM   EEE+Q PT + TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MK  + + F + N 
Subjt:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-

Query:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
         +KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

TrEMBL top hits | e value | % identity | Alignment
A0A5A7UV73 Retrotransposon gag protein | 2.1e-29 | 57.89
Query:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-
        SKE  +  ++ QRT +FDRIKPS  R SVFQR+SM   EEE+Q PT + TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MK L+ + F + N 
Subjt:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-

Query:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
        ++KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

A0A5A7VDY3 Ty3-gypsy retrotransposon protein | 1.6e-29 | 59.09
Query:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-H
        KE ++  E+KQRT VFDRIKP   R SVFQR+SM   EEE+Q PT    R S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MKFL+ + F + N  
Subjt:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-H

Query:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
        +KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

A0A5D3CA53 Retrotransposon gag protein | 4.6e-29 | 59.09
Query:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELF-DKVNH
        +E ++  E+KQRTFVFDRIKP   R SVFQR+SM   EEE Q PT   TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+K+MK L+ + F +K   
Subjt:  KEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELF-DKVNH

Query:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
        +KIHS +PSR KRK S+ INT+GSL   P F+
Subjt:  NKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

A0A5D3CD55 Ty3-gypsy retrotransposon protein | 6.0e-29 | 58.65
Query:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-
        SKE  +  E+ QRT VFDRIKPS  R SVFQR+SM   EEE+Q PT + TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MK  + + F + N 
Subjt:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-

Query:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
         +KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

A0A5D3CD55 Ty3-gypsy retrotransposon protein | 3.9e-04 | 29.13
Query:  THSCSNELRSQEDEVVTLSGVSTN----------NKFLIKFNPLFDSNY-------------DIVSVMITETNTMEERMTRMQEQINDLMKAIKEKDYQI
        THS S E+  Q  EV     V+ N             +IK NP  D +Y             +I+SVM+T  +T E RMT +++++N LMK ++E+DY+I
Subjt:  THSCSNELRSQEDEVVTLSGVSTN----------NKFLIKFNPLFDSNY-------------DIVSVMITETNTMEERMTRMQEQINDLMKAIKEKDYQI

Query:  AYLKSQIENQDVVESKEAEDEKEVKQRTFVFDRIK----PSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQ
        A+LK+ IE+ D  ES      K   +   V    +     S    SV Q   M+A   + Q   P  T            S  ++P     D L + N  
Subjt:  AYLKSQIENQDVVESKEAEDEKEVKQRTFVFDRIK----PSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQ

Query:  QEKKMK
        Q  K +
Subjt:  QEKKMK

A0A5D3CUI5 Gag protease polyprotein | 6.0e-29 | 58.65
Query:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-
        SKE  +  E+ QRT VFDRIKPS  R SVFQR+SM   EEE+Q PT + TR S F+RL+ STSKK+RPSTS FDRL++TNDQQ+++MK  + + F + N 
Subjt:  SKEAEDEKEVKQRTFVFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVN-

Query:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL
         +KIHS +PSRMKRK S+ INT+GSL   P F+
Subjt:  HNKIHSTIPSRMKRKYSILINTKGSLKGLPCFL

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGTTCTATTACCCACAGCTGCTCCAATGAGTTGAGGTCGCAAGAAGATGAAGTAGTGACACTGTCGGGGGTATCAACTAACAACAAGTTCTTGATCAAGTTTAACCC
TTTATTTGATTCTAATTATGATATAGTGTCTGTCATGATAACTGAAACAAACACTATGGAAGAAAGAATGACGAGGATGCAGGAACAAATCAACGACTTAATGAAGGCGA
TTAAAGAAAAAGATTATCAAATCGCATACTTAAAGAGTCAGATTGAGAATCAAGATGTCGTTGAATCAAAAGAAGCTGAAGACGAAAAAGAAGTCAAGCAAAGGACCTTC
GTCTTCGATCGCATAAAGCCTTCAGCTGCTAGACCTTCAGTCTTCCAAAGGATGAGTATGGTCGCGACAGAAGAAGAAGACCAAGACCCAACTCCTGTCTTAACTCGACC
TTCAGCCTTCCAAAGGTTAAATGACTCCACATCAAAGAAAAATCGACCTTCAACATCTGTTTTTGATCGTCTCGAAGTAACAAACGATCAACAAGAAAAAAAGATGAAAT
TCTTGGAGGAAGAATTGTTCGATAAAGTGAATCATAACAAGATTCATAGTACCATTCCTTCACGTATGAAAAGGAAGTACTCCATTCTCATAAATACAAAGGGCTCCTTG
AAGGGGCTTCCTTGCTTCCTCTCCAAGTTTGAGGGACTTCGCCGCTTCCTCTCTAAGTTTGAGGGACTTCGTCGCTTCCTCTCTAAATTCGAGGGATTTCATCACTTCGT
TGCTTCCTCCCCAAGTTTGAGAGACTTTGTCGCTTCCTCTCCAAGTTCGAGGGACTTCGTCGCTTCCTCCCACAGTTCGAGGGACTTCGTCGCTTCCTCTAAAAATTTGA
GGGGCTTCGTTGCTTCCTCTCCAAATTCGAGGGGCTTCGTTGCTTCCTCTCCAAGTTCGAGAGAGACTTCGTTGGTTCCTCCCCAAGTTCGAGGGACTTTGTCGCCTCTT
CCCCCAGGTTCAAAGGGTTTAATCTTCAAAGTCCTTCTTCCGACGGCTTCATCTTCAAGGTCCTTCTTTCGGCGCCTTCATCTTCAAGGTCTGTCTATGGCGACACTTCA
TCCTCAAAGCCTCTGGCGGCGCTTAGTCTTCAAGTCTCCATGCTCAAGCGTCACCTCGTATGTGCTTCAGAGTGGACACTTGAGGATCTACTCAAGCACCGCCTCGCAGA
TAGTTCAACTTTTCTCCCAAATCAAAATTGGATTGAAATTGAAATTGAAATTCAATTTCTATTCCAATTCCAATTGCACTTCAGAAATTGGAGACTCTAAAGAGAATTCA
CAAACCATAAGACTATTTATTCAACCTCGAGAACAAGCTCTTCAGCGCTCGAGAAAAGATTACAAGTCGAAGATTTTCAACATGCTTACAAGCCGAAGACCTTCAACAAG
CTACAAGTCGAAGATCTTCAACAAGCTTACAAGCCGAACACTTCAACAAGCTTCAAGCCGAAGATCTTCAACAAGCTACAAGTCGATGATCTTCAACAAGCTACAAGCCG
ACAATCTTCAACAATCTTACAAGCCGAAGACCTTCAACAAGCTACAAGCCGACGATCTTCAACAAGCTTACAAGCCGAAGTCCTTCAATAAGCTACAAGCCGACGATCTT
CAACAAGCTTACAAGTCGAAGACCTTTAACAAACTGCAAATCCCTCGAAATAGTTGTATATGA
mRNA sequence
ATGAGTTCTATTACCCACAGCTGCTCCAATGAGTTGAGGTCGCAAGAAGATGAAGTAGTGACACTGTCGGGGGTATCAACTAACAACAAGTTCTTGATCAAGTTTAACCC
TTTATTTGATTCTAATTATGATATAGTGTCTGTCATGATAACTGAAACAAACACTATGGAAGAAAGAATGACGAGGATGCAGGAACAAATCAACGACTTAATGAAGGCGA
TTAAAGAAAAAGATTATCAAATCGCATACTTAAAGAGTCAGATTGAGAATCAAGATGTCGTTGAATCAAAAGAAGCTGAAGACGAAAAAGAAGTCAAGCAAAGGACCTTC
GTCTTCGATCGCATAAAGCCTTCAGCTGCTAGACCTTCAGTCTTCCAAAGGATGAGTATGGTCGCGACAGAAGAAGAAGACCAAGACCCAACTCCTGTCTTAACTCGACC
TTCAGCCTTCCAAAGGTTAAATGACTCCACATCAAAGAAAAATCGACCTTCAACATCTGTTTTTGATCGTCTCGAAGTAACAAACGATCAACAAGAAAAAAAGATGAAAT
TCTTGGAGGAAGAATTGTTCGATAAAGTGAATCATAACAAGATTCATAGTACCATTCCTTCACGTATGAAAAGGAAGTACTCCATTCTCATAAATACAAAGGGCTCCTTG
AAGGGGCTTCCTTGCTTCCTCTCCAAGTTTGAGGGACTTCGCCGCTTCCTCTCTAAGTTTGAGGGACTTCGTCGCTTCCTCTCTAAATTCGAGGGATTTCATCACTTCGT
TGCTTCCTCCCCAAGTTTGAGAGACTTTGTCGCTTCCTCTCCAAGTTCGAGGGACTTCGTCGCTTCCTCCCACAGTTCGAGGGACTTCGTCGCTTCCTCTAAAAATTTGA
GGGGCTTCGTTGCTTCCTCTCCAAATTCGAGGGGCTTCGTTGCTTCCTCTCCAAGTTCGAGAGAGACTTCGTTGGTTCCTCCCCAAGTTCGAGGGACTTTGTCGCCTCTT
CCCCCAGGTTCAAAGGGTTTAATCTTCAAAGTCCTTCTTCCGACGGCTTCATCTTCAAGGTCCTTCTTTCGGCGCCTTCATCTTCAAGGTCTGTCTATGGCGACACTTCA
TCCTCAAAGCCTCTGGCGGCGCTTAGTCTTCAAGTCTCCATGCTCAAGCGTCACCTCGTATGTGCTTCAGAGTGGACACTTGAGGATCTACTCAAGCACCGCCTCGCAGA
TAGTTCAACTTTTCTCCCAAATCAAAATTGGATTGAAATTGAAATTGAAATTCAATTTCTATTCCAATTCCAATTGCACTTCAGAAATTGGAGACTCTAAAGAGAATTCA
CAAACCATAAGACTATTTATTCAACCTCGAGAACAAGCTCTTCAGCGCTCGAGAAAAGATTACAAGTCGAAGATTTTCAACATGCTTACAAGCCGAAGACCTTCAACAAG
CTACAAGTCGAAGATCTTCAACAAGCTTACAAGCCGAACACTTCAACAAGCTTCAAGCCGAAGATCTTCAACAAGCTACAAGTCGATGATCTTCAACAAGCTACAAGCCG
ACAATCTTCAACAATCTTACAAGCCGAAGACCTTCAACAAGCTACAAGCCGACGATCTTCAACAAGCTTACAAGCCGAAGTCCTTCAATAAGCTACAAGCCGACGATCTT
CAACAAGCTTACAAGTCGAAGACCTTTAACAAACTGCAAATCCCTCGAAATAGTTGTATATGA
Protein sequence
MSSITHSCSNELRSQEDEVVTLSGVSTNNKFLIKFNPLFDSNYDIVSVMITETNTMEERMTRMQEQINDLMKAIKEKDYQIAYLKSQIENQDVVESKEAEDEKEVKQRTF
VFDRIKPSAARPSVFQRMSMVATEEEDQDPTPVLTRPSAFQRLNDSTSKKNRPSTSVFDRLEVTNDQQEKKMKFLEEELFDKVNHNKIHSTIPSRMKRKYSILINTKGSL
KGLPCFLSKFEGLRRFLSKFEGLRRFLSKFEGFHHFVASSPSLRDFVASSPSSRDFVASSHSSRDFVASSKNLRGFVASSPNSRGFVASSPSSRETSLVPPQVRGTLSPL
PPGSKGLIFKVLLPTASSSRSFFRRLHLQGLSMATLHPQSLWRRLVFKSPCSSVTSYVLQSGHLRIYSSTASQIVQLFSQIKIGLKLKLKFNFYSNSNCTSEIGDSKENS
QTIRLFIQPREQALQRSRKDYKSKIFNMLTSRRPSTSYKSKIFNKLTSRTLQQASSRRSSTSYKSMIFNKLQADNLQQSYKPKTFNKLQADDLQQAYKPKSFNKLQADDL
QQAYKSKTFNKLQIPRNSCI
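
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is illustrative only and is not the pipeline CuGenDBv2 uses: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGAGTTCTATTACCCACAGCTGC"))
# Last seven codons of the CDS, ending in the TGA stop codon.
print(translate("CCTCGAAATAGTTGTATATGA"))
```

The two fragments translate to MSSITHSC and PRNSCI, which match the start and end of the protein sequence listed above. Passing the full CDS with its line breaks intact works the same way, since whitespace is stripped before translation.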