; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020445 (gene) of Snake gourd v1 genome

Gene IDTan0020445
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationLG01:65297740..65299306
RNA-Seq ExpressionTan0020445
SyntenyTan0020445
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma]7.1e-9067.48Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT
        M+LV+LQ F+PL+D TSL  QI+KDADV+ T  M  +IASH+SPRFVATLQ+S R FTNY+VD  ++S+VSL+SFHDA+L+GGSFSSM+IH+L+   +M 
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT

Query:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI
        LR+E    N PPL+ ELPLSPP+ E+LG VE+GKFFTVNSK LR+IIKELPLF DD V V  T++++KFSIASKEI +TKE G+C+IVG+E E ET+L I
Subjt:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI

Query:  VLRPMLFFLNLTYKANRVWFYKTTNSRSLISVPAFGVYGQYVIHFP
          RPM+FFLN TYKANRVWFYKTTNS+S+ISVPAFG+YGQYV++FP
Subjt:  VLRPMLFFLNLTYKANRVWFYKTTNSRSLISVPAFGVYGQYVIHFP

XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]1.1e-8271.89Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT
        M+LVRL+ F+PL+D TSL AQ+AKDADVK TP M  II S+ SP+FVATLQ+SRR FTN++VD N +S+VSLQ FHDA+L+GGSFSSMTIHLLD  N+M 
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT

Query:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI
        LRFE    +VPPL+HEL LSPP+ E LG VE+G FFTV S+ELRRIIKELPLF  D+V VTVT SQ+KFSI SKEIILTKEGGHCKIVG+E EVET+LQ+
Subjt:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI

Query:  VLRPMLFFLNLTYKANR
        VLRPM+FFLN TY+AN+
Subjt:  VLRPMLFFLNLTYKANR

XP_008464344.1 PREDICTED: uncharacterized protein LOC103502250 [Cucumis melo]1.7e-8364.82Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKD-ADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM
        M+LV+L+NFDPL+D TS  AQI+ D AD+K TPS FFIIASH SPRF+ATLQ+S ++FT ++VD +H+S+VSL+SFHDA+L+GGSF+SMTIHLLDK N+M
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKD-ADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM

Query:  TLRFEASPGNVPPLYHELPLSPPEGE----ALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV
         LRF+     + PL+HEL LSPP+ E        ++  K+F V SK LRRIIK+LP+FQ+DS + V VTNS++KFSIASKEIILT EG HCKI GFEEEV
Subjt:  TLRFEASPGNVPPLYHELPLSPPEGE----ALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV

Query:  ETQLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP
        ETQ QI+L PM+FFLN TYKANRVWFYKT  N+ +L+ VPA+G++GQYVI+FP
Subjt:  ETQLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP

XP_031743957.1 uncharacterized protein LOC116404737 [Cucumis sativus]1.1e-7158.1Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIA-KDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM
        M+LV+L  F+PL+D T   A  +  +A VK TP   F+I S+  P F+ATLQ+S R+FTN++VD  H+S+VSL+SFHDA+L+GG FS+M+IHLLDK N+M
Subjt:  MYLVRLQNFDPLVDVTSLFAQIA-KDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM

Query:  TLRFEASPGNVPPLYHELPLSPPEGE--ALGHVEHG--KFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV
         LRF+     + PL+ EL LSPP+ E   +G  E    KFF V SK LRR+IKELP+FQ+DS +CV VT SQ+KFSIAS +I+ + EG HC+I GFEEEV
Subjt:  TLRFEASPGNVPPLYHELPLSPPEGE--ALGHVEHG--KFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV

Query:  ETQLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP
        ETQ QI L PMLFFLN TY+A+RVWFYKT  N+ +++ VPAFG+YGQY I+FP
Subjt:  ETQLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida]3.1e-8565.74Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT
        M+LV+L NF+PL+D TS  AQI+  ADVK TP  F++IA + SPRFVATLQ+S++ FTNY+VD  HTS+V L+SFHDA+L+GGSF+SMTIHLL+K N+M 
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT

Query:  LRFEASPGNVPPLYHELPLSPP---EGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVET
        LRF+     +PPL+HEL  SPP   +    G +E GKFF V S+ LRRIIKELP+FQDDS VCV VT+SQ+KFSIASKEI+L  +  HC+IVGFEEEVET
Subjt:  LRFEASPGNVPPLYHELPLSPP---EGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVET

Query:  QLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP
        Q QI+LRPMLFFLN TYKAN+VWFYKT  NS S+++VPAFG+ GQYVI+FP
Subjt:  QLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980105.3e-8371.89Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT
        M+LVRL+ F+PL+D TSL AQ+AKDADVK TP M  II S+ SP+FVATLQ+SRR FTN++VD N +S+VSLQ FHDA+L+GGSFSSMTIHLLD  N+M 
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT

Query:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI
        LRFE    +VPPL+HEL LSPP+ E LG VE+G FFTV S+ELRRIIKELPLF  D+V VTVT SQ+KFSI SKEIILTKEGGHCKIVG+E EVET+LQ+
Subjt:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI

Query:  VLRPMLFFLNLTYKANR
        VLRPM+FFLN TY+AN+
Subjt:  VLRPMLFFLNLTYKANR

A0A1S3CL88 uncharacterized protein LOC1035022508.2e-8464.82Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKD-ADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM
        M+LV+L+NFDPL+D TS  AQI+ D AD+K TPS FFIIASH SPRF+ATLQ+S ++FT ++VD +H+S+VSL+SFHDA+L+GGSF+SMTIHLLDK N+M
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKD-ADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM

Query:  TLRFEASPGNVPPLYHELPLSPPEGE----ALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV
         LRF+     + PL+HEL LSPP+ E        ++  K+F V SK LRRIIK+LP+FQ+DS + V VTNS++KFSIASKEIILT EG HCKI GFEEEV
Subjt:  TLRFEASPGNVPPLYHELPLSPPEGE----ALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV

Query:  ETQLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP
        ETQ QI+L PM+FFLN TYKANRVWFYKT  N+ +L+ VPA+G++GQYVI+FP
Subjt:  ETQLQIVLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP

A0A5D3DZ07 LINE-1 retrotransposable element ORF2 protein1.3e-6563.38Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKD-ADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM
        M+LV+L+NFDPL+D TS  AQI+ D AD+K TPS FFIIASH SPRF+ATLQ+S ++FT ++VD +H+S+VSL+SFHDA+L+GGSF+SMTIHLLDK N+M
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKD-ADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKM

Query:  TLRFEASPGNVPPLYHELPLSPPEGE----ALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV
         LRF+     + PL+HEL LSPP+ E        ++  K+F V SK LRRIIK+LP+FQ+DS + V VTNS++KFSIASKEIILT EG HCKI GFEEEV
Subjt:  TLRFEASPGNVPPLYHELPLSPPEGE----ALGHVEHGKFFTVNSKELRRIIKELPLFQDDS-VCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEV

Query:  ETQLQIVLRPMLF
        ETQ QI+L PM++
Subjt:  ETQLQIVLRPMLF

A0A6J1KIW5 uncharacterized protein LOC1114948741.0e-6552.23Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT
        M+LVRL+NF PLVD TS  AQIA+++D+  TP    +  S  SPRF+ATLQI  + FT Y+V+++H S++SL+S HDALL+ GS S+MTIHLL+  N M 
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT

Query:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI
        LRFE +P + P L H+  L PP+ +++  +E+ K   ++S++LR++IKELPLF  DSVCVTVT+S+++FSIAS+E+I  KE G C+I+G + +  T+ +I
Subjt:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI

Query:  VLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP
        VL PMLFFLNLTY    VWF+KT TN+  ++  P F ++ QYVI+FP
Subjt:  VLRPMLFFLNLTYKANRVWFYKT-TNSRSLISVPAFGVYGQYVIHFP

A0A6J1KZ05 uncharacterized protein LOC1114988871.1e-6453.06Show/hide
Query:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT
        M+LVRL +FDPL + TSL AQI+ +AD+K + S F +I S+ S RFVAT QIS RFF NY+VD NH+S+VSLQSF+DA+ +G  FSSMTIH  +  ++M 
Subjt:  MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMT

Query:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI
        L+FE+S      ++  L LSP + E LG ++H +FF++ S++ R II  LP F ++S+ V++T+S++KF  AS+E ILTKEGG C I+G+E E E   QI
Subjt:  LRFEASPGNVPPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQI

Query:  VLRPMLFFLNLTYKANRVWFYKTTNSRSLISVPAFGVYGQYVIHF
         L P  FF NL+Y A R+WFYKT +SR +I VPAFG+  QYVI+F
Subjt:  VLRPMLFFLNLTYKANRVWFYKTTNSRSLISVPAFGVYGQYVIHF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTTAGTGAGGCTTCAGAACTTTGATCCTCTTGTGGACGTAACATCCTTATTCGCTCAAATTGCTAAGGATGCGGACGTGAAATTGACACCGTCAATGTTCTTCAT
AATCGCATCGCACTCTTCCCCTCGCTTCGTTGCAACCCTCCAAATTTCACGTCGTTTCTTCACAAACTATACCGTCGATCTGAATCACACTTCACAGGTTTCCCTTCAAT
CCTTCCACGATGCTTTGTTGGAGGGCGGAAGTTTCTCTTCAATGACAATCCATCTTCTTGACAAAATTAACAAAATGACTCTTCGATTTGAGGCTTCACCTGGGAATGTG
CCACCGTTGTATCATGAATTGCCATTGTCTCCTCCGGAAGGAGAGGCTTTAGGCCATGTTGAACATGGAAAATTTTTCACTGTTAATTCTAAAGAATTAAGACGTATTAT
CAAAGAATTACCTCTCTTTCAAGATGATTCAGTTTGTGTTACGGTAACAAATTCACAACTCAAGTTCTCTATTGCATCCAAGGAGATTATTCTTACCAAAGAGGGTGGAC
ATTGTAAAATTGTAGGTTTTGAAGAAGAAGTTGAAACTCAACTCCAAATCGTTCTCCGCCCCATGTTATTTTTCCTTAATTTGACTTATAAAGCCAATAGGGTATGGTTT
TATAAGACAACTAATTCCCGTAGTCTAATCAGTGTTCCAGCTTTTGGAGTGTATGGTCAATATGTGATCCATTTCCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTACTTAGTGAGGCTTCAGAACTTTGATCCTCTTGTGGACGTAACATCCTTATTCGCTCAAATTGCTAAGGATGCGGACGTGAAATTGACACCGTCAATGTTCTTCAT
AATCGCATCGCACTCTTCCCCTCGCTTCGTTGCAACCCTCCAAATTTCACGTCGTTTCTTCACAAACTATACCGTCGATCTGAATCACACTTCACAGGTTTCCCTTCAAT
CCTTCCACGATGCTTTGTTGGAGGGCGGAAGTTTCTCTTCAATGACAATCCATCTTCTTGACAAAATTAACAAAATGACTCTTCGATTTGAGGCTTCACCTGGGAATGTG
CCACCGTTGTATCATGAATTGCCATTGTCTCCTCCGGAAGGAGAGGCTTTAGGCCATGTTGAACATGGAAAATTTTTCACTGTTAATTCTAAAGAATTAAGACGTATTAT
CAAAGAATTACCTCTCTTTCAAGATGATTCAGTTTGTGTTACGGTAACAAATTCACAACTCAAGTTCTCTATTGCATCCAAGGAGATTATTCTTACCAAAGAGGGTGGAC
ATTGTAAAATTGTAGGTTTTGAAGAAGAAGTTGAAACTCAACTCCAAATCGTTCTCCGCCCCATGTTATTTTTCCTTAATTTGACTTATAAAGCCAATAGGGTATGGTTT
TATAAGACAACTAATTCCCGTAGTCTAATCAGTGTTCCAGCTTTTGGAGTGTATGGTCAATATGTGATCCATTTCCCCTAG
Protein sequenceShow/hide protein sequence
MYLVRLQNFDPLVDVTSLFAQIAKDADVKLTPSMFFIIASHSSPRFVATLQISRRFFTNYTVDLNHTSQVSLQSFHDALLEGGSFSSMTIHLLDKINKMTLRFEASPGNV
PPLYHELPLSPPEGEALGHVEHGKFFTVNSKELRRIIKELPLFQDDSVCVTVTNSQLKFSIASKEIILTKEGGHCKIVGFEEEVETQLQIVLRPMLFFLNLTYKANRVWF
YKTTNSRSLISVPAFGVYGQYVIHFP