CuGenDBv2

Tan0007783 (gene) of Snake gourd v1 genome

Gene ID: Tan0007783
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG04:4762976..4765636
RNA-Seq Expression: Tan0007783
Synteny: Tan0007783
Gene Ontology terms: NA
InterPro domains: NA
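
The genome location field above has the form SEQID:START..END. A minimal Python sketch for splitting such a string and computing the span length follows. The function names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("LG04:4762976..4765636")
    print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 2661 bases.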


Homology
GenBank top hits
PNX55172.1 ribonuclease H [Trifolium pratense] | e value: 1.9e-26 | % identity: 39.16
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +SPY+F+LC + LS LIS   + + +  + ++ R+P+ SHLFF DDSL F +A+  E S +  I+  Y+ ASGQ +N+NKS ++F+  V  + KQ ++ I
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYPPNL
        LPM++VD+  KYLG+P    R K      +KEK+W  L  WK    S A     +K + +  P  L
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYPPNL

XP_015389496.1 uncharacterized protein LOC107178625 [Citrus sinensis] | e value: 1.5e-26 | % identity: 39.75
Query:  PYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLILP
        PY+F+LC EGLS LI    R   +  + +   +P  SHLFF DD   F KAN +E  ++K ILG+Y + SGQ +N +KS+I F+ NV+++ K+ L  IL 
Subjt:  PYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLILP

Query:  MRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP
        +    N G YLG+PS   R K +    +++++W  LHSW  N  S A  +  LK + ++ P
Subjt:  MRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP

XP_023880426.1 uncharacterized protein LOC111992797 [Quercus suber] | e value: 5.1e-27 | % identity: 43.56
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +S Y+FLLC EG S LI+  A   +++ I++   +PK SHLFF DDS+ F KAN NE + LK ILG+YE ASGQ IN NKS+I F+PN  Q  K  +  I
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP
        L      +  KYLG+PS   R K      +K+K+   L  WKG L S    +  +K + +  P
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP

XP_030922958.1 uncharacterized protein LOC115949824 [Quercus lobata] | e value: 1.9e-26 | % identity: 43.9
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCK-QFLSL
        +SPY+FLLC EG + LI+   R ++++ IT+   +PK SHLFF DD+L F KAN  E + L  IL IYE ASGQ IN  KS+I F+PN  Q  K + LS+
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCK-QFLSL

Query:  ILPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP
        + PM       KYLG+PS   R K      ++EK+   L SWKG L S    +  +K + +  P
Subjt:  ILPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP

XP_034218997.1 uncharacterized protein LOC117630370 [Prunus dulcis] | e value: 3.3e-26 | % identity: 39.88
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +SPY+FLLC EGL+ LI+ + R   +  +++   +P  SHLFF DDS  F +AN  +  +LK IL  YE+ASGQ +N  KSA+ F  NV +  +  L+  
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP
        + +  VD+  +YLG+P    + K      LKE+LW  L +WKG L S A  +  +KV+ +  P
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP

TrEMBL top hits
A0A0J8B9Y6 Reverse transcriptase domain-containing protein | e value: 2.7e-26 | % identity: 36.79
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +SPY+FLLC E  S L+S  A   ++    +    P+ SHLFF DDS+ F +A   E S++  IL  YE+ASGQ IN +KS + F+ +V+ + +  +  +
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYPP---NLWVVSDCQRNCVS--CSQDRWRSSG
          +R V+   KYLG+P+   R K     VLKE++W  L  WK  L S A  +  LK + +  P    +L+ V DC  N ++  CS+  W + G
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYPP---NLWVVSDCQRNCVS--CSQDRWRSSG

A0A2K3JMB0 Ribonuclease H | e value: 9.3e-27 | % identity: 39.16
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +SPY+F+LC + LS LIS   + + +  + ++ R+P+ SHLFF DDSL F +A+  E S +  I+  Y+ ASGQ +N+NKS ++F+  V  + KQ ++ I
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYPPNL
        LPM++VD+  KYLG+P    R K      +KEK+W  L  WK    S A     +K + +  P  L
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYPPNL

A0A392N7V7 Reverse transcriptase domain-containing protein (Fragment) | e value: 7.9e-26 | % identity: 40.94
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +SPY+F+LCV+ LS +IS       ++ I + + +PK SHLFF DDSL F KAN  E + L  +L  Y++ SGQ IN++KS + F PN++Q  K     +
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSA
        +P++V DN+ KYLG+P+   R K    K + +++W  L  WK    S A
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSA

A0A7N2L6Z9 Reverse transcriptase domain-containing protein | e value: 6.0e-26 | % identity: 42.33
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +SPY+FLLC EGLS L+   AR + ++ I+L    P+ +HLFF DDSL F KAN  E   LK IL  YE ASGQ +N +KS+I F+PN     K+ +  I
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP
        L         KYLG+PS   R K      +KE++ + L  WKG L SS   +  +K + +  P
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP

A0A803NWD5 Uncharacterized protein | e value: 4.2e-27 | % identity: 41.1
Query:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI
        +SPY+F+LC EGLS L+ +     ++  I+++ R+P  +HLFF DDSL F  A+ +    L+ I  IY KASGQAIN +KS+ILF+PNV    K      
Subjt:  MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLI

Query:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP
        L ++ +  + KYLG+P   SR K      LK+ +  +LHSW    FS A  +  LKV+ +  P
Subjt:  LPMRVVDNLGKYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYP

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCACCTTATATGTTTCTATTGTGTGTTGAAGGACTTTCCGTTCTAATTTCCTCAGAAGCAAGAACAAGAAAGGTGTCCAGTATTACATTAACATCTCGTAGTCCTAA
GACATCACATTTGTTCTTTGTTGATGATAGTCTTTTTTTCCTAAAAGCAAATGCAAATGAATTCAGTCTGCTAAAGGTTATTCTGGGTATTTATGAAAAGGCATCAGGGC
AAGCTATCAATATGAACAAGTCTGCAATTTTGTTCACCCCAAATGTGGAGCAACACTGTAAACAATTCTTGAGCTTAATATTACCCATGAGAGTTGTTGACAATTTGGGG
AAATACTTGGGAATCCCCTCTTCCTTTTCAAGGTGCAAGTGGGATGACTTGAAGGTTTTAAAAGAAAAATTATGGGTGATACTGCATAGTTGGAAAGGTAACTTGTTTTC
CTCAGCAGCAAATAAGTACTACTTAAAAGTGTTGCCCAAGCTTTACCCACCTAATCTATGGGTTGTTTCAGACTGCCAAAGAAATTGTGTCAGTTGTTCACAAGACAGAT
GGCGCAGTTCTGGTGGGGGGCTATAA
mRNA sequence
ATTGACACTGATTACGTGAGTCCAAACAAGAGGCACACCGAATAGGTTGAGGAGCTTTTGGCGGTTAATGATGAGGTATGTACTCTTAATTTGGCAGAGGTTGCAACGCA
ACCCCGCCAATTGCCATGAAAATTTTATGTTGGAATGTCCGAGGGTTGGGGAGCTCTCGGGCATTCCATGTTTTAAGCGACGAATTAGCTCGTTATACGCTTCATTTATG
CTTTTTATTGGAAACAAAATGTAATTCTGGTATTCTGAGTAAATTGAAAAACAATTTGCACTATTATGGTTATTTTATTGTTGATAGAGTAGGCCTGAGCGGAAGACTTT
GTTTACTTTGGAAAGAGGAGATTGGGGTGACAATTAGATCCCTTTTCACTCATCATATAGATGTTTCTATTGTTTGGAATAATAGATGTTGGAGATTCATGGGCTTGTAT
GGACAACCGGATCAAGTCTTACGACATCAGACATGGGAACTTCTCAGGCGTTTAAGGAATAATGATTCTTCGTCGAGGGTTGTGAAGGGTGATCTGAATGAAATTCTTTG
GGATTGTGAAAAGAGAGCGGGGGAGACAGACAACAGGGACTGATTCAAGAATTCTGAAGAGTAGTGGATGATTGTCACTTGTGTGATTTGGGTTTCGAGGGCGACCAATT
TACATGGTGTAACAGAAGGAAAGCAAATGACTAGGTGAGTCTCCGCCTTGACAGATTTTTGGCTAATCAGAATTTATGTACCTTGTTCCTGGACTGTGTAGTTCGCCACC
TTGACTGGGCAAAGTCGGATCATCGTCCCATTTTACTGACTTTAGTTGTGTCACCTAATTGCACGGGGACATCTCGAAAACAGTTCAGGTTTGAGGAATTATGGCTGAAA
GATCCGGACTACAGACATATTATTAGTAAAGCGGGGGACTGGGGTAAGTTGAGTAAGTCTGGTCCATTACTACAGTCCTTGTTGGCTTCGACGCCAAGTTTATTGGGAGC
TTAGGCCAGAAAACAAAATTGTCAACTTAGATCTGACATAATCAGGCTGAAACTGCAATTAAAGGAAGCTTATGCACATCTGAGGCCTGTGGACTATGATCTTATTAAAT
CCTTGGAAAGTAAGCTTGATGGGAAAATGTTAGAAGAAGAAATCTACTGGAAGCAAAGATCAAGGGAAAACTGGCTAAATTGGGGTGATAAAAACACCAAGTGGTTCCAC
CTGAAGGCTTCGATCAGGAAGAAAAGAAACACAATTAAAGGAATCAGAGATGGCAAGGGACAGTGGCAAGAAGATCCGTTGACTGTATCGTCGGTGTTCTTGAGACATTT
TCAGAATGTGTTCAGATCCTCAGAACCGCCTAGGAAGGATATTTCACAGGTCCAATGTTATCTCTACCCAATTTTCTGATGGTGTGAATAATATGCTACTACAGGAATAT
ACTAAGGAGGAGATTGAACGAGCAATTAGGAGTTTCCATCCAACCAAACCTCCTGGTCCAGATGGTTTCCCTGCAGTCTTTTATCAAAAGTTTTGGGACATCATTGGTCC
CCAGACAAATGAGAGTTTCTTGAAAATTCTCAACCAGGATCTTTCGTTAGACGTGTGGAATAAAACTACAATTTTTTTAATCCCCAAGGTAAAAGAGCCTAAAGATGTAG
CAGATTTTCGACCAATCAACCTATGTAATGTTTGTTATAAAATTGTGACCAAGGTAATAACAAAACATGCTGAAATGGATTCTGGGTGATACAATATCAGAACCTCAATC
TGTTTTCCTTCCTGGGAGGAATATATCAGATAATTTGATTATTGCCTATGAAACATTAAACTAAATGAAACACAAAAAGAACGGGAAGAGGAGGGCTATGTTGCGATAAA
GTTAGATATGAGTAAGGTCTATGATAGGGTGGAATGGTCCTTTCTCAAGCAGGTGTTATTGAAATTGGGCTTCCATTGTTGTTGGGTGAGCTTGATTATGGATTGCATTT
CTACAGTATCCTTTTCTATCTCTTTGAATGGGTCTGCAATTGGTTCCGTCAAACCATCCAGGGGTTTAAGACAAGGGGACCATGTCACCTTATATGTTTCTATTGTGTGT
TGAAGGACTTTCCGTTCTAATTTCCTCAGAAGCAAGAACAAGAAAGGTGTCCAGTATTACATTAACATCTCGTAGTCCTAAGACATCACATTTGTTCTTTGTTGATGATA
GTCTTTTTTTCCTAAAAGCAAATGCAAATGAATTCAGTCTGCTAAAGGTTATTCTGGGTATTTATGAAAAGGCATCAGGGCAAGCTATCAATATGAACAAGTCTGCAATT
TTGTTCACCCCAAATGTGGAGCAACACTGTAAACAATTCTTGAGCTTAATATTACCCATGAGAGTTGTTGACAATTTGGGGAAATACTTGGGAATCCCCTCTTCCTTTTC
AAGGTGCAAGTGGGATGACTTGAAGGTTTTAAAAGAAAAATTATGGGTGATACTGCATAGTTGGAAAGGTAACTTGTTTTCCTCAGCAGCAAATAAGTACTACTTAAAAG
TGTTGCCCAAGCTTTACCCACCTAATCTATGGGTTGTTTCAGACTGCCAAAGAAATTGTGTCAGTTGTTCACAAGACAGATGGCGCAGTTCTGGTGGGGGGCTATAATGG
AGAAAAAGAAGATTCATTGGA
Protein sequence
MSPYMFLLCVEGLSVLISSEARTRKVSSITLTSRSPKTSHLFFVDDSLFFLKANANEFSLLKVILGIYEKASGQAINMNKSAILFTPNVEQHCKQFLSLILPMRVVDNLG
KYLGIPSSFSRCKWDDLKVLKEKLWVILHSWKGNLFSSAANKYYLKVLPKLYPPNLWVVSDCQRNCVSCSQDRWRSSGGGL
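
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The Python sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, and the use of the standard genetic code is an assumption, since the page does not say which translation table was used.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; not stated on the page).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 15 bases of the CDS shown on this page.
    print(translate("ATGTCACCTTATATG"))
```

The first five codons of the CDS (ATG TCA CCT TAT ATG) translate to MSPYM, which matches the start of the protein sequence. Passing the full CDS, with its line breaks removed, allows the same comparison for the whole record.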