; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015610 (gene) of Snake gourd v1 genome

Gene IDTan0015610
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG01:104762963..104763652
RNA-Seq ExpressionTan0015610
SyntenyTan0015610
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]1.7e-5551.9Show/hide
Query:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG
        MANA      P S  +  F  PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHL G  P P  F+ +   S TT     +D    ASSS    
Subjt:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG

Query:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL
         VN L+E W+T                    +MG+ N +DLWDA Q  +G+QSRAEEDFLRQ+ Q TRKGN KM +YL +MK + DNLGQ GSPV  RAL
Subjt:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL

Query:  ISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELLVF
        ISQVLLGLDE YN V+  +QGKPD+ W D+ ++LL+F
Subjt:  ISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELLVF

TYJ96311.1 uncharacterized protein E5676_scaffold1970G00140 [Cucumis melo var. makuwa]1.8e-4150.74Show/hide
Query:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG
        MANA      P S  +  F  PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHL G  P P  F+ +   S TT     +D    ASSS    
Subjt:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG

Query:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL
         VN L+E W+T                    +MG+ N +DLWDA Q  +G+QSRAEEDFLRQ+ Q TRKGN KM +YL +MK + DNLGQ GSPV  RAL
Subjt:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL

Query:  ISQ
        ISQ
Subjt:  ISQ

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]4.8e-3955.56Show/hide
Query:  MFLQNGDGSGTTSDV------ASSSSVEGTVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGN
        MF+Q   G+  TS        +SS + E  +NPLYE+W+T                    VMGY+N+ DLW AIQ L+G+QS+AEED+LRQVFQQTRKG+
Subjt:  MFLQNGDGSGTTSDV------ASSSSVEGTVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGN

Query:  QKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNE
         KM D+LR+MK HADNLGQAGSPV  R+LISQVLLGLDEEYNPVVAT+QGK  + W ++  E
Subjt:  QKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNE

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]8.2e-5554.5Show/hide
Query:  FGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLT---------------
        F +PPLNQLLNQ+TSIK+DR NFLLW+NLALPILRSYKL  +L G KP PP  L   D + T  + ++SS    T+NP YEAW+                
Subjt:  FGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLT---------------

Query:  -----VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQ
             VMG+  S++LW A+Q L+G+QSRAE D+L+QVFQQT KG+ +M++YL++MK HADNL  AGS VS R L+SQVL GLDEEYNP+V  +QGK ++ 
Subjt:  -----VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQ

Query:  WSDVHNELLVF
        WS++H ELL +
Subjt:  WSDVHNELLVF

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]9.7e-4850.9Show/hide
Query:  TSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMF----------LQNGDGSGTTSD--------------VASSSSVEGTVNPLYEA-------
        T+IKLD+ N+LLW+NLALPILRSY+LEGHL G  P PP F          +  GD +G                   AS+SS    VNP YE+       
Subjt:  TSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMF----------LQNGDGSGTTSD--------------VASSSSVEGTVNPLYEA-------

Query:  ---WL----------TVMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPV
           WL           VMGY+N K LW AIQ L+G+QSRA ED+LRQVFQQT KG  KM +YLR+MK H+DNLG  GSPV  RAL+SQVLLGLDEE+NP 
Subjt:  ---WL----------TVMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPV

Query:  VATLQGKPDVQWSDVHNELLVF
        VAT+QG+ ++ W+++  ELL F
Subjt:  VATLQGKPDVQWSDVHNELLVF

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein8.0e-5651.9Show/hide
Query:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG
        MANA      P S  +  F  PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHL G  P P  F+ +   S TT     +D    ASSS    
Subjt:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG

Query:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL
         VN L+E W+T                    +MG+ N +DLWDA Q  +G+QSRAEEDFLRQ+ Q TRKGN KM +YL +MK + DNLGQ GSPV  RAL
Subjt:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL

Query:  ISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELLVF
        ISQVLLGLDE YN V+  +QGKPD+ W D+ ++LL+F
Subjt:  ISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELLVF

A0A5D3BCH9 Uncharacterized protein8.6e-4250.74Show/hide
Query:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG
        MANA      P S  +  F  PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHL G  P P  F+ +   S TT     +D    ASSS    
Subjt:  MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTT-----SDV---ASSSSVEG

Query:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL
         VN L+E W+T                    +MG+ N +DLWDA Q  +G+QSRAEEDFLRQ+ Q TRKGN KM +YL +MK + DNLGQ GSPV  RAL
Subjt:  TVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRAL

Query:  ISQ
        ISQ
Subjt:  ISQ

A0A5D3E3L7 Uncharacterized protein4.9e-3748.97Show/hide
Query:  PILRSYKLEGHLLGSKPRPPMFLQN--GDGSGTTSDVASSSSVEGT------------VNPLYEAWLT--------------------VMGYDNSKDLWD
        P     ++ G   G    P  FL +  G+ + T    ASS   EGT            VNP YE W+T                    +MG++ +KDLW+
Subjt:  PILRSYKLEGHLLGSKPRPPMFLQN--GDGSGTTSDVASSSSVEGT------------VNPLYEAWLT--------------------VMGYDNSKDLWD

Query:  AIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELLVF
        AIQ L+GI+SRAEE FLR  FQ TR+GN KM DYLRIMK +ADNLGQAGSPV +R LISQVLLGLDE YNPV A +QGKPD+ W D+ +ELL+F
Subjt:  AIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELLVF

A0A6J1D5J0 uncharacterized protein LOC1110175012.3e-3955.56Show/hide
Query:  MFLQNGDGSGTTSDV------ASSSSVEGTVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGN
        MF+Q   G+  TS        +SS + E  +NPLYE+W+T                    VMGY+N+ DLW AIQ L+G+QS+AEED+LRQVFQQTRKG+
Subjt:  MFLQNGDGSGTTSDV------ASSSSVEGTVNPLYEAWLT--------------------VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGN

Query:  QKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNE
         KM D+LR+MK HADNLGQAGSPV  R+LISQVLLGLDEEYNPVVAT+QGK  + W ++  E
Subjt:  QKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNE

A0A6J1DCW4 uncharacterized protein LOC1110195984.0e-5554.5Show/hide
Query:  FGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLT---------------
        F +PPLNQLLNQ+TSIK+DR NFLLW+NLALPILRSYKL  +L G KP PP  L   D + T  + ++SS    T+NP YEAW+                
Subjt:  FGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLT---------------

Query:  -----VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQ
             VMG+  S++LW A+Q L+G+QSRAE D+L+QVFQQT KG+ +M++YL++MK HADNL  AGS VS R L+SQVL GLDEEYNP+V  +QGK ++ 
Subjt:  -----VMGYDNSKDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQ

Query:  WSDVHNELLVF
        WS++H ELL +
Subjt:  WSDVHNELLVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.7e-0524.53Show/hide
Query:  IKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLTVMGYDNSKDLWDAIQLLYGIQSRAEEDFLR
        + ++ SN+  W+ L L    S+ + GH+ G+     +     D +    D     S+ GT+ P       V     S+D+W  I+  +     A    L 
Subjt:  IKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLTVMGYDNSKDLWDAIQLLYGIQSRAEEDFLR

Query:  QVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQ
           +    G+ ++ DY R MK  AD+L     PV++R L+  VL GL+ +++ ++  ++
Subjt:  QVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQ

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.4e-0622.03Show/hide
Query:  SIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLTVMGYDNSKDLWDAIQLLYGIQSRAEEDFL
        ++ L++ N+ +W+ L   +  S+ + GH+ GS    PM     +      D      + GT+       +  +G   ++DLW +++ L+     A     
Subjt:  SIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLTVMGYDNSKDLWDAIQLLYGIQSRAEEDFL

Query:  RQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGK-PDVQWSDVHNELLV
            + T   +  + +Y + +K  +D L    SP+S+R L+  +L GL E+Y+ ++  ++ K P   +++  + LL+
Subjt:  RQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGK-PDVQWSDVHNELLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAACGCCTCATTAAATCCTGGGATGCCACAATCGGCTGGAAACAACAATTTCGGGACTCCGCCGTTGAATCAACTTCTCAATCAAGTAACATCAATAAAGCTAGA
TAGAAGCAACTTTCTCCTCTGGAAAAACCTTGCCCTTCCAATTCTCCGGAGCTACAAACTGGAGGGTCATCTCCTCGGTTCAAAACCGCGTCCTCCAATGTTTCTACAGA
ATGGTGATGGGTCTGGAACGACGAGCGATGTAGCATCATCCTCATCTGTTGAAGGTACTGTGAATCCCCTATATGAAGCGTGGCTAACAGTAATGGGTTATGATAATTCG
AAAGACCTTTGGGATGCTATTCAACTTTTATATGGCATCCAATCCAGAGCAGAGGAGGACTTTCTCCGCCAAGTTTTCCAGCAAACCAGAAAAGGCAATCAGAAGATGAT
GGACTATCTTCGTATAATGAAGTGCCACGCCGACAACTTAGGACAAGCTGGAAGTCCAGTGTCGAACAGAGCCCTAATTTCTCAAGTTCTTCTTGGCCTAGACGAGGAAT
ACAACCCAGTGGTGGCTACTCTTCAAGGTAAGCCTGATGTTCAATGGTCTGATGTTCATAATGAACTCCTTGTTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAACGCCTCATTAAATCCTGGGATGCCACAATCGGCTGGAAACAACAATTTCGGGACTCCGCCGTTGAATCAACTTCTCAATCAAGTAACATCAATAAAGCTAGA
TAGAAGCAACTTTCTCCTCTGGAAAAACCTTGCCCTTCCAATTCTCCGGAGCTACAAACTGGAGGGTCATCTCCTCGGTTCAAAACCGCGTCCTCCAATGTTTCTACAGA
ATGGTGATGGGTCTGGAACGACGAGCGATGTAGCATCATCCTCATCTGTTGAAGGTACTGTGAATCCCCTATATGAAGCGTGGCTAACAGTAATGGGTTATGATAATTCG
AAAGACCTTTGGGATGCTATTCAACTTTTATATGGCATCCAATCCAGAGCAGAGGAGGACTTTCTCCGCCAAGTTTTCCAGCAAACCAGAAAAGGCAATCAGAAGATGAT
GGACTATCTTCGTATAATGAAGTGCCACGCCGACAACTTAGGACAAGCTGGAAGTCCAGTGTCGAACAGAGCCCTAATTTCTCAAGTTCTTCTTGGCCTAGACGAGGAAT
ACAACCCAGTGGTGGCTACTCTTCAAGGTAAGCCTGATGTTCAATGGTCTGATGTTCATAATGAACTCCTTGTTTTTTAA
Protein sequenceShow/hide protein sequence
MANASLNPGMPQSAGNNNFGTPPLNQLLNQVTSIKLDRSNFLLWKNLALPILRSYKLEGHLLGSKPRPPMFLQNGDGSGTTSDVASSSSVEGTVNPLYEAWLTVMGYDNS
KDLWDAIQLLYGIQSRAEEDFLRQVFQQTRKGNQKMMDYLRIMKCHADNLGQAGSPVSNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELLVF