; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021997 (gene) of Snake gourd v1 genome

Gene IDTan0021997
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationLG07:34778812..34779579
RNA-Seq ExpressionTan0021997
SyntenyTan0021997
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157414.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 [Momordica charantia]6.1e-1750.43Show/hide
Query:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN
        FVP+ FH      L+ LRQGSKSVE YY EM TL+  L++ ED    MARF  GLNKEIA ++DLQ Y ++E M+H+ IKIEK+LQR+S  Y+  KP  +
Subjt:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN

Query:  VHSTLEKVSLPIENK
          ST +  S   ++K
Subjt:  VHSTLEKVSLPIENK

XP_022158803.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025268 [Momordica charantia]9.4e-1851.3Show/hide
Query:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN
        FVP+ FH      L+ LRQGSKSVE YY EM TL+  L++ ED    MARF  GLNKEIA ++DLQ Y ++E M+H+ IKIEK+LQR+S RY+  KP  +
Subjt:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN

Query:  VHSTLEKVSLPIENK
          ST +  S   ++K
Subjt:  VHSTLEKVSLPIENK

XP_022159097.1 uncharacterized protein LOC111025539 [Momordica charantia]2.9e-1945.97Show/hide
Query:  HFVPKFFHNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILNVHST
        HF+      L+ LRQG+KSVE YY +M+TL+E ++I E+E  TMARF  GLN E+A ++DLQ YED+E +VH ++KIEK++QR+ +RY   KP  N  S+
Subjt:  HFVPKFFHNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILNVHST

Query:  LEKVSLPIENKPIVEVESGKIQNE
         +K     ++K  V  E  K + E
Subjt:  LEKVSLPIENKPIVEVESGKIQNE

XP_022946091.1 uncharacterized protein LOC111450286, partial [Cucurbita moschata]1.0e-1633.33Show/hide
Query:  MWEHFVPKFF-----HNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK
        M +HFVP++F       L+ L+QG KSVE YY EM TL++ LE+ ED    MARF  GLN EIA + DLQ Y ++E ++HI IKIE+++QR+S RY+  K
Subjt:  MWEHFVPKFF-----HNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK

Query:  PILNVHSTLEKVSLPIE----NKPIVE------------------VESGKIQNEMLCNLVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELG
           N  ST +K S  I+    N+ I E                  VE   ++N  L          ++R  DC +  +         T DE H D+ E  
Subjt:  PILNVHSTLEKVSLPIE----NKPIVE------------------VESGKIQNEMLCNLVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELG

Query:  KKAKKVETKACQLFEKESCEKMSDREKHEISLWLETNERKANNSSLCERGCEFKSILF
                      E +  E+ S+ +   ISL      R+A N+ + E G + +  LF
Subjt:  KKAKKVETKACQLFEKESCEKMSDREKHEISLWLETNERKANNSSLCERGCEFKSILF

XP_038887118.1 uncharacterized protein K02A2.6-like [Benincasa hispida]9.4e-1850Show/hide
Query:  MWEHFVPKFF-----HNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK
        M +HFVP  F       L+ LRQG+KSVE YY EM  L++ L++ ED    MARF  GLNKEIA ++DLQ Y D+E M+H+ IK+EK L  K  RYT  K
Subjt:  MWEHFVPKFF-----HNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK

Query:  PILNVHST
        P  +++S+
Subjt:  PILNVHST

TrEMBL top hitse value%identityAlignment
A0A6J1DWE9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110241152.9e-1750.43Show/hide
Query:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN
        FVP+ FH      L+ LRQGSKSVE YY EM TL+  L++ ED    MARF  GLNKEIA ++DLQ Y ++E M+H+ IKIEK+LQR+S  Y+  KP  +
Subjt:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN

Query:  VHSTLEKVSLPIENK
          ST +  S   ++K
Subjt:  VHSTLEKVSLPIENK

A0A6J1DX46 LOW QUALITY PROTEIN: uncharacterized protein LOC1110252684.5e-1851.3Show/hide
Query:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN
        FVP+ FH      L+ LRQGSKSVE YY EM TL+  L++ ED    MARF  GLNKEIA ++DLQ Y ++E M+H+ IKIEK+LQR+S RY+  KP  +
Subjt:  FVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILN

Query:  VHSTLEKVSLPIENK
          ST +  S   ++K
Subjt:  VHSTLEKVSLPIENK

A0A6J1DYV9 uncharacterized protein LOC1110255393.5e-1832.77Show/hide
Query:  HFVPKFFHNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILNVHST
        HF+      L+ LRQG+KSVE YY +M+TL+E ++I E+E  TMARF  GLN E+A ++DLQ YED+E +VH ++KIEK++QR+ +RY   KP  N  S+
Subjt:  HFVPKFFHNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILNVHST

Query:  LEKVSLPIENKPIVEVESGKIQNEMLCN---LVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELGKKAKKVETKACQLFEKESCEKMSDREK
         +K     ++K  V  E  K + E          TH    +R L C         +   P    + +   EL   A +V+ +       ES   +   E+
Subjt:  LEKVSLPIENKPIVEVESGKIQNEMLCN---LVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELGKKAKKVETKACQLFEKESCEKMSDREK

Query:  HEISLWLETNERKANNSSLCERGCEFKSILFSKQNIFY
         E + +L    R+A ++ + E   E +     ++N+F+
Subjt:  HEISLWLETNERKANNSSLCERGCEFKSILFSKQNIFY

A0A6J1G2Q3 uncharacterized protein LOC1114502865.0e-1733.33Show/hide
Query:  MWEHFVPKFF-----HNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK
        M +HFVP++F       L+ L+QG KSVE YY EM TL++ LE+ ED    MARF  GLN EIA + DLQ Y ++E ++HI IKIE+++QR+S RY+  K
Subjt:  MWEHFVPKFF-----HNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK

Query:  PILNVHSTLEKVSLPIE----NKPIVE------------------VESGKIQNEMLCNLVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELG
           N  ST +K S  I+    N+ I E                  VE   ++N  L          ++R  DC +  +         T DE H D+ E  
Subjt:  PILNVHSTLEKVSLPIE----NKPIVE------------------VESGKIQNEMLCNLVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELG

Query:  KKAKKVETKACQLFEKESCEKMSDREKHEISLWLETNERKANNSSLCERGCEFKSILF
                      E +  E+ S+ +   ISL      R+A N+ + E G + +  LF
Subjt:  KKAKKVETKACQLFEKESCEKMSDREKHEISLWLETNERKANNSSLCERGCEFKSILF

A0A6J1I8S0 uncharacterized protein LOC1114724891.5e-1632.56Show/hide
Query:  MWEHFVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK
        M + FVP++FH      L+ L+QG KSVE YY EM TL++ LE+ ED    MARF  GLN EIA + DLQ Y ++E ++HI IKIE+++QR+S RY+  K
Subjt:  MWEHFVPKFFH-----NLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPK

Query:  PILNVHSTLEKVSLPIEN---------KPIVEVESGK-------------IQNEMLCNLVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELG
           N  ST ++ S  I+          KP  + E G+             ++N  L          ++R  DC +  +         T DE H D+ E  
Subjt:  PILNVHSTLEKVSLPIEN---------KPIVEVESGK-------------IQNEMLCNLVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELG

Query:  KKAKKVETKACQLFEKESCEKMSDREKHEISLWLETNERKANNSSLCERGCEFKSILF
                      E +  E+ S+ +   ISL      R+A N+ + E G + +  LF
Subjt:  KKAKKVETKACQLFEKESCEKMSDREKHEISLWLETNERKANNSSLCERGCEFKSILF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGAGCACTTTGTTCCTAAGTTTTTCCATAACTTGCGAACTTTGAGACAAGGGAGCAAAAGTGTGGAGGCTTACTACATGGAGATGCAAACGTTGCTTGAAGATCT
CGAAATTTATGAGGATGAGACGGTTACAATGGCTCGATTCTTTAGAGGACTTAACAAGGAGATTGCAATTCAACTTGATCTTCAATCTTATGAGGATTTGGAGAGGATGG
TGCACATAACCATAAAGATTGAAAAACGTCTCCAAAGAAAGTCTACAAGGTACACTAAACCTAAACCAATTTTAAATGTTCATTCAACTTTGGAAAAGGTTAGTTTACCT
ATTGAAAATAAGCCTATTGTTGAGGTAGAAAGTGGTAAGATTCAAAATGAAATGCTTTGTAATTTAGTGTCAACACATGCTAGTGAGTTTACTAGGAGATTAGATTGCAA
AGACTTGTTTGTTTGTAACACTTCTACTATTGTAACACCTACTCTTGATGAAGTACATGTTGATGTGTGTGAGTTAGGGAAAAAGGCAAAAAAGGTCGAGACAAAAGCAT
GTCAATTGTTTGAAAAAGAATCTTGTGAGAAAATGAGTGATAGAGAAAAGCATGAGATTAGTCTATGGCTAGAGACAAATGAGAGAAAAGCCAATAATAGTAGCTTGTGT
GAAAGAGGTTGTGAGTTTAAATCTATTTTGTTTTCAAAGCAAAATATTTTTTATCTTTTATATAAAGGAACTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATTGAATCATGGAATGTTCTTAAGCAAATAATGTGGGAGCACTTTGTTCCTAAGTTTTTCCATAACTTGCGAACTTTGAGACAAGGGAGCAAAAGTGTGGAGGCTTACTA
CATGGAGATGCAAACGTTGCTTGAAGATCTCGAAATTTATGAGGATGAGACGGTTACAATGGCTCGATTCTTTAGAGGACTTAACAAGGAGATTGCAATTCAACTTGATC
TTCAATCTTATGAGGATTTGGAGAGGATGGTGCACATAACCATAAAGATTGAAAAACGTCTCCAAAGAAAGTCTACAAGGTACACTAAACCTAAACCAATTTTAAATGTT
CATTCAACTTTGGAAAAGGTTAGTTTACCTATTGAAAATAAGCCTATTGTTGAGGTAGAAAGTGGTAAGATTCAAAATGAAATGCTTTGTAATTTAGTGTCAACACATGC
TAGTGAGTTTACTAGGAGATTAGATTGCAAAGACTTGTTTGTTTGTAACACTTCTACTATTGTAACACCTACTCTTGATGAAGTACATGTTGATGTGTGTGAGTTAGGGA
AAAAGGCAAAAAAGGTCGAGACAAAAGCATGTCAATTGTTTGAAAAAGAATCTTGTGAGAAAATGAGTGATAGAGAAAAGCATGAGATTAGTCTATGGCTAGAGACAAAT
GAGAGAAAAGCCAATAATAGTAGCTTGTGTGAAAGAGGTTGTGAGTTTAAATCTATTTTGTTTTCAAAGCAAAATATTTTTTATCTTTTATATAAAGGAACTTGTTGA
Protein sequenceShow/hide protein sequence
MWEHFVPKFFHNLRTLRQGSKSVEAYYMEMQTLLEDLEIYEDETVTMARFFRGLNKEIAIQLDLQSYEDLERMVHITIKIEKRLQRKSTRYTKPKPILNVHSTLEKVSLP
IENKPIVEVESGKIQNEMLCNLVSTHASEFTRRLDCKDLFVCNTSTIVTPTLDEVHVDVCELGKKAKKVETKACQLFEKESCEKMSDREKHEISLWLETNERKANNSSLC
ERGCEFKSILFSKQNIFYLLYKGTC