CuGenDBv2

Tan0021202 (gene) of Snake gourd v1 genome

Gene ID: Tan0021202
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: LG08:23356520..23358968
RNA-Seq Expression: Tan0021202
Synteny: Tan0021202
Gene Ontology terms: NA
InterPro domains: NA
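
The genome location string can be split into a sequence name and a coordinate pair. The following is a minimal sketch in Python; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG08:23356520..23358968")
# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2449 bp on LG08.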


Homology
GenBank top hits | e value | % identity
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera] | 3.6e-23 | 44.9
Query:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS
        +C KIGH   +CY+R+  +        +R+SSP  +    Y        +I T     D NWYP  GA  H+T N  NL  S+++ G NQVHVGN   LS
Subjt:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS

Query:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
        I H G     SP  ++   LN+LLHVPSITKNL+SVS+FA DN VFF
Subjt:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 3.2e-24 | 45.64
Query:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL
        IC K+G++A RC+FRY P S  +   P + ++S       N         ++   + N D NWYP  GA  HLT++  NLS+ S+Y G NQ++  N   L
Subjt:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL

Query:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
         I+H+G  S  S   P ++F LNNLL VPSITKNLISVSQFA DN VFF
Subjt:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

KYP46802.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | 2.1e-23 | 44.22
Query:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS
        IC KIGH A  CY+RY     N     N    P        G        + TP+  QD  WYP  GA YH+T++  NLS+ S Y G + V++GN   L 
Subjt:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS

Query:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
        ISH G+    +P  ++ F  +NLLHVPSITKNL+SVS+FA DN VFF
Subjt:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 3.2e-24 | 45.64
Query:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL
        IC K+G++A RC+FRY P S  +   P + ++S       N         ++   + N D NWYP  GA  HLT++  NLS+ S+Y G NQ++  N   L
Subjt:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL

Query:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
         I+H+G  S  S   P ++F LNNLL VPSITKNLISVSQFA DN VFF
Subjt:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

XP_022147537.1 uncharacterized protein LOC111016442 [Momordica charantia] | 4.5e-26 | 62.14
Query:  SPHM-VLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNS
        SP M  L+TT + N D NWYP  GA  HLT+NF NL V ++YL +NQV VGN     I HFG+ S  SP N+  HLNNLLHVP ITKNLISVSQFA DNS
Subjt:  SPHM-VLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNS

Query:  VFF
        +FF
Subjt:  VFF

TrEMBL top hits | e value | % identity
A0A151RW78 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.0e-23 | 44.22
Query:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS
        IC KIGH A  CY+RY     N     N    P        G        + TP+  QD  WYP  GA YH+T++  NLS+ S Y G + V++GN   L 
Subjt:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS

Query:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
        ISH G+    +P  ++ F  +NLLHVPSITKNL+SVS+FA DN VFF
Subjt:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.6e-24 | 45.64
Query:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL
        IC K+G++A RC+FRY P S  +   P + ++S       N         ++   + N D NWYP  GA  HLT++  NLS+ S+Y G NQ++  N   L
Subjt:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL

Query:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
         I+H+G  S  S   P ++F LNNLL VPSITKNLISVSQFA DN VFF
Subjt:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.6e-24 | 45.64
Query:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL
        IC K+G++A RC+FRY P S  +   P + ++S       N         ++   + N D NWYP  GA  HLT++  NLS+ S+Y G NQ++  N   L
Subjt:  ICNKIGHTATRCYFRYAP-SRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDL

Query:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
         I+H+G  S  S   P ++F LNNLL VPSITKNLISVSQFA DN VFF
Subjt:  SISHFGYGSITSP--PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

A0A6J1D2M1 uncharacterized protein LOC111016442 | 2.2e-26 | 62.14
Query:  SPHM-VLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNS
        SP M  L+TT + N D NWYP  GA  HLT+NF NL V ++YL +NQV VGN     I HFG+ S  SP N+  HLNNLLHVP ITKNLISVSQFA DNS
Subjt:  SPHM-VLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNS

Query:  VFF
        +FF
Subjt:  VFF

A5BFT3 Integrase catalytic domain-containing protein | 1.7e-23 | 44.9
Query:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS
        +C KIGH   +CY+R+  +        +R+SSP  +    Y        +I T     D NWYP  GA  H+T N  NL  S+++ G NQVHVGN   LS
Subjt:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLS

Query:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF
        I H G     SP  ++   LN+LLHVPSITKNL+SVS+FA DN VFF
Subjt:  ISHFGYGSITSP-PNQTFHLNNLLHVPSITKNLISVSQFANDNSVFF

SwissProt top hits | e value | % identity
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 2.5e-11 | 35.37
Query:  ICNKIGHTATRC-YFRYAPSRLNVVPPGNRSSSPEDFGQP--NYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDV
        IC   GH+A RC   ++  S +N   P     SP    QP  N  + SP+             NW    GA +H+T++F NLS+   Y G + V V +  
Subjt:  ICNKIGHTATRC-YFRYAPSRLNVVPPGNRSSSPEDFGQP--NYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDV

Query:  DLSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNSV
         + ISH G  S+ S  ++  +L+N+L+VP+I KNLISV +  N N V
Subjt:  DLSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNSV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.3e-11 | 34.25
Query:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQP--NYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVD
        IC+  GH+A RC   +   +        +S+SP    QP  N  V SP         YN + NW    GA +H+T++F NLS    Y G + V + +   
Subjt:  ICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQP--NYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVD

Query:  LSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNSV
        + I+H G  S+ +  +++  LN +L+VP+I KNLISV +  N N V
Subjt:  LSISHFGYGSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNSV

Arabidopsis top hits | e value | % identity
No hits found
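
The % identity values listed for each hit can be checked against the alignment blocks. In BLAST-style output the middle line shows a letter where query and subject are identical and `+` where the substitution is conservative, so identity is the number of letters on the middle line divided by the alignment length. The sketch below is only an illustration: `percent_identity` is a hypothetical helper, and it reads the middle line rather than the `Subjt:` line because the `Subjt:` lines in this record repeat the query text.

```python
from typing import List, Tuple


def percent_identity(blocks: List[Tuple[str, str]]) -> float:
    """Compute % identity from (query_segment, midline_segment) pairs.

    Each pair is one wrapped block of the alignment with the 'Query:  '
    prefix (and the matching indent on the middle line) already removed.
    """
    columns = 0
    identical = 0
    for query_segment, midline_segment in blocks:
        # Alignment length counts every column, including gap columns.
        columns += len(query_segment)
        # Letters on the middle line mark identical residues; '+' and
        # spaces mark conservative substitutions and mismatches.
        identical += sum(1 for ch in midline_segment if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# Last wrapped block of the Momordica charantia hit: 2 of 3 columns match.
print(percent_identity([("VFF", "+FF")]))
```

Passing every wrapped block of a hit in order gives the value for the full alignment.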

Sequences
CDS sequence
ATGCCAAATTTAATTTGTAATAAAATTGGTCATACAGCTACTAGATGTTATTTTCGTTATGCTCCTTCTCGTCTGAATGTTGTGCCTCCAGGTAACCGTTCTTCTTCCCC
TGAAGATTTTGGACAACCTAATTATGGCGTTCAATCTCCTCATATGGTTTTGATTACTACACCAAACTACAATCAAGATTGCAATTGGTACCCTGGTTTTGGAGCAAAAT
ATCACCTTACAAACAATTTTGGCAATCTTTCAGTGAGTTCTGACTATCTTGGCAATAATCAAGTTCATGTTGGTAATGATGTCGATTTGTCTATCTCTCACTTTGGTTAT
GGCAGTATTACTTCCCCTCCCAACCAAACGTTTCACCTAAATAATCTTTTGCATGTTCCTTCAATAACCAAAAATTTGATTAGTGTTAGTCAATTTGCTAATGATAATTC
TGTTTTTTTTTTAATTTCATCCTAA
mRNA sequence
GTTTTATGTGTTTCTGCCTCTCTATATATTTGAGACTACCCTCTATATCAAAATAATTCATTGAGAATTCAATGACTTGAATACGTTTGGCATTTTGTTCAGTTTGATTT
TCAGTATTTTTGTTACATTTATAATGGTATCAGAGCTACAGTCTTGCGACTACTTTTTTTTTCTTCTCCTTTTCGATGGAGTCTTCTAAAGCAAACTCTGAATTGTCAAG
CTTCAAACTACATCAATAATCAACCTGGGAATAAGGTCTTGATGGTCAAATTGAGTAATGAAAATTTTCTTCTCTGGAAGTTTCAAATCGAATTTGCACTTGAAGGTCAT
GGGCTCTTGGAAAAGCATTTTGGTGAAAATTTAGGCCCTCCTCCCAAAACCCTACAATGTGTTCAGAGAATGTTCAAGATGTTATGGCACTCCTCCTCACTCATGAGAAT
CAGATTCAGAGTAAGGCTATAACACGGATGGAACACTACCCTCTGCTCATCTTTCCACTCATAATGTTATCTCTAAAGAAGAAGAAAACCAAAGAAGTGTTAACCATATG
ATGAGTTACCAAAATTTTAATAATAACAAGGGTCGAGGGCGAATGAATAATGGTCAAAATAGAGGAGGGCGGTATTCATGGAGCAACTGCAATAAGCCACAATGCCAAAT
TTAATTTGTAATAAAATTGGTCATACAGCTACTAGATGTTATTTTCGTTATGCTCCTTCTCGTCTGAATGTTGTGCCTCCAGGTAACCGTTCTTCTTCCCCTGAAGATTT
TGGACAACCTAATTATGGCGTTCAATCTCCTCATATGGTTTTGATTACTACACCAAACTACAATCAAGATTGCAATTGGTACCCTGGTTTTGGAGCAAAATATCACCTTA
CAAACAATTTTGGCAATCTTTCAGTGAGTTCTGACTATCTTGGCAATAATCAAGTTCATGTTGGTAATGATGTCGATTTGTCTATCTCTCACTTTGGTTATGGCAGTATT
ACTTCCCCTCCCAACCAAACGTTTCACCTAAATAATCTTTTGCATGTTCCTTCAATAACCAAAAATTTGATTAGTGTTAGTCAATTTGCTAATGATAATTCTGTTTTTTT
TTTAATTTCATCCTAAATTTTGTTTTGTGAAGAATCTAACAACTTGTTATTCTAAGGATACTCTCCATGATGGCCTATACCGTTTCAATCTATCGCCTCAACATCATTTA
TTCAACAAGCATCCTCAAGCTAATGTTTCCTCAACAATATTGTCTTTTCCCTTAGGTTCTGTTTTTTTCACCTATGCGGACCCTCCTAGTCTTGACTTGTGGCATCAATG
ATAAGGGTAATCTAGTCTGTTAATTGTTCAACAAGTTGTTAAACATTGTTAATCGTCTATTGTGTCTAATAAAGCTTCTTCTTTTTGTTATACTTGTGTTATTGGAAAGA
ATCATGTTCTCCCTTTGTGATGCTTGTGCTAATGTGTTAAGGGATTGTGCATTGTATGATGTGAGAATTCATGTGTTATTGGCATAGCATAATGTGGAATGTATGTGTTA
AAGTGGTCTTTGGTTTATATTGCTTGTGTTTGAATGCAAAGCATGATGTTTAAGTTAGTTAGTTGCAAAAGTATGTTTGGATGTGTGTGAACAACATGTTGATTGTTGTT
TGTTGGTTGGTGGCTCCCAGGTATGAGCTAATTACTCAGTGCGTGTTATATATTGACCCATCACCAGATTTTGCAGGTGATGACTAACTATGAGGAACTTGGTGATGTTG
ACAAGGAAGCATGCTAAGGAGACCCTTCTTTGTTGGAGTTCTGCTAGGGTTGCTAGATAGGGTTTAGATGTTCATTTTCATGAATAAGAGCATGTAAAGACATTTTTGGT
TATTTAATTATCATATTCGTTTTGTGTTGCATATATAGGTTCTTTTGGTAGGGCTTAAGTTCAGGATTATTGGCTTTTGGTTTACTAACGTATTTATTGTTTCCGCTCTG
ATTTAATTTGAATACGTGTTAAGTAGTTGAGGTTTTTTGTACTTCGAAGTGTTGGATTTGAAACGTAGTTTGAAGGTGGAGCTTTGGG
Protein sequence
MPNLICNKIGHTATRCYFRYAPSRLNVVPPGNRSSSPEDFGQPNYGVQSPHMVLITTPNYNQDCNWYPGFGAKYHLTNNFGNLSVSSDYLGNNQVHVGNDVDLSISHFGY
GSITSPPNQTFHLNNLLHVPSITKNLISVSQFANDNSVFFLISS
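
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch in Python, assuming the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are names introduced here for illustration and are not part of CuGenDB.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCCAAATTTAATTTGTAATAAA"))  # MPNLICNK
```

The same function accepts the full CDS pasted with its line breaks, and the result can then be compared with the protein sequence above.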