; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014711 (gene) of Snake gourd v1 genome

Gene IDTan0014711
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionhistone-lysine N-methyltransferase SETD1A
Genome locationLG04:9904599..9905474
RNA-Seq ExpressionTan0014711
SyntenyTan0014711
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601423.1 hypothetical protein SDJN03_06656, partial [Cucurbita argyrosperma subsp. sororia]9.6e-4163.24Show/hide
Query:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISS-----------PSSNQNNNISHSQPQPTTTTDSASLQDYTNF
        + SKRQRDEAQ+EEM+EGEELKRQKSY+QILSLLEEEEEE VEDLSSII++LQQEISS           P S QNN       +   + +  S++DY   
Subjt:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISS-----------PSSNQNNNISHSQPQPTTTTDSASLQDYTNF

Query:  ASTSTSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV---ADGVALCDALWELEDEAANYYTLFQSQLFM
         ++    SSSSV         EEE  ER+RVMRHLLEASDDELGIPN+EF+V    DGVALCDALWELEDEAANYYTLF SQLFM
Subjt:  ASTSTSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV---ADGVALCDALWELEDEAANYYTLFQSQLFM

XP_008446332.1 PREDICTED: histone-lysine N-methyltransferase SETD1A [Cucumis melo]2.8e-4064.32Show/hide
Query:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLL---EEEEEEPVEDLSSIITTLQQEISSPSSNQ----NNNISHSQPQPTTTTDSASLQDYTNFASTS
        + SKRQR +A+ E++ EGEELKRQKSYNQILSLL   EE+EEE ++DLSSIITTLQQEISSP   Q       ++HS         S SLQD ++ +S+S
Subjt:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLL---EEEEEEPVEDLSSIITTLQQEISSPSSNQ----NNNISHSQPQPTTTTDSASLQDYTNFASTS

Query:  TSTSSSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM
        +S+SSSSV  SP     KE  EE+ ER++VMRHLLEASDDELGIPN EF V     DGVALCDALWELEDEAANYYTLFQSQLFM
Subjt:  TSTSSSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM

XP_011655674.1 histone-lysine N-methyltransferase SETD1A [Cucumis sativus]2.7e-4369.06Show/hide
Query:  LVSKRQRDEAQI-EEMNEGEELKRQKSYNQILSLLE--EEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTSTS
        + SKRQR +A+I EE+ EGEELKRQKSYNQILSLLE  EE+EE +EDLSSIITTLQQEISSP       I  +Q   T  T S SLQDY++ +S+S+S+S
Subjt:  LVSKRQRDEAQI-EEMNEGEELKRQKSYNQILSLLE--EEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTSTS

Query:  SSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM
        SSSV  SP     KE  EEE ER++VMRHLLEASDDELGIPN EF V     DGVALCDALWELEDEAANYYTLFQSQLFM
Subjt:  SSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM

XP_022151083.1 histone-lysine N-methyltransferase SETD1A [Momordica charantia]1.2e-4066.48Show/hide
Query:  MAGLVSKRQRDEAQI-EEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQE-ISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTS
        MAG+ SKRQR++AQI E+M+EGE+LKRQKSYNQI+SLLEEEEEE +EDLSSIITTLQQE I SP  N   N   + P  TTT         TN  S+S+S
Subjt:  MAGLVSKRQRDEAQI-EEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQE-ISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTS

Query:  TSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDE----LGIPNAEFLVAD-GV-ALCDALWELEDEAANYYTLFQSQLFM
        +S SS  SP++    +E+EEER+RVMRHLLEASDDE    LGIPN EFLV + GV +LCDALWELEDEAANYYTLFQSQLFM
Subjt:  TSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDE----LGIPNAEFLVAD-GV-ALCDALWELEDEAANYYTLFQSQLFM

XP_038891727.1 uncharacterized protein LOC120081124 [Benincasa hispida]6.6e-4267.43Show/hide
Query:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTSTSSSS
        + SKRQR+E + EE+ +GEELKRQKSYNQILSLLEEEEEE +EDLSSIITTLQQEISSP      ++           DS SLQDY    S+S+S+SSS 
Subjt:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTSTSSSS

Query:  VTSPN-STKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV---ADGVALCDALWELEDEAANYYTLFQSQLFM
          SP    KE  EEE ER++VMRHLLEASDDELGIPN EF V    DGVAL DALWELEDEAANYYTLFQSQLFM
Subjt:  VTSPN-STKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV---ADGVALCDALWELEDEAANYYTLFQSQLFM

TrEMBL top hitse value%identityAlignment
A0A1S3BFM2 histone-lysine N-methyltransferase SETD1A1.3e-4064.32Show/hide
Query:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLL---EEEEEEPVEDLSSIITTLQQEISSPSSNQ----NNNISHSQPQPTTTTDSASLQDYTNFASTS
        + SKRQR +A+ E++ EGEELKRQKSYNQILSLL   EE+EEE ++DLSSIITTLQQEISSP   Q       ++HS         S SLQD ++ +S+S
Subjt:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLL---EEEEEEPVEDLSSIITTLQQEISSPSSNQ----NNNISHSQPQPTTTTDSASLQDYTNFASTS

Query:  TSTSSSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM
        +S+SSSSV  SP     KE  EE+ ER++VMRHLLEASDDELGIPN EF V     DGVALCDALWELEDEAANYYTLFQSQLFM
Subjt:  TSTSSSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM

A0A5D3D3N5 Histone-lysine N-methyltransferase SETD1A1.3e-4064.32Show/hide
Query:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLL---EEEEEEPVEDLSSIITTLQQEISSPSSNQ----NNNISHSQPQPTTTTDSASLQDYTNFASTS
        + SKRQR +A+ E++ EGEELKRQKSYNQILSLL   EE+EEE ++DLSSIITTLQQEISSP   Q       ++HS         S SLQD ++ +S+S
Subjt:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLL---EEEEEEPVEDLSSIITTLQQEISSPSSNQ----NNNISHSQPQPTTTTDSASLQDYTNFASTS

Query:  TSTSSSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM
        +S+SSSSV  SP     KE  EE+ ER++VMRHLLEASDDELGIPN EF V     DGVALCDALWELEDEAANYYTLFQSQLFM
Subjt:  TSTSSSSV-TSPNS--TKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV----ADGVALCDALWELEDEAANYYTLFQSQLFM

A0A6J1DB83 histone-lysine N-methyltransferase SETD1A6.0e-4166.48Show/hide
Query:  MAGLVSKRQRDEAQI-EEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQE-ISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTS
        MAG+ SKRQR++AQI E+M+EGE+LKRQKSYNQI+SLLEEEEEE +EDLSSIITTLQQE I SP  N   N   + P  TTT         TN  S+S+S
Subjt:  MAGLVSKRQRDEAQI-EEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQE-ISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTS

Query:  TSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDE----LGIPNAEFLVAD-GV-ALCDALWELEDEAANYYTLFQSQLFM
        +S SS  SP++    +E+EEER+RVMRHLLEASDDE    LGIPN EFLV + GV +LCDALWELEDEAANYYTLFQSQLFM
Subjt:  TSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDE----LGIPNAEFLVAD-GV-ALCDALWELEDEAANYYTLFQSQLFM

A0A6J1H1H5 uncharacterized protein LOC1114587173.0e-4061.05Show/hide
Query:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISS----------------PSSNQNNNISHSQPQPTTTTDSASLQ
        + SKRQRDEAQ+EEM+EGEELKRQKSY+QILSLLEEEEEE VEDLSSII++LQQEISS                P S QNN       +   + +  S++
Subjt:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISS----------------PSSNQNNNISHSQPQPTTTTDSASLQ

Query:  DYTNFASTSTSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEFLVA---DGVALCDALWELEDEAANYYTLFQSQLFM
        DY    ++   +SSSSV         EEE  ER+RVMRHLLEASDDELGIPN+EF+V    DGVALCDALWELEDEAANYYTLF SQLF+
Subjt:  DYTNFASTSTSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEFLVA---DGVALCDALWELEDEAANYYTLFQSQLFM

A0A6J1JJY7 uncharacterized protein LOC1114851171.3e-4062.3Show/hide
Query:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTT---------TTDSASLQDYTNFAS
        + SKRQRDEAQ+EEM+EGEE+KRQKSYNQILSLLEEEEEE VEDLSSII++LQQEISS SS+ ++  S S    +T          T  A +++    A 
Subjt:  LVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTT---------TTDSASLQDYTNFAS

Query:  TSTSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV---ADGVALCDALWELEDEAANYYTLFQSQLFM
          +    +S    +S+   EEE  ER+RVMRHLLEASDDELGIPN+EF+V    DGVALCDALWELEDEAANYYTLF SQLFM
Subjt:  TSTSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEFLV---ADGVALCDALWELEDEAANYYTLFQSQLFM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G26920.1 unknown protein3.5e-1740.21Show/hide
Query:  KRQRDEAQIEEMNEGEELKRQK--------SYNQILSLLEEEEEEPV--EDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTS
        KR R++   E + E E  KRQK        SYNQIL LL + +E+     DL+S I  LQQEIS            S  Q    +++++++D        
Subjt:  KRQRDEAQIEEMNEGEELKRQK--------SYNQILSLLEEEEEEPV--EDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTS

Query:  TSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEF----------------LVADGVALCDALWELEDEAANYYTLFQSQLFM
              S +S  S+KE E E++ +++VM+HLLEASDDELGIPN +F                 + DG    DA WELEDEAANYYTL QS+LFM
Subjt:  TSTSSSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEF----------------LVADGVALCDALWELEDEAANYYTLFQSQLFM

AT1G69760.1 unknown protein8.1e-2245.31Show/hide
Query:  KRQRDEAQIEEMNEGEELKRQK------SYNQILSLLEEEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTSTS
        KRQRDE + E +   EE KRQK      SYNQ+L+LL++E E    D++S+ITTLQQEI    SN+  N +  +  P+               S+S S+S
Subjt:  KRQRDEAQIEEMNEGEELKRQK------SYNQILSLLEEEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTSTS

Query:  SSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEF--------------LVADGVALCD----ALWELEDEAANYYTLFQSQLFM
        SSS TS    KE +E + ++++VM+HLLEA+DDELGIPN EF                 +G +L D     LWELEDEAANYYTL QS+LFM
Subjt:  SSSVTSPNSTKELEEEEEERDRVMRHLLEASDDELGIPNAEF--------------LVADGVALCD----ALWELEDEAANYYTLFQSQLFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGATTGGTGTCAAAACGTCAAAGGGATGAAGCCCAAATAGAAGAAATGAATGAAGGGGAAGAGCTAAAACGGCAAAAATCATACAACCAAATACTGTCTCTTTT
AGAGGAAGAGGAAGAGGAACCCGTTGAGGATTTGTCCTCAATCATCACCACTCTCCAACAAGAAATCTCCTCTCCTTCTTCAAATCAAAACAACAACATCAGCCATAGCC
AGCCCCAGCCCACTACCACCACAGACTCTGCTTCTCTACAAGATTATACCAATTTTGCTTCTACTTCTACTTCTACTTCTTCCTCCTCTGTAACTTCTCCTAATTCTACC
AAGGAATTAGAGGAAGAAGAAGAAGAAAGAGACAGAGTTATGAGACACCTTCTCGAAGCTTCCGATGACGAGCTGGGGATTCCCAACGCTGAGTTTTTGGTGGCCGATGG
CGTCGCCTTGTGTGATGCCTTGTGGGAGTTGGAAGATGAGGCTGCTAATTACTACACCTTGTTTCAGTCCCAGCTTTTTATGTAG
mRNA sequenceShow/hide mRNA sequence
CTCCACTCTCTCTCCTCATCTTTTCTCTTGCCTTAGCCTTAGCCTTATTTCTGTCTCTGAAAATTTGTGCAAGAAAATGGCTGGATTGGTGTCAAAACGTCAAAGGGATG
AAGCCCAAATAGAAGAAATGAATGAAGGGGAAGAGCTAAAACGGCAAAAATCATACAACCAAATACTGTCTCTTTTAGAGGAAGAGGAAGAGGAACCCGTTGAGGATTTG
TCCTCAATCATCACCACTCTCCAACAAGAAATCTCCTCTCCTTCTTCAAATCAAAACAACAACATCAGCCATAGCCAGCCCCAGCCCACTACCACCACAGACTCTGCTTC
TCTACAAGATTATACCAATTTTGCTTCTACTTCTACTTCTACTTCTTCCTCCTCTGTAACTTCTCCTAATTCTACCAAGGAATTAGAGGAAGAAGAAGAAGAAAGAGACA
GAGTTATGAGACACCTTCTCGAAGCTTCCGATGACGAGCTGGGGATTCCCAACGCTGAGTTTTTGGTGGCCGATGGCGTCGCCTTGTGTGATGCCTTGTGGGAGTTGGAA
GATGAGGCTGCTAATTACTACACCTTGTTTCAGTCCCAGCTTTTTATGTAGCCTTTTTCCTTTCATGGATTTTAATTATAGGAGTAAATTTAATGTTAAAACAAAACAGA
GGAATATTATATTGATTGAAAAACAGGGTGTTCTCTGATCTCTGTCTCTAATGGAAGTAAATGGGAAGATAAGTGTAGGATGTCTCCGATCTTAATCATTTTACAATGAA
ATGAACAGTCGAAGACTAGTTACAACAATCCCATCCATTCTTTCTCTTCATTTTCCTTCTACATCTCTCTTTCTTTTAATTACTCTGTTTTTTTTTAACAATCCGG
Protein sequenceShow/hide protein sequence
MAGLVSKRQRDEAQIEEMNEGEELKRQKSYNQILSLLEEEEEEPVEDLSSIITTLQQEISSPSSNQNNNISHSQPQPTTTTDSASLQDYTNFASTSTSTSSSSVTSPNST
KELEEEEEERDRVMRHLLEASDDELGIPNAEFLVADGVALCDALWELEDEAANYYTLFQSQLFM