CuGenDBv2

Tan0021894 (gene) of Snake gourd v1 genome

Gene ID: Tan0021894
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Myb transcription factor
Genome location: LG01:2215537..2216374
RNA-Seq Expression: Tan0021894
Synteny: Tan0021894
Gene Ontology terms:
  GO:0032259 - methylation (biological process)
  GO:0034470 - ncRNA processing (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0005506 - iron ion binding (molecular function)
  GO:0008168 - methyltransferase activity (molecular function)
  GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
InterPro domains:
  IPR001005 - SANT/Myb domain
  IPR009057 - Homeobox-like domain superfamily
  IPR017930 - Myb domain


Homology
GenBank top hits | e value | % identity | Alignment
XP_008439216.1 PREDICTED: transcription factor LAF1-like isoform X1 [Cucumis melo] | 7.7e-67 | 66.18
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF
                            +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGF
Subjt:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF

Query:  ATCDQLW
        ATCD+LW
Subjt:  ATCDQLW

XP_008439218.1 PREDICTED: transcription factor LAF1-like isoform X2 [Cucumis melo] | 1.6e-67 | 68.16
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGFATCDQL
                      +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGFATCD+L
Subjt:  -------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGFATCDQL

Query:  W
        W
Subjt:  W

XP_008439219.1 PREDICTED: transcription factor LAF1-like isoform X3 [Cucumis melo] | 7.7e-67 | 66.18
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF
                            +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGF
Subjt:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF

Query:  ATCDQLW
        ATCD+LW
Subjt:  ATCDQLW

XP_008439220.1 PREDICTED: transcription factor LAF1-like isoform X4 [Cucumis melo] | 1.1e-65 | 66.02
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF
                            +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGF
Subjt:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF

Query:  ATCDQL
        ATCD+L
Subjt:  ATCDQL

XP_011652772.1 LOW QUALITY PROTEIN: transcription factor LAF1 [Cucumis sativus] | 1.6e-67 | 67.77
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGL SPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  ------------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEG--RTSREGYSFEMFNWDLDFEAQ
                                S VFS +DPPLEEMGSS +E+  SQP+VLFAEWLSV D NGG++MEGSFD EG  RTSREGY FEM NWDLDFE  
Subjt:  ------------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEG--RTSREGYSFEMFNWDLDFEAQ

Query:  FSDGFATCDQL
         SDGFATCDQL
Subjt:  FSDGFATCDQL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L6I1 Uncharacterized protein | 2.2e-59 | 66.67
Query:  EKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-----------------------------------
        EKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN                                   
Subjt:  EKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-----------------------------------

Query:  --SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEG--RTSREGYSFEMFNWDLDFEAQFSDGFATCDQL
          S VFS +DPPLEEMGSS +E+  SQP+VLFAEWLSV D NGG++MEGSFD EG  RTSREGY FEM NWDLDFE   SDGFATCDQL
Subjt:  --SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEG--RTSREGYSFEMFNWDLDFEAQFSDGFATCDQL

A0A1S3AXV7 transcription factor LAF1-like isoform X2 | 7.5e-68 | 68.16
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGFATCDQL
                      +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGFATCD+L
Subjt:  -------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGFATCDQL

Query:  W
        W
Subjt:  W

A0A1S3AY85 transcription factor LAF1-like isoform X4 | 5.4e-66 | 66.02
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF
                            +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGF
Subjt:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF

Query:  ATCDQL
        ATCD+L
Subjt:  ATCDQL

A0A1S3AYA7 transcription factor LAF1-like isoform X3 | 3.7e-67 | 66.18
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF
                            +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGF
Subjt:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF

Query:  ATCDQLW
        ATCD+LW
Subjt:  ATCDQLW

A0A1S3AYW8 transcription factor LAF1-like isoform X1 | 3.7e-67 | 66.18
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------
        MG  E+EKEK KKKGLWSPEEDEKLRSFILKN H CWTSVPIKAGLLR+SKSCRLRW NYLRPGLKRGMFSQQE+EKILTLHRLLGN             
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN-------------

Query:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF
                            +V S +DPPLE MGSS  ++  ++P+VLFAEWLSV D N G++MEGSFD EGR  TSREGY FEM NWDLDFE   SDGF
Subjt:  -------------------SRVFSKQDPPLEEMGSSEDEQRRSQPKVLFAEWLSVIDNNGGNAMEGSFDDEGR--TSREGYSFEMFNWDLDFEAQFSDGF

Query:  ATCDQLW
        ATCD+LW
Subjt:  ATCDQLW

SwissProt top hits | e value | % identity | Alignment
P20027 Myb-related protein Hv33 | 4.2e-23 | 65.82
Query:  KPK-KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        +PK +KGLWSPEEDEKL + I+++   CW+SVP  A L R  KSCRLRWINYLRP LKRG FSQQEE+ I+ LH++LGN
Subjt:  KPK-KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

Q8LPH6 Transcription factor MYB86 | 4.2e-23 | 60.92
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        MG      ++  +KGLWSPEEDEKL ++I ++ H CW+SVP  AGL R  KSCRLRWINYLRP LKRG FSQ EE  I+ LH  LGN
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

Q9C6U1 Transcription factor MYB83 | 1.2e-22 | 64.56
Query:  KPK-KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        KPK +KGLWSP+EDEKL  ++L N   CW+ +   AGLLR  KSCRLRWINYLRP LKRG FS QEE+ I  LH +LGN
Subjt:  KPK-KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

Q9LTV4 Transcription factor MYB10 | 1.2e-22 | 65.33
Query:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        K+G WS EE E+LRSFILKN H  W S+P  AGL+R  KSCRLRWINYLRPGLKRG F+++EE+ I+ LH+  GN
Subjt:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

Q9M0K4 Transcription factor LAF1 | 3.3e-28 | 77.33
Query:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        +KGLWSPEEDEKLRSFIL   H+CWT+VPIKAGL R+ KSCRLRWINYLRPGLKR M S +EEE ILT H  LGN
Subjt:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

Arabidopsis top hits | e value | % identity | Alignment
AT1G57560.1 myb domain protein 50 | 7.8e-25 | 66.67
Query:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        +KGLWSPEEDEKL ++I K+ H CW+SVP  AGL R  KSCRLRWINYLRP LKRG FS +E+  I+ LH +LGN
Subjt:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

AT3G48920.1 myb domain protein 45 | 6.4e-27 | 58.82
Query:  SEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        + E + ++ ++KGLWSPEEDEKLRS +LK  H CW+++P++AGL R+ KSCRLRW+NYLRPGLK+ +F++QEE  +L+LH +LGN
Subjt:  SEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

AT4G01680.1 myb domain protein 55 | 1.0e-24 | 62.07
Query:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        MG      ++  +KGLWSPEEDEKL  +I K  H CW+SVP +AGL R  KSCRLRWINYLRP LKRG FSQ EE  I+ LH +LGN
Subjt:  MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

AT4G25560.1 myb domain protein 18 | 2.3e-29 | 77.33
Query:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        +KGLWSPEEDEKLRSFIL   H+CWT+VPIKAGL R+ KSCRLRWINYLRPGLKR M S +EEE ILT H  LGN
Subjt:  KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN

AT5G52260.1 myb domain protein 19 | 3.6e-30 | 74.39
Query:  EKPK---KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN
        E+PK   +KGLWSPEED+KL+SFIL   HACWT+VPI AGL R+ KSCRLRWINYLRPGLKRG FS++EEE ILTLH  LGN
Subjt:  EKPK---KKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGN


Sequences
CDS sequence
ATGGGGTCTGAAGAATCAGAGAAAGAAAAGCCAAAGAAGAAAGGGTTATGGTCACCTGAGGAAGATGAAAAGTTGAGAAGCTTCATCCTCAAGAATGCCCATGCCTGCTG
GACCTCTGTCCCCATTAAAGCAGGGCTGTTGAGAAGCTCAAAGAGTTGCAGATTGAGATGGATCAATTACTTGAGACCTGGATTGAAAAGAGGAATGTTTAGCCAACAAG
AAGAAGAGAAAATCTTGACCCTTCATCGCTTGTTAGGCAATAGTCGAGTTTTTTCGAAGCAAGATCCGCCATTGGAGGAGATGGGAAGTTCAGAAGATGAGCAGAGGAGA
AGCCAGCCCAAGGTTTTGTTTGCAGAATGGCTTTCTGTGATTGATAATAATGGTGGGAACGCCATGGAAGGAAGCTTTGATGATGAAGGAAGAACAAGCAGAGAGGGCTA
TAGTTTTGAGATGTTCAATTGGGACCTTGATTTTGAAGCACAGTTTTCTGATGGCTTTGCAACTTGTGATCAATTATGGTAA
mRNA sequence
ATGGGGTCTGAAGAATCAGAGAAAGAAAAGCCAAAGAAGAAAGGGTTATGGTCACCTGAGGAAGATGAAAAGTTGAGAAGCTTCATCCTCAAGAATGCCCATGCCTGCTG
GACCTCTGTCCCCATTAAAGCAGGGCTGTTGAGAAGCTCAAAGAGTTGCAGATTGAGATGGATCAATTACTTGAGACCTGGATTGAAAAGAGGAATGTTTAGCCAACAAG
AAGAAGAGAAAATCTTGACCCTTCATCGCTTGTTAGGCAATAGTCGAGTTTTTTCGAAGCAAGATCCGCCATTGGAGGAGATGGGAAGTTCAGAAGATGAGCAGAGGAGA
AGCCAGCCCAAGGTTTTGTTTGCAGAATGGCTTTCTGTGATTGATAATAATGGTGGGAACGCCATGGAAGGAAGCTTTGATGATGAAGGAAGAACAAGCAGAGAGGGCTA
TAGTTTTGAGATGTTCAATTGGGACCTTGATTTTGAAGCACAGTTTTCTGATGGCTTTGCAACTTGTGATCAATTATGGTAA
Protein sequence
MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGNSRVFSKQDPPLEEMGSSEDEQRR
SQPKVLFAEWLSVIDNNGGNAMEGSFDDEGRTSREGYSFEMFNWDLDFEAQFSDGFATCDQLW
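
As a consistency check on the record above, the CDS sequence can be translated and compared with the listed protein sequence. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper and the constant names are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; one trailing stop codon is dropped."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# CDS and protein sequences copied from the Tan0021894 record above.
CDS = (
    "ATGGGGTCTGAAGAATCAGAGAAAGAAAAGCCAAAGAAGAAAGGGTTATGGTCACCTGAGGAAGATGAAAAGTTGAGAAGCTTCATCCTCAAGAATGCCCATGCCTGCTG"
    "GACCTCTGTCCCCATTAAAGCAGGGCTGTTGAGAAGCTCAAAGAGTTGCAGATTGAGATGGATCAATTACTTGAGACCTGGATTGAAAAGAGGAATGTTTAGCCAACAAG"
    "AAGAAGAGAAAATCTTGACCCTTCATCGCTTGTTAGGCAATAGTCGAGTTTTTTCGAAGCAAGATCCGCCATTGGAGGAGATGGGAAGTTCAGAAGATGAGCAGAGGAGA"
    "AGCCAGCCCAAGGTTTTGTTTGCAGAATGGCTTTCTGTGATTGATAATAATGGTGGGAACGCCATGGAAGGAAGCTTTGATGATGAAGGAAGAACAAGCAGAGAGGGCTA"
    "TAGTTTTGAGATGTTCAATTGGGACCTTGATTTTGAAGCACAGTTTTCTGATGGCTTTGCAACTTGTGATCAATTATGGTAA"
)
PROTEIN = (
    "MGSEESEKEKPKKKGLWSPEEDEKLRSFILKNAHACWTSVPIKAGLLRSSKSCRLRWINYLRPGLKRGMFSQQEEEKILTLHRLLGNSRVFSKQDPPLEEMGSSEDEQRR"
    "SQPKVLFAEWLSVIDNNGGNAMEGSFDDEGRTSREGYSFEMFNWDLDFEAQFSDGFATCDQLW"
)

if __name__ == "__main__":
    print(len(CDS), "nt;", len(PROTEIN), "aa")
    print("CDS translates to listed protein:", translate(CDS) == PROTEIN)
```

The CDS is 522 nt (173 codons plus a stop codon), while the genome location spans 838 bp, so the gene model presumably contains intronic sequence that is absent from the CDS and mRNA listings.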