; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003897 (gene) of Snake gourd v1 genome

Gene IDTan0003897
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFormin-like protein 4 isoform X2
Genome locationLG01:92175766..92176698
RNA-Seq ExpressionTan0003897
SyntenyTan0003897
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032748.1 hypothetical protein E6C27_scaffold853G00910 [Cucumis melo var. makuwa]2.7e-3047.16Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q  +T  +G+G TR V +E+YV+ +G   I+I+ EDRK +                   
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA
                 VS+EVK EIV+ L NYFI+D+NEP+V+DYL+HEM VL+RDFRCSLH +Y+KY+SP +A+KHR KRVA
Subjt:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA

KAE8651942.1 hypothetical protein Csa_006405 [Cucumis sativus]2.6e-3368.27Show/hide
Query:  IMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRK
        I ++++P+V+DYLDHEM VL+RDF CSLH +Y+K +SPTEA+KH DKRVA+DSDW RLCDRWE E FKS SEAN+KAR+ LPFTHRGGT+ FLRH++K K
Subjt:  IMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRK

Query:  KRVK
        +  K
Subjt:  KRVK

TYJ98779.1 hypothetical protein E5676_scaffold156G00880 [Cucumis melo var. makuwa]2.7e-3047.16Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q  +T  +G+G TR V +E+YV+ +G   I+I+ EDRK +                   
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA
                 VS+EVK EIV+ L NYFI+D+NEP+V+DYL+HEM VL+RDFRCSLH +Y+KY+SP +A+KHR KRVA
Subjt:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA

XP_022156301.1 uncharacterized protein LOC111023227 [Momordica charantia]1.1e-2844.44Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI
        MSS + GPGNR    SN    E +       Q  +++ ++     T ++GR  T G+ ++++V+ HG I I IE ED K VC +S KL S IG  VR  +
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSD
         L CEHWSDV+Q+ K+ I+ RL N FI+++N+PIV  YL+HEM   H DFR  LH  Y +Y SP EA+ H+ K VAR  D
Subjt:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSD

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]5.7e-3339.66Show/hide
Query:  KRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLH
        +R RG +R + ++R+V  HGRI I+I+ E  K VC N++K ++ IG + R TIPL+C+ WSDVS+EV+  +V++LL+YF  D+ +  V+ Y+   +    
Subjt:  KRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLH

Query:  RDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRK
        +++R  L+  YR++  P EA+    KR+   +DWN LC+RWET E+K  +E N K+R+ +P+ HR G+  F++ + + K
Subjt:  RDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRK

TrEMBL top hitse value%identityAlignment
A0A0A0LM17 Uncharacterized protein1.0e-4351.47Show/hide
Query:  TEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQEVKSEIVE
        +  SN+++  EQE    +S  Q        +G+    A+E+YV+A+G I IKIEL+DRK +CK+SSK NS IG+                          
Subjt:  TEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQEVKSEIVE

Query:  RLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLR
           NYFI+D+++P+V+DYLDHEM VL+RDF CSLH +Y+K +SPTEA+KH DKRVA+DSDW RLCDRWE E FKS SEAN+KAR+ LPFTHRGGT+ FLR
Subjt:  RLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLR

Query:  HRKK
        H++K
Subjt:  HRKK

A0A5A7SQC8 DUF4218 domain-containing protein1.3e-3047.16Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q  +T  +G+G TR V +E+YV+ +G   I+I+ EDRK +                   
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA
                 VS+EVK EIV+ L NYFI+D+NEP+V+DYL+HEM VL+RDFRCSLH +Y+KY+SP +A+KHR KRVA
Subjt:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA

A0A5D3BIG6 DUF4218 domain-containing protein1.3e-3047.16Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q  +T  +G+G TR V +E+YV+ +G   I+I+ EDRK +                   
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA
                 VS+EVK EIV+ L NYFI+D+NEP+V+DYL+HEM VL+RDFRCSLH +Y+KY+SP +A+KHR KRVA
Subjt:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVA

A0A5D3C2Z5 Formin-like protein 4 isoform X29.2e-2959.02Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI
        MSSR+FGPGNRSR  SN      SN+++  EQE  S +S  Q  +T  +GRG TRGV +E+YV+A+G I I+I+ EDRK +CK+SSKLNS IG+ VR  +
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQEVKSEIVERL
        PL+CEHWSDVS+EVK EIV+RL
Subjt:  PLQCEHWSDVSQEVKSEIVERL

A0A6J1DUJ0 uncharacterized protein LOC1110232275.4e-2944.44Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI
        MSS + GPGNR    SN    E +       Q  +++ ++     T ++GR  T G+ ++++V+ HG I I IE ED K VC +S KL S IG  VR  +
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSD
         L CEHWSDV+Q+ K+ I+ RL N FI+++N+PIV  YL+HEM   H DFR  LH  Y +Y SP EA+ H+ K VAR  D
Subjt:  PLQCEHWSDVSQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCGAGAAGTTTCGGACCAGGAAATCGTAGTAGGCCCATATCAAACGAACAACCAACAGAGGAATCTAATGAAGATACTACTAGAGAGCAAGAAGATGATTCTAC
AAATTCAAGTGCTCAGGGGTTCAATACTAACAAAAGAGGTCGAGGAGTAACTAGAGGGGTCGCAATTGAGAGATATGTTCGAGCTCATGGTCGGATTCATATTAAAATTG
AGCTTGAAGATAGAAAACTAGTTTGCAAGAACTCATCCAAATTGAACTCTACCATTGGACAACTTGTACGGGGAACAATACCTCTTCAGTGCGAACATTGGTCAGATGTC
TCTCAAGAGGTTAAAAGCGAGATAGTAGAAAGACTTTTGAATTACTTCATTATGGATATCAATGAACCCATAGTACGAGACTACCTAGATCATGAGATGGGTGTATTACA
CAGGGATTTTCGTTGTTCGTTACATAGTACTTATAGAAAATACAATTCTCCAACTGAAGCACAAAAACATCGAGACAAACGAGTGGCACGAGATTCAGATTGGAATCGCC
TGTGTGATCGGTGGGAAACAGAAGAATTTAAGAGTCTTTCTGAAGCAAACTCTAAGGCTCGAGCCTCTCTTCCATTCACTCATAGAGGTGGTACTATGCCATTTTTACGC
CATAGAAAAAAAAGAAAGAAGAGGGTAAAGAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCGAGAAGTTTCGGACCAGGAAATCGTAGTAGGCCCATATCAAACGAACAACCAACAGAGGAATCTAATGAAGATACTACTAGAGAGCAAGAAGATGATTCTAC
AAATTCAAGTGCTCAGGGGTTCAATACTAACAAAAGAGGTCGAGGAGTAACTAGAGGGGTCGCAATTGAGAGATATGTTCGAGCTCATGGTCGGATTCATATTAAAATTG
AGCTTGAAGATAGAAAACTAGTTTGCAAGAACTCATCCAAATTGAACTCTACCATTGGACAACTTGTACGGGGAACAATACCTCTTCAGTGCGAACATTGGTCAGATGTC
TCTCAAGAGGTTAAAAGCGAGATAGTAGAAAGACTTTTGAATTACTTCATTATGGATATCAATGAACCCATAGTACGAGACTACCTAGATCATGAGATGGGTGTATTACA
CAGGGATTTTCGTTGTTCGTTACATAGTACTTATAGAAAATACAATTCTCCAACTGAAGCACAAAAACATCGAGACAAACGAGTGGCACGAGATTCAGATTGGAATCGCC
TGTGTGATCGGTGGGAAACAGAAGAATTTAAGAGTCTTTCTGAAGCAAACTCTAAGGCTCGAGCCTCTCTTCCATTCACTCATAGAGGTGGTACTATGCCATTTTTACGC
CATAGAAAAAAAAGAAAGAAGAGGGTAAAGAGTTAA
Protein sequenceShow/hide protein sequence
MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQGFNTNKRGRGVTRGVAIERYVRAHGRIHIKIELEDRKLVCKNSSKLNSTIGQLVRGTIPLQCEHWSDV
SQEVKSEIVERLLNYFIMDINEPIVRDYLDHEMGVLHRDFRCSLHSTYRKYNSPTEAQKHRDKRVARDSDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLR
HRKKRKKRVKS