CuGenDBv2

Tan0016615 (gene) of Snake gourd v1 genome

Gene ID: Tan0016615
Organism: Trichosanthes anguina (Snake gourd v1)
Description: AT-hook motif nuclear-localized protein 20-like
Genome location: LG10:5885515..5886076
RNA-Seq Expression: Tan0016615
Synteny: Tan0016615
Gene Ontology terms: GO:0005622 - intracellular (cellular component)
InterPro domains: IPR040381 - Uncharacterized protein At4g14450-like
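
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `seqid:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG10:5885515..5886076")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, so the convention is worth checking before relying on the length.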


Homology
GenBank top hits (e value, % identity, alignment)
KAA0049279.1 hypothetical protein E6C27_scaffold171G005190 [Cucumis melo var. makuwa]; e value: 4.3e-28; % identity: 71.7
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-
        M+DAER+ A          APTRLQSQAPASIEIKR  +WNV IPLLSPLVSPSSCGNSG E+ L MA+N A REE KG TFTKWQHPAAPFYY PVPR 
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-

Query:  TPFVPV
          FVPV
Subjt:  TPFVPV

KAE8650334.1 hypothetical protein Csa_009641 [Cucumis sativus]; e value: 3.5e-30; % identity: 75.47
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-
        M+DAER+ A          APTRLQSQAPASIEIKRA NWNVAIPLLSPLVSPSSCGNS  E+ L MAEN A REE KG TFTKWQHPAAPFYY PVPR 
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-

Query:  TPFVPV
         PFVPV
Subjt:  TPFVPV

KAG6582368.1 AT-hook motif nuclear-localized protein 20, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.1e-20; % identity: 65.09
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-
        MA +ERRN            PTRLQSQAPASI I RASNWNVAIPLL+PLVS S CGNS Q + LLM ENKA REE      TKWQHPA P Y GP+P  
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-

Query:  TPFVPV
        TPFVPV
Subjt:  TPFVPV

XP_022924297.1 AT-hook motif nuclear-localized protein 20-like [Cucurbita moschata]; e value: 9.6e-20; % identity: 65.62
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP
        MA +ERRN            PTRLQSQAPASI I RASNWNVAIPLL+PLVS S CGNS Q + LLM ENKA REE KG   TKWQHPA PF   P
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP

XP_023526926.1 AT-hook motif nuclear-localized protein 20-like [Cucurbita pepo subsp. pepo]; e value: 4.3e-20; % identity: 66.67
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP
        MA +ERRN            PTRLQSQAPASI I RASNWNVAIPLL+PLVS S CGNS Q + LL AENKA REE KG T TKWQHPA PF   P
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L8D3 Uncharacterized protein; e value: 1.7e-30; % identity: 75.47
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-
        M+DAER+ A          APTRLQSQAPASIEIKRA NWNVAIPLLSPLVSPSSCGNS  E+ L MAEN A REE KG TFTKWQHPAAPFYY PVPR 
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-

Query:  TPFVPV
         PFVPV
Subjt:  TPFVPV

A0A1S4D3W7 uncharacterized protein At4g14450, chloroplastic-like; e value: 1.1e-13; % identity: 47.75
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLV-SPSSCGNSGQEEALLMAENKAIREEAKGP-------TFTKWQHPAAPF
        MAD ++R++++   G +RQ P+RLQ +APASI++ RA++WNVAIPLLSPL+ SP+S      + A+    + A REE +          F KWQHPAAPF
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLV-SPSSCGNSGQEEALLMAENKAIREEAKGP-------TFTKWQHPAAPF

Query:  YYGPVPRTPFV
         Y P P  PFV
Subjt:  YYGPVPRTPFV

A0A5D3D1A6 Uncharacterized protein; e value: 2.1e-28; % identity: 71.7
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-
        M+DAER+ A          APTRLQSQAPASIEIKR  +WNV IPLLSPLVSPSSCGNSG E+ L MA+N A REE KG TFTKWQHPAAPFYY PVPR 
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPR-

Query:  TPFVPV
          FVPV
Subjt:  TPFVPV

A0A6J1CRZ0 uncharacterized protein At4g14450, chloroplastic-like; e value: 6.7e-19; % identity: 60
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEA----KGPTFTKWQHPAAPFYYGP
        M DAERR A      S  Q  TRLQ +AP+SI+I R ++WNVAIPLLSPLVSP       ++  +LM ENKA REEA    K  TFT+W+HPAAPFYYGP
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEA----KGPTFTKWQHPAAPFYYGP

Query:  VPR-TPFVPV
        V R TPFVPV
Subjt:  VPR-TPFVPV

A0A6J1E8H9 AT-hook motif nuclear-localized protein 20-like; e value: 4.6e-20; % identity: 65.62
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP
        MA +ERRN            PTRLQSQAPASI I RASNWNVAIPLL+PLVS S CGNS Q + LLM ENKA REE KG   TKWQHPA PF   P
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP

SwissProt top hits (e value, % identity, alignment)
Q6NN02 Uncharacterized protein At4g14450, chloroplastic; e value: 1.5e-07; % identity: 43.12
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPA-SIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKA---IREEA-KGPTFTKWQHPAAPFYYG
        MA   +R +++ A  ++RQ  ++LQ +AP+  I+    SNWNVAIPLLSPL    S  +S  +  +   +NK    + EE  K P F KWQHPA+PF Y 
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPA-SIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKA---IREEA-KGPTFTKWQHPAAPFYYG

Query:  PVPRTPFVP
        P   T FVP
Subjt:  PVPRTPFVP

Arabidopsis top hits (e value, % identity, alignment)
AT1G04330.1 unknown protein; e value: 3.0e-11; % identity: 44.09
Query:  NAAAEAVGSQRQAPTRLQSQAPASIEIKRA-SNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEA-KGPTFTKWQHPAAPFYYGPVP
        + A    GS  +  +RLQ +AP  ++I    +NW VAIPLLSP  SP       +  A++  E +   +EA K P F KWQHPAAPFYY P P
Subjt:  NAAAEAVGSQRQAPTRLQSQAPASIEIKRA-SNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEA-KGPTFTKWQHPAAPFYYGPVP

AT3G23170.1 unknown protein; e value: 1.3e-09; % identity: 41.96
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEI---KRASNWNVAIPLLSPL-VSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP
        M+   +R   +   G  R+ P+RL  + PA   +     A+NWN AIPLLSPL +SP S  +   +  +   ++ A+  E K P F KWQHPAAPFYY  
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEI---KRASNWNVAIPLLSPL-VSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGP

Query:  ---VPRTPFVPV
           VP  PFVPV
Subjt:  ---VPRTPFVPV

AT4G14450.1 unknown protein; e value: 1.1e-08; % identity: 43.12
Query:  MADAERRNAAAEAVGSQRQAPTRLQSQAPA-SIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKA---IREEA-KGPTFTKWQHPAAPFYYG
        MA   +R +++ A  ++RQ  ++LQ +AP+  I+    SNWNVAIPLLSPL    S  +S  +  +   +NK    + EE  K P F KWQHPA+PF Y 
Subjt:  MADAERRNAAAEAVGSQRQAPTRLQSQAPA-SIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKA---IREEA-KGPTFTKWQHPAAPFYYG

Query:  PVPRTPFVP
        P   T FVP
Subjt:  PVPRTPFVP
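
In the alignments above, the unlabeled row between `Query:` and `Subjt:` is the BLAST-style midline: a residue letter marks an identical position, `+` a conservative substitution, and a space a mismatch or gap. The sketch below shows how a percent identity can be derived from a query row and its midline. It uses only the final nine-column block of the AT4G14450.1 alignment as input, so its result applies to that fragment and is not expected to reproduce the 43.12 reported for the full alignment. `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a residue letter."""
    if len(midline) > len(query_row):
        raise ValueError("midline is longer than the query row")
    # Scraped text often loses trailing spaces, so pad the midline back out.
    midline = midline.ljust(len(query_row))
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query_row)


# Last block of the AT4G14450.1 alignment (query row and midline only).
query_row = "PVPRTPFVP"
midline = "P   T FVP"
print(round(percent_identity(query_row, midline), 2))
```

For a whole alignment, the rows of every block would be concatenated first, with gap columns (`-`) counted in the alignment length.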


Sequences
CDS sequence
ATGGCCGATGCAGAGAGAAGAAACGCCGCCGCCGAGGCCGTGGGTTCTCAGAGGCAAGCGCCGACGCGGTTGCAGAGCCAGGCACCGGCGTCGATAGAGATAAAACGGGC
GTCGAATTGGAACGTGGCTATACCGTTGTTGTCGCCGCTTGTATCGCCTTCTTCTTGTGGGAATTCAGGGCAAGAGGAGGCGTTGTTGATGGCTGAGAATAAGGCGATCA
GAGAGGAAGCCAAAGGGCCGACCTTTACGAAGTGGCAACATCCGGCGGCTCCATTCTATTATGGGCCGGTGCCAAGGACTCCCTTTGTGCCTGTGTGA
mRNA sequence
AAGCAGTCAAGGATATTCCCAACTCAACATTTGACCACTGACCCTTAAATCTCCTCCAAACTCTTGTGTTTCTTTGGCAATTTGTGTGTTTGAAATGGCCGATGCAGAGA
GAAGAAACGCCGCCGCCGAGGCCGTGGGTTCTCAGAGGCAAGCGCCGACGCGGTTGCAGAGCCAGGCACCGGCGTCGATAGAGATAAAACGGGCGTCGAATTGGAACGTG
GCTATACCGTTGTTGTCGCCGCTTGTATCGCCTTCTTCTTGTGGGAATTCAGGGCAAGAGGAGGCGTTGTTGATGGCTGAGAATAAGGCGATCAGAGAGGAAGCCAAAGG
GCCGACCTTTACGAAGTGGCAACATCCGGCGGCTCCATTCTATTATGGGCCGGTGCCAAGGACTCCCTTTGTGCCTGTGTGAATATATTTATATCTTTTTTTAAAAAATT
TTCTCCTTTGATCTTCTTCTTCTTCTTCTTCTTCTTCTTAATCCATCAAACGATTTGTATGTATAAAATTGCTCTGGATTGACTTTTCCTTTTCATAATTGTATACAAGT
TATTTTCTATTA
Protein sequence
MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPRTPFVPV
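
As a consistency check on the sequences listed above, the CDS can be translated and compared with the protein sequence. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and the `CDS` and `PROTEIN` strings are copied from this record with line breaks removed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein sequence as given in this record.
CDS = (
    "ATGGCCGATGCAGAGAGAAGAAACGCCGCCGCCGAGGCCGTGGGTTCTCAGAGGCAAGCGCCGACGCGGTTGCAGAGCCAGGCACCGGCGTCGATAGAGATAAAACGGGC"
    "GTCGAATTGGAACGTGGCTATACCGTTGTTGTCGCCGCTTGTATCGCCTTCTTCTTGTGGGAATTCAGGGCAAGAGGAGGCGTTGTTGATGGCTGAGAATAAGGCGATCA"
    "GAGAGGAAGCCAAAGGGCCGACCTTTACGAAGTGGCAACATCCGGCGGCTCCATTCTATTATGGGCCGGTGCCAAGGACTCCCTTTGTGCCTGTGTGA"
)
PROTEIN = (
    "MADAERRNAAAEAVGSQRQAPTRLQSQAPASIEIKRASNWNVAIPLLSPLVSPSSCGNSGQEEALLMAENKAIREEAKGPTFTKWQHPAAPFYYGPVPRTPFVPV"
)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print("CDS translation matches protein:", translate(CDS) == PROTEIN)
```

The same check could be run on the mRNA sequence by first locating the CDS within it, since the mRNA adds untranslated regions on both sides of the coding region.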