; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014706 (gene) of Snake gourd v1 genome

Gene IDTan0014706
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4228 domain protein
Genome locationLG09:69516147..69516845
RNA-Seq ExpressionTan0014706
SyntenyTan0014706
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023330.1 hypothetical protein SDJN02_14355, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-7588.1Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSASC PS+ASNGAAKVL+LDG LQSY KPV+AAELMIEHSGKFLCDS  L VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM SLTY ATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFGRIFPVLI++L I  SDVNRLKS DGDREN+SSK VQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC

XP_022921761.1 uncharacterized protein LOC111429917 [Cucurbita moschata]5.2e-7689.29Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSASCAPS+ASNGAAKVL+LDG LQSY KPV+AAELMIEHSGKFLCDS  L VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEM SLTY ATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFGRIFPVLI++L I  SDVNRLKS DGDREN+SSK VQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC

XP_022965466.1 uncharacterized protein LOC111465363 [Cucurbita maxima]1.7e-7489.41Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSA CAPS+A NGAAKVLTLDGNLQS+TKPVKAAELMIEHSGKFLCDSG LKVGHRI GLL +E LECRRLYFLLPMDLLYSVLTIEEM+SLTYIATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRE-NKSSKSVQRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFGRIFPVLISEL IFPSDVN LKS DGD E N+SSK VQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRE-NKSSKSVQRLMSKQRSWKPALETIAETSCT

XP_022987495.1 uncharacterized protein LOC111485040 [Cucurbita maxima]2.0e-7588.69Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSASCAPS+ASNGAAKVL+LDG LQSYTK V+AAELMIEHSGKFLCDS  L VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM SLTY ATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFGRIFPVLI++L I  SDVNRLKS DGDREN+SSK VQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC

XP_038879989.1 uncharacterized protein LOC120071684 [Benincasa hispida]9.8e-7586.39Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNS SCAPS+ASNGAAKVL+LDG LQS+TKPVKAAELMIEHSGKFLCDS  LK+GHRIQGLLPDEDLE RRLYFLLPMDLLYSVLT+EEM+SL++IATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSCT
        ALKHGNSSGFGRIFPVLISEL I P+DV++LK  DGDREN+SSK+V+RLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSCT

TrEMBL top hitse value%identityAlignment
A0A6J1C0W6 uncharacterized protein LOC1110072541.4e-7487.13Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSASCAPS+ SNGAAKVL+LDG L+SYTKPVKAAELMIE+SGKFLCDSG LKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM+SLTYIATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDR--ENKSSKSVQRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFGRIFPVLISEL I PS+VNRLKS   DR  EN + K VQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDR--ENKSSKSVQRLMSKQRSWKPALETIAETSCT

A0A6J1E2A0 uncharacterized protein LOC1114299172.5e-7689.29Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSASCAPS+ASNGAAKVL+LDG LQSY KPV+AAELMIEHSGKFLCDS  L VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEM SLTY ATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFGRIFPVLI++L I  SDVNRLKS DGDREN+SSK VQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC

A0A6J1F5K3 uncharacterized protein LOC1114410402.2e-7287.65Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSA CAPS+ASNGAAKVLTLDGNLQS+ KPVKAAELMIEHSGKFLCDSG LKVGHRI GLL +E LE  RLYFLLPMDLLYSVLTIEEM+SLTYIATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRE-NKSSKSVQRLMSKQRSWKPALETIAETSCT
        A+K GNSSGFGRIFPVLISEL IFPSDVN LKS DGD E N+SSK VQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRE-NKSSKSVQRLMSKQRSWKPALETIAETSCT

A0A6J1HLR9 uncharacterized protein LOC1114653638.1e-7589.41Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSA CAPS+A NGAAKVLTLDGNLQS+TKPVKAAELMIEHSGKFLCDSG LKVGHRI GLL +E LECRRLYFLLPMDLLYSVLTIEEM+SLTYIATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRE-NKSSKSVQRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFGRIFPVLISEL IFPSDVN LKS DGD E N+SSK VQRLMSKQRSWKPALETIAETSCT
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRE-NKSSKSVQRLMSKQRSWKPALETIAETSCT

A0A6J1JH13 uncharacterized protein LOC1114850409.6e-7688.69Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MGNSASCAPS+ASNGAAKVL+LDG LQSYTK V+AAELMIEHSGKFLCDS  L VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM SLTY ATK
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC
        ALKHGNSSGFGRIFPVLI++L I  SDVNRLKS DGDREN+SSK VQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G18290.1 unknown protein2.0e-2539.2Show/hide
Query:  MGNSASCAPSL----ASNGAAKVLT-LDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRR-LYFLLPMDLLYSVLTIEEMNSL
        MGN++SCAP +    +S+G  K+L    G L+ ++KP+K ++++  HSG F+ DS  L++ HR+  + PDE L  RR LY LLP D+L+SVLT EE++ +
Subjt:  MGNSASCAPSL----ASNGAAKVLT-LDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRR-LYFLLPMDLLYSVLTIEEMNSL

Query:  TYIATKALKHGNSSGFGRIFPVLI----SELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAET
        +  A + L     +   RIFPV I     + R  PS VN  ++ DG    ++        SK  SW+P LETI E+
Subjt:  TYIATKALKHGNSSGFGRIFPVLI----SELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAET

AT3G03280.1 unknown protein6.9e-1033.53Show/hide
Query:  MGNSASCA-PSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMNSLTYI
        MGN  SCA    +S+  AKV+  DG ++    P KAAELM+E    FL D+  +KVG +   L  D+DL+     +Y   PM    S     +M  L Y+
Subjt:  MGNSASCA-PSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMNSLTYI

Query:  ATKALKHGNSSGFGRIFPVL--ISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAE
          K  K   + G  R+ P      ++R+    +N       D E  S+      +S  +S KP LETIAE
Subjt:  ATKALKHGNSSGFGRIFPVL--ISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAE

AT4G37240.1 unknown protein9.0e-1028.1Show/hide
Query:  CAPSLASNGA-AKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATKALKHG
        C+ S ++  A AK++  DG +  +  PVK   +++++   F+C+S  +     +  +  DE+L+  ++YF LP+  L   L  EEM +L   A+ AL  G
Subjt:  CAPSLASNGA-AKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATKALKHG

Query:  NSSGFGR--IFPVLISELRI--------FPSDVNRLKSTDGDRENKSSKSVQR
           G  R  + P++  +LR+          S   R K  +GD     S S +R
Subjt:  NSSGFGR--IFPVLISELRI--------FPSDVNRLKSTDGDRENKSSKSVQR

AT5G17350.1 unknown protein1.2e-0931.69Show/hide
Query:  MGNSASCA---PSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMNSLT
        MGN  S A    S +S+ AAKV+  DG +++   P+KAAELM+E    FL D+  LK+G +   L  D+DL+ +   +Y   PM    S     ++  L 
Subjt:  MGNSASCA---PSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMNSLT

Query:  YIATKALKHGNSSGF-----------GRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETS
          A K  +H   S             GR+ P    +     S  ++L     D E  S+      +S  +S KP LETI E S
Subjt:  YIATKALKHGNSSGF-----------GRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETS

AT5G66580.1 unknown protein2.0e-0933.64Show/hide
Query:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK
        MG  AS   SL S+ +AK++ LDG LQ ++ PVK  +++ ++   F+C+S  +     +  +  +E+L   +LYF+LP+  L   L  EEM +L   A+ 
Subjt:  MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATK

Query:  ALKHGNSSGF
        AL      G+
Subjt:  ALKHGNSSGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACTCAGCGTCCTGTGCGCCTTCACTGGCCTCCAATGGCGCCGCGAAGGTCTTAACCTTAGACGGAAATTTGCAGAGCTACACGAAGCCAGTGAAGGCCGCCGA
ACTAATGATCGAGCATTCTGGCAAATTCCTCTGCGATTCTGGCCATCTCAAGGTCGGTCACCGGATTCAAGGTCTTTTGCCGGACGAAGATCTCGAGTGCCGCCGATTGT
ACTTTCTACTTCCGATGGATCTTCTGTACTCTGTGTTGACGATCGAAGAAATGAATTCGCTCACTTACATCGCTACAAAGGCTCTGAAACATGGAAATTCGAGTGGATTT
GGAAGGATCTTCCCTGTTTTAATCAGTGAACTCCGTATTTTTCCGTCGGATGTGAATCGATTGAAATCGACGGACGGCGATCGTGAGAATAAGAGTTCGAAATCGGTGCA
GAGATTGATGTCGAAACAGAGATCGTGGAAGCCGGCGCTTGAAACCATTGCTGAAACTTCGTGCACATAG
mRNA sequenceShow/hide mRNA sequence
CATAAATTCAAGCACTGGCGAGAGAGATTCCATTATCGGCGCAACTCCCACTTTCTCCACTTTCGCCAAACACGCGCTCTGTATGGGGAACTCAGCGTCCTGTGCGCCTT
CACTGGCCTCCAATGGCGCCGCGAAGGTCTTAACCTTAGACGGAAATTTGCAGAGCTACACGAAGCCAGTGAAGGCCGCCGAACTAATGATCGAGCATTCTGGCAAATTC
CTCTGCGATTCTGGCCATCTCAAGGTCGGTCACCGGATTCAAGGTCTTTTGCCGGACGAAGATCTCGAGTGCCGCCGATTGTACTTTCTACTTCCGATGGATCTTCTGTA
CTCTGTGTTGACGATCGAAGAAATGAATTCGCTCACTTACATCGCTACAAAGGCTCTGAAACATGGAAATTCGAGTGGATTTGGAAGGATCTTCCCTGTTTTAATCAGTG
AACTCCGTATTTTTCCGTCGGATGTGAATCGATTGAAATCGACGGACGGCGATCGTGAGAATAAGAGTTCGAAATCGGTGCAGAGATTGATGTCGAAACAGAGATCGTGG
AAGCCGGCGCTTGAAACCATTGCTGAAACTTCGTGCACATAGAAAAAAAAAGACGATCGCAATCGTCGCAAATATGTATGAGATTAATTACATTACGAATAATTGGTAGC
ATTTTTGCTTCGATAATCGTAGATATAATAAACCTTGGA
Protein sequenceShow/hide protein sequence
MGNSASCAPSLASNGAAKVLTLDGNLQSYTKPVKAAELMIEHSGKFLCDSGHLKVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMNSLTYIATKALKHGNSSGF
GRIFPVLISELRIFPSDVNRLKSTDGDRENKSSKSVQRLMSKQRSWKPALETIAETSCT