; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001401 (gene) of Snake gourd v1 genome

Gene IDTan0001401
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGPI transamidase component Gpi16 subunit family protein isoform 1
Genome locationLG06:14467637..14470389
RNA-Seq ExpressionTan0001401
SyntenyTan0001401
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593003.1 hypothetical protein SDJN03_12479, partial [Cucurbita argyrosperma subsp. sororia]1.1e-10381.56Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFCPSLSTI PFVAS D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE  NLNG RLK +S+LVKNK+LKYM+ GSK +DSP+ KCSPNDYKR+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        AGR I+ TAN  PRT+YSCSYNSKN+KRMREDE LV AFCKRT+
Subjt:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

XP_022959564.1 uncharacterized protein LOC111460597 isoform X1 [Cucurbita moschata]3.2e-10381.56Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFCPSLSTI PFVAS D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE  NLNG RLK +S+LVKNK+LK+M+ GSK +DSPV KCSPNDYKR+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        AGR I+ TAN  PRT+YSCSYNSKN+KRMREDE LV AFCKRT+
Subjt:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

XP_023004810.1 uncharacterized protein LOC111498000 isoform X1 [Cucurbita maxima]4.2e-10381.15Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFC SLSTI PFVAS+D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE GNLNG RLK +S+LVKNK+LK+M+ GSK +DSPVSKCSPNDY+R+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        AGR I++TAN  PRT+ SCSYNSKN+KRMREDE LV AFCKRT+
Subjt:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

XP_023513937.1 uncharacterized protein LOC111778382 isoform X1 [Cucurbita pepo subsp. pepo]8.5e-10481.97Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFCPSLSTI PFVAS D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE GNLNG RLK +S+LVKNK+LK+M+ GSK +DSPV KCSPNDYKR+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        AGR I++TAN  PRT YSCSYNSKN+KRMREDE LV AFCKRT+
Subjt:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

XP_038897802.1 uncharacterized protein LOC120085717 isoform X1 [Benincasa hispida]1.8e-9375.1Show/hide
Query:  EIKIDNKKTGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVDGKL
        E++I  +KT QKLRTI LFCPSLSTI PF+ASDDHG+DIGSIA +FGL+PS++KLNGHFLSRGLDLVS VTW SLLSFFS KRLPIG SD+DAL+VDGKL
Subjt:  EIKIDNKKTGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVDGKL

Query:  SKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSDAGR
        SK+GVKRAHG QEIV+ DCC+A EE  N+N  R+K +S+LVKNKK+KYMDLGSKHMDSP SK +PN YKR+Q+MEEV LLKKLKLNETKSGFDELSD   
Subjt:  SKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSDAGR

Query:  GISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        G+SD ANV    +YSCS+NS N+KRMRE+ETLVSA CKR+R
Subjt:  GISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

TrEMBL top hitse value%identityAlignment
A0A6J1CVB3 uncharacterized protein LOC1110145921.0e-9174.37Show/hide
Query:  IDNKKTGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVDGKLSKI
        ++ + T  K R IKL CPSLS I PF+ASD H IDIG+IAT FGL+PSTVKLNGHFLSRG DL+SSVTWKSLLSFFSAKRLP+G SD+D LVVDGKLSKI
Subjt:  IDNKKTGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVDGKLSKI

Query:  GVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSDAGRGIS
        G+KRA G QEIV+  CCEA EE  NLN        +LVKNKKLK+ D GSKH+DS V KCSPN YKR+Q MEEV LLKKLKLNETKSG DELSD  +G+S
Subjt:  GVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSDAGRGIS

Query:  DTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        D ANVVPR  YSCSYNSKN+KRMREDETLVSAFCKRTR
Subjt:  DTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

A0A6J1H6M8 uncharacterized protein LOC111460597 isoform X25.4e-8881.31Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFCPSLSTI PFVAS D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE  NLNG RLK +S+LVKNK+LK+M+ GSK +DSPV KCSPNDYKR+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPR
        AGR I+ TAN  PR
Subjt:  AGRGISDTANVVPR

A0A6J1H8F0 uncharacterized protein LOC111460597 isoform X11.6e-10381.56Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFCPSLSTI PFVAS D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE  NLNG RLK +S+LVKNK+LK+M+ GSK +DSPV KCSPNDYKR+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        AGR I+ TAN  PRT+YSCSYNSKN+KRMREDE LV AFCKRT+
Subjt:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

A0A6J1KT59 uncharacterized protein LOC111498000 isoform X26.3e-8981.31Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFC SLSTI PFVAS+D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE GNLNG RLK +S+LVKNK+LK+M+ GSK +DSPVSKCSPNDY+R+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPR
        AGR I++TAN  PR
Subjt:  AGRGISDTANVVPR

A0A6J1KVM1 uncharacterized protein LOC111498000 isoform X12.0e-10381.15Show/hide
Query:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD
        MEI ++ KK   GQ+LRTIKLFC SLSTI PFVAS+D  IDIGSIAT+FGLEPSTVKLNGHFLSRGLDLVSSVTW SLLSFFSAKRLP G SDDDALVVD
Subjt:  MEIKIDNKK--TGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVD

Query:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEI N DCCEA EE GNLNG RLK +S+LVKNK+LK+M+ GSK +DSPVSKCSPNDY+R+QHMEEV LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHGPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSD

Query:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR
        AGR I++TAN  PRT+ SCSYNSKN+KRMREDE LV AFCKRT+
Subjt:  AGRGISDTANVVPRTSYSCSYNSKNVKRMREDETLVSAFCKRTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G07150.1 unknown protein1.2e-3444.16Show/hide
Query:  RTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSS-VTWKSLLSFFSAKRLPIGRSDDDALVVDGKLSKIGVKRAHGPQ
        R IKLFCPS+S I  +VA +D  +D  +IA  FGLEPSTVKLNGHF+SRG DLV++ VTW+SLL+FFSA+ L  G+ + DAL+V GKLSK+G KRA    
Subjt:  RTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSS-VTWKSLLSFFSAKRLPIGRSDDDALVVDGKLSKIGVKRAHGPQ

Query:  EIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLK-YMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSDAGRGISDTANVVPR
        + + +  C               +D  L+K KKLK    +G    +S +S C+    KR+   E+   LKKLKLN      D+ S  G G         +
Subjt:  EIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLK-YMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSDAGRGISDTANVVPR

Query:  TSYSCSYNSKN-VKRMREDETLVSAFCKRTR
        T   CS+ S N +KR RED+ + SA CK+ R
Subjt:  TSYSCSYNSKN-VKRMREDETLVSAFCKRTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTAAGATTGACAACAAGAAGACAGGCCAAAAACTCAGAACCATCAAGCTATTTTGCCCTTCACTCTCCACCATTACTCCATTCGTCGCATCCGATGATCATGG
CATCGATATCGGCTCCATAGCCACCGTTTTCGGCCTCGAGCCCTCAACGGTGAAGCTCAATGGCCACTTCCTCAGCCGAGGCCTCGATCTCGTCTCCTCTGTTACTTGGA
AGTCTCTTCTCTCTTTCTTCTCTGCGAAACGGCTGCCTATTGGACGCTCCGATGACGATGCCCTTGTTGTTGATGGAAAGCTCTCTAAAATTGGCGTCAAGAGAGCTCAT
GGCCCTCAGGAAATTGTAAATGAAGATTGTTGCGAGGCTTATGAAGAATATGGTAATCTTAATGGTCGAAGGCTAAAATCAGACAGCAGCCTGGTCAAGAATAAGAAGTT
GAAGTATATGGACTTAGGAAGCAAACATATGGATTCTCCAGTATCCAAATGTAGTCCCAATGATTATAAACGAGAACAACACATGGAAGAAGTCAGCTTACTCAAGAAAT
TGAAGTTAAACGAAACTAAATCAGGTTTTGACGAATTATCCGATGCAGGCAGAGGAATAAGCGACACAGCCAATGTTGTCCCACGTACGTCATATTCGTGTAGCTACAAT
AGTAAGAATGTGAAAAGGATGAGAGAAGATGAGACTCTTGTTTCTGCCTTCTGCAAGAGAACTAGATGA
mRNA sequenceShow/hide mRNA sequence
GACCATTAGGAAGGAAAATCAGCAAACACGGTCTGTCGCCATAGCCATTCCAAAAATGTTCATTCCACCTTCTTCTTCCCACAACAACCAGTTTAATCGAGCTTCCTATT
TCCATTGTAATGGAGATTAAGATTGACAACAAGAAGACAGGCCAAAAACTCAGAACCATCAAGCTATTTTGCCCTTCACTCTCCACCATTACTCCATTCGTCGCATCCGA
TGATCATGGCATCGATATCGGCTCCATAGCCACCGTTTTCGGCCTCGAGCCCTCAACGGTGAAGCTCAATGGCCACTTCCTCAGCCGAGGCCTCGATCTCGTCTCCTCTG
TTACTTGGAAGTCTCTTCTCTCTTTCTTCTCTGCGAAACGGCTGCCTATTGGACGCTCCGATGACGATGCCCTTGTTGTTGATGGAAAGCTCTCTAAAATTGGCGTCAAG
AGAGCTCATGGCCCTCAGGAAATTGTAAATGAAGATTGTTGCGAGGCTTATGAAGAATATGGTAATCTTAATGGTCGAAGGCTAAAATCAGACAGCAGCCTGGTCAAGAA
TAAGAAGTTGAAGTATATGGACTTAGGAAGCAAACATATGGATTCTCCAGTATCCAAATGTAGTCCCAATGATTATAAACGAGAACAACACATGGAAGAAGTCAGCTTAC
TCAAGAAATTGAAGTTAAACGAAACTAAATCAGGTTTTGACGAATTATCCGATGCAGGCAGAGGAATAAGCGACACAGCCAATGTTGTCCCACGTACGTCATATTCGTGT
AGCTACAATAGTAAGAATGTGAAAAGGATGAGAGAAGATGAGACTCTTGTTTCTGCCTTCTGCAAGAGAACTAGATGA
Protein sequenceShow/hide protein sequence
MEIKIDNKKTGQKLRTIKLFCPSLSTITPFVASDDHGIDIGSIATVFGLEPSTVKLNGHFLSRGLDLVSSVTWKSLLSFFSAKRLPIGRSDDDALVVDGKLSKIGVKRAH
GPQEIVNEDCCEAYEEYGNLNGRRLKSDSSLVKNKKLKYMDLGSKHMDSPVSKCSPNDYKREQHMEEVSLLKKLKLNETKSGFDELSDAGRGISDTANVVPRTSYSCSYN
SKNVKRMREDETLVSAFCKRTR