; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022124 (gene) of Snake gourd v1 genome

Gene IDTan0022124
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAAI domain-containing protein
Genome locationLG06:77714341..77715756
RNA-Seq ExpressionTan0022124
SyntenyTan0022124
Gene Ontology termsNA
InterPro domainsIPR016140 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain
IPR036312 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574972.1 hypothetical protein SDJN03_25611, partial [Cucurbita argyrosperma subsp. sororia]1.2e-7474.88Show/hide
Query:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS
        MA  I V+ SLI +LTASMA+VAMS PTGCTTRELLLLSPCLPFISAPPNNLSD+VPS CCEAFSSAY S GGICLCYFLR+P+ILGFPLNNTKLIALSS
Subjt:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS

Query:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIES--PPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASL
         CPL+   + E NSSLDS+C+AS+TLPPLQSSKIPGIQEPDSPA+ENTP PA+ SPP  + S  PP  PADE PPSPSSATAKG S+ +DCIGLFL+ SL
Subjt:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIES--PPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASL

Query:  ------PFLIHISSI
              PFL H S I
Subjt:  ------PFLIHISSI

KAG7013541.1 hypothetical protein SDJN02_23707, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-7378.17Show/hide
Query:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS
        MA  I V+ SLIA+LTASMA+VAMS PTGCTTRELLLLSPCLPFISAPPNNLSD+VPS CCEAFSSAY S GGICLCYFLR+P+ILGFPLNNTKLIALSS
Subjt:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS

Query:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIES--PPSAPADESPPSPSSATAKGLSLTKDCIGLFLS
         CPL+   + E NSSLDS+C+AS+TLPPLQSSKIPGIQEPDSPA+ENTP PA+ SPP  + S  PP  PADE PPSPSSATAKG S+ +DCIGLFL+
Subjt:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIES--PPSAPADESPPSPSSATAKGLSLTKDCIGLFLS

XP_022959376.1 uncharacterized protein LOC111460365 [Cucurbita moschata]9.4e-7577.56Show/hide
Query:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS
        MA  I V+ SLIA+LTASMA+VAMS PTGCTTRELLLLSPCLPFISAPPNNLSD+VPS CCEAFSSAY S GGICLCYFLR+P+ILGFPLNNTKLIALSS
Subjt:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS

Query:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPP--SAPADESPPSPSSATAKGLSLTKDCIGLFLSASL
         CPL+   + E NSSLDS+C+AS+TLPPLQSSKIPGIQEPDSPA+ENTP PA+ SPP  + SPP    PADE PPSPSSATAKG S+ +DCIGLFL+ SL
Subjt:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPP--SAPADESPPSPSSATAKGLSLTKDCIGLFLSASL

Query:  PFLIH
         FL+H
Subjt:  PFLIH

XP_023006001.1 uncharacterized protein LOC111498879 [Cucurbita maxima]2.8e-6671.92Show/hide
Query:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS
        MA  I VV  LIA+LTASM +VAMS PT CTTRELLLLSPCLPFISAPPNNLSD+VPS CCEAFSSAY S GGICLCYFLR+P+ILGFPLNNTKL ALSS
Subjt:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS

Query:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPF
         CPL+   + EKNSSLDS+C+AS+TLPPLQSSKIPGIQEPDSPA+ENTP PA+ SPP  + SPP + ADE P SPSSATAKG S+     G ++      
Subjt:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPF

Query:  LIH
        LI+
Subjt:  LIH

XP_038874252.1 non-specific lipid transfer protein GPI-anchored 25 [Benincasa hispida]6.1e-7477.67Show/hide
Query:  MAAIIIVVASLIALLTASMAVVAMS-SPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALS
        MAAII +V  L+A LT SMAVVAMS  PTGC TRELLLLSPCLPFISAPPNNLSDTVPS CCEAFSSAY S GGICLCYFLREP+ILGFPLN TKLIALS
Subjt:  MAAIIIVVASLIALLTASMAVVAMS-SPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALS

Query:  SFCPLNGRFNS--EKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSAS
        SFCPLNG      EK+SSLDS+CAAS+TLPPL SS+IP IQEPDSPADEN PAP +G PP A  S PSAPADE PPS SSAT K  SL KDCIGLF S S
Subjt:  SFCPLNGRFNS--EKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSAS

Query:  LPFLIHISSIFRAVF
        L FLIHISSI   +F
Subjt:  LPFLIHISSIFRAVF

TrEMBL top hitse value%identityAlignment
A0A1S3BHM9 uncharacterized protein LOC1034896792.3e-6678.19Show/hide
Query:  MAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCPLNGR--FNSEKNSSL
        MAVVAMS P GCTTRELLLLSPCLPFISAPPNNLSDTVPS CC+AFSSAY + GGICLCYFLREP+ILGFPLN TKLIALSSFCP N       EKNSSL
Subjt:  MAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCPLNGR--FNSEKNSSL

Query:  DSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPFLIHI
        DSVCAASQTLPPLQSS+IP IQEPDSPADENT    +G PP AI S PSAPAD+  P PSSATA+   L K+CIGLF S SL FLIHI
Subjt:  DSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPFLIHI

A0A5D3D2B0 Non-specific lipid transfer protein GPI-anchored 2-like2.3e-6678.19Show/hide
Query:  MAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCPLNGR--FNSEKNSSL
        MAVVAMS P GCTTRELLLLSPCLPFISAPPNNLSDTVPS CC+AFSSAY + GGICLCYFLREP+ILGFPLN TKLIALSSFCP N       EKNSSL
Subjt:  MAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCPLNGR--FNSEKNSSL

Query:  DSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPFLIHI
        DSVCAASQTLPPLQSS+IP IQEPDSPADENT    +G PP AI S PSAPAD+  P PSSATA+   L K+CIGLF S SL FLIHI
Subjt:  DSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPFLIHI

A0A6J1CD91 uncharacterized protein LOC1110096743.0e-6676.24Show/hide
Query:  MAAIIIVV-------ASLIALLTASMAVVAMS--SP-TGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPL
        MAAII VV        SLIA LT SM+VVAMS  SP TGCTTRELLLLSPCLPFISAPPNNLSDTVPS CC+AFSSAY+SAGGICLCYFLREP+ILGFPL
Subjt:  MAAIIIVV-------ASLIALLTASMAVVAMS--SP-TGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPL

Query:  NNTKLIALSSFCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALG-SPPPAIESPPSAPADESPPSPSSATAKGLSLTKDC
        N++KLIALSS CPL+G   SE +SSL+S+CAASQTLPPLQS+KIPGI+EPDSP D++TPAP  G SPPP+    PSAPADE PPSPSSATAK L   KDC
Subjt:  NNTKLIALSSFCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALG-SPPPAIESPPSAPADESPPSPSSATAKGLSLTKDC

Query:  IG
        IG
Subjt:  IG

A0A6J1H4P5 uncharacterized protein LOC1114603654.6e-7577.56Show/hide
Query:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS
        MA  I V+ SLIA+LTASMA+VAMS PTGCTTRELLLLSPCLPFISAPPNNLSD+VPS CCEAFSSAY S GGICLCYFLR+P+ILGFPLNNTKLIALSS
Subjt:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS

Query:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPP--SAPADESPPSPSSATAKGLSLTKDCIGLFLSASL
         CPL+   + E NSSLDS+C+AS+TLPPLQSSKIPGIQEPDSPA+ENTP PA+ SPP  + SPP    PADE PPSPSSATAKG S+ +DCIGLFL+ SL
Subjt:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPP--SAPADESPPSPSSATAKGLSLTKDCIGLFLSASL

Query:  PFLIH
         FL+H
Subjt:  PFLIH

A0A6J1KWK3 uncharacterized protein LOC1114988791.3e-6671.92Show/hide
Query:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS
        MA  I VV  LIA+LTASM +VAMS PT CTTRELLLLSPCLPFISAPPNNLSD+VPS CCEAFSSAY S GGICLCYFLR+P+ILGFPLNNTKL ALSS
Subjt:  MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSS

Query:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPF
         CPL+   + EKNSSLDS+C+AS+TLPPLQSSKIPGIQEPDSPA+ENTP PA+ SPP  + SPP + ADE P SPSSATAKG S+     G ++      
Subjt:  FCPLNGRFNSEKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPF

Query:  LIH
        LI+
Subjt:  LIH

SwissProt top hitse value%identityAlignment
F4JIG1 Non-specific lipid transfer protein GPI-anchored 259.0e-2038.59Show/hide
Query:  IIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCP
        I+ +  S  + +TA+    + S P    T EL++ SPCLP++S+PPNN+S+T    CC  F+S+  S+ G CLCY LR+P ILGFPL+ ++LI+LS  C 
Subjt:  IIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCP

Query:  LNGRFNSEKNSSLDSVCAASQT--LPPLQSSK-----IPGIQEPDSP-----ADENTPAPALGSPPPAIESPPSAPADESPPSP
             +     S +S+C+ S++  LPPLQS +     + G     SP     A E +P+  L SP  A  +PP  P    PP P
Subjt:  LNGRFNSEKNSSLDSVCAASQT--LPPLQSSK-----IPGIQEPDSP-----ADENTPAPALGSPPPAIESPPSAPADESPPSP

Arabidopsis top hitse value%identityAlignment
AT4G14805.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein6.4e-2138.59Show/hide
Query:  IIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCP
        I+ +  S  + +TA+    + S P    T EL++ SPCLP++S+PPNN+S+T    CC  F+S+  S+ G CLCY LR+P ILGFPL+ ++LI+LS  C 
Subjt:  IIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCP

Query:  LNGRFNSEKNSSLDSVCAASQT--LPPLQSSK-----IPGIQEPDSP-----ADENTPAPALGSPPPAIESPPSAPADESPPSP
             +     S +S+C+ S++  LPPLQS +     + G     SP     A E +P+  L SP  A  +PP  P    PP P
Subjt:  LNGRFNSEKNSSLDSVCAASQT--LPPLQSSK-----IPGIQEPDSP-----ADENTPAPALGSPPPAIESPPSAPADESPPSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCGATAATTATCGTCGTCGCATCGTTAATCGCACTTCTCACGGCTTCGATGGCGGTGGTTGCGATGTCGTCGCCGACGGGATGCACCACCAGAGAGCTGCTCTT
GCTCTCGCCATGTCTGCCTTTCATTTCTGCTCCGCCGAACAATCTTTCGGATACAGTTCCCTCCGGCTGCTGCGAGGCGTTCTCCTCCGCTTACGAGTCCGCCGGCGGCA
TTTGCCTCTGTTACTTTCTTCGAGAGCCTCGGATTTTGGGCTTCCCGTTGAATAATACGAAGCTGATCGCTCTGTCTTCGTTTTGTCCTCTCAATGGCAGATTCAATTCG
GAGAAGAATAGTTCTCTGGACTCGGTCTGTGCTGCTTCACAAACTCTGCCTCCCCTTCAAAGCTCAAAGATTCCAGGAATTCAAGAACCTGATAGTCCTGCTGATGAGAA
TACCCCAGCTCCTGCGCTAGGCTCACCACCACCTGCAATTGAATCACCACCATCAGCACCTGCAGATGAATCGCCGCCGTCTCCGTCATCTGCAACAGCTAAAGGTTTGT
CATTGACAAAAGATTGTATTGGTTTGTTCTTGTCAGCTTCACTCCCCTTCCTCATCCACATTTCGTCCATTTTTAGAGCAGTTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCGATAATTATCGTCGTCGCATCGTTAATCGCACTTCTCACGGCTTCGATGGCGGTGGTTGCGATGTCGTCGCCGACGGGATGCACCACCAGAGAGCTGCTCTT
GCTCTCGCCATGTCTGCCTTTCATTTCTGCTCCGCCGAACAATCTTTCGGATACAGTTCCCTCCGGCTGCTGCGAGGCGTTCTCCTCCGCTTACGAGTCCGCCGGCGGCA
TTTGCCTCTGTTACTTTCTTCGAGAGCCTCGGATTTTGGGCTTCCCGTTGAATAATACGAAGCTGATCGCTCTGTCTTCGTTTTGTCCTCTCAATGGCAGATTCAATTCG
GAGAAGAATAGTTCTCTGGACTCGGTCTGTGCTGCTTCACAAACTCTGCCTCCCCTTCAAAGCTCAAAGATTCCAGGAATTCAAGAACCTGATAGTCCTGCTGATGAGAA
TACCCCAGCTCCTGCGCTAGGCTCACCACCACCTGCAATTGAATCACCACCATCAGCACCTGCAGATGAATCGCCGCCGTCTCCGTCATCTGCAACAGCTAAAGGTTTGT
CATTGACAAAAGATTGTATTGGTTTGTTCTTGTCAGCTTCACTCCCCTTCCTCATCCACATTTCGTCCATTTTTAGAGCAGTTTTTTAA
Protein sequenceShow/hide protein sequence
MAAIIIVVASLIALLTASMAVVAMSSPTGCTTRELLLLSPCLPFISAPPNNLSDTVPSGCCEAFSSAYESAGGICLCYFLREPRILGFPLNNTKLIALSSFCPLNGRFNS
EKNSSLDSVCAASQTLPPLQSSKIPGIQEPDSPADENTPAPALGSPPPAIESPPSAPADESPPSPSSATAKGLSLTKDCIGLFLSASLPFLIHISSIFRAVF