; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019929 (gene) of Snake gourd v1 genome

Gene IDTan0019929
OrganismTrichosanthes anguina (Snake gourd v1)
Description101 kDa malaria antigen-like
Genome locationLG05:77868825..77870720
RNA-Seq ExpressionTan0019929
SyntenyTan0019929
Gene Ontology termsNA
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583890.1 hypothetical protein SDJN03_19822, partial [Cucurbita argyrosperma subsp. sororia]1.9e-3553Show/hide
Query:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL
        M MK  +  LHL+AE AL F+AFLLS+FIY S FNLSLS   L   T FWFC+SNTLIFIIAA   AFSPP              +A  IP    P++N 
Subjt:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL

Query:  KIVVIEPPNSGKPTEH-EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEFI
                NSG PTE+ +++IP  TE+ I+   + +KSY R+KSEK  R   KE KI M+RSKT  R++A++T+ EKEE KKE AEM++EELN+RVEEFI
Subjt:  KIVVIEPPNSGKPTEH-EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEFI

Query:  ERFNRQIRLQEIGHGDE
        ERFNRQ++LQ IG  +E
Subjt:  ERFNRQIRLQEIGHGDE

KAG7019508.1 hypothetical protein SDJN02_18469, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-3553Show/hide
Query:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL
        M MK  +  LHL+AE AL F+AFLLS+FIY S FNLSLS   L   T FWFC+SNTLIFIIAA   AFSPP              +A  IP    P++N 
Subjt:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL

Query:  KIVVIEPPNSGKPTEH-EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEFI
                NSG PTE+ +++IP  TE+ I+   + +KSY R+KSEK  R   KE KI M+RSKT  R++A++T+ EKEE KKE AEM++EELN+RVEEFI
Subjt:  KIVVIEPPNSGKPTEH-EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEFI

Query:  ERFNRQIRLQEIGHGDE
        ERFNRQ+RLQ IG  ++
Subjt:  ERFNRQIRLQEIGHGDE

XP_022927241.1 uncharacterized protein LOC111434146 [Cucurbita moschata]7.4e-3552.07Show/hide
Query:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL
        M MK  + +LHL+AE AL F AFLLS+FIY S FNLSLS   L   T FWFC+SNTLIFIIAA   AFS PP           N AA+ +     P +N 
Subjt:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL

Query:  KIVVIEPPNSGKPTEH--EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEF
                NSG PTE+  +++IP  TE+ I+   + +KSY R+KSEK  R   KE KI M+RSKT  R++A+ T+ EKEE K E AEM++EELN++VEEF
Subjt:  KIVVIEPPNSGKPTEH--EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEF

Query:  IERFNRQIRLQEIGHGD
        IERFNRQ+RLQ IG  +
Subjt:  IERFNRQIRLQEIGHGD

XP_023520396.1 uncharacterized protein LOC111783708 isoform X2 [Cucurbita pepo subsp. pepo]1.5e-3552.53Show/hide
Query:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL
        M MK  + +LHL+AE AL F AFLLS+FIY S FNLSLS   L   T FWFC+SNTLIFIIAA   AFSPPP           N AA  IP    P++N 
Subjt:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL

Query:  KIVVIEPPNSGKPTEH--EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKKEFAEMTDEELNRRVEEFIE
                NSG PTE+  +++IP  TE+ I+   +S+KSY  +KSEK  R   KE K+ M+RSKT  R+E     ++E K E AEM++EELN+RVEEFIE
Subjt:  KIVVIEPPNSGKPTEH--EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKKEFAEMTDEELNRRVEEFIE

Query:  RFNRQIRLQEIGHGDEE
        RFNRQ+RLQ IG  +EE
Subjt:  RFNRQIRLQEIGHGDEE

XP_038895571.1 uncharacterized protein LOC120083777 [Benincasa hispida]5.5e-3856.25Show/hide
Query:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL
        MG+K  K +LHLFA+ AL    FLLS FIY S+      + +L N T FWF ++NTLIFIIAAD GAFSPP              A   +P    P+D  
Subjt:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL

Query:  KIVVI-EPPNSGKPTEHEEE----IPHTTEVFIHP--IPSSKKSYRRNKSEK-VKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKK-EFAEMTDEELNR
        KIVV+ EPPNS   T++EEE    I  TTE    P    +S+KSY+R+KSEK +KR+ EK  KITMKRSKT IRH+ + TK+KEE+K EFAEMT+EELNR
Subjt:  KIVVI-EPPNSGKPTEHEEE----IPHTTEVFIHP--IPSSKKSYRRNKSEK-VKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKK-EFAEMTDEELNR

Query:  RVEEFIERFNRQIRLQEIG-HGDE
        RVEEFIERFNRQIRLQEI  HG+E
Subjt:  RVEEFIERFNRQIRLQEIG-HGDE

TrEMBL top hitse value%identityAlignment
A0A0A0LUF0 Uncharacterized protein6.7e-3450.68Show/hide
Query:  KNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNLK--IVV
        K +L+LFA+GAL  I+F    FIY S+      + DL N T FWF +SNTLIF+IA D GAFS P            +   A  PN  +P+ N    IVV
Subjt:  KNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNLK--IVV

Query:  IEPPNSGKPTEHEEE---IPHTTEV-----FIHPIPSSKKSYRRNKSEK-VKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKK-EFAEMTDEELNRRVE
         + PNS  P ++EEE   IP TTE+     F +PI    K Y+R+KSEK +KRM EK KK+ M+RSKT I+   + TKEKEE+  EF +MTDEELNRRVE
Subjt:  IEPPNSGKPTEHEEE---IPHTTEV-----FIHPIPSSKKSYRRNKSEK-VKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKK-EFAEMTDEELNRRVE

Query:  EFIERFNRQIRLQEIGHGDEE
        EFIERFNRQIRLQE+   + E
Subjt:  EFIERFNRQIRLQEIGHGDEE

A0A1S3B9V3 101 kDa malaria antigen-like6.7e-3452.23Show/hide
Query:  KNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNLK--IVV
        K +LHLFA+ AL  I F    F +   F+LS    D  N T FWF +SNTLIFIIA D GAFS P                A  PN  +P+ N    IVV
Subjt:  KNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNLK--IVV

Query:  IEPPNSGKPTEHEEE---------IPHTTEVFIHP-IPSSKKSYRRNKSEK-VKRMAEKEKKITMKRSKTTIRHEASMTKEK--------EEKKEFAEMT
         E PNS  P + EEE         IP TTE+ I P   +  K Y+R+KSEK +KRMA K KKITMKRSKT IR +A+ TKEK        EEK EF +MT
Subjt:  IEPPNSGKPTEHEEE---------IPHTTEVFIHP-IPSSKKSYRRNKSEK-VKRMAEKEKKITMKRSKTTIRHEASMTKEK--------EEKKEFAEMT

Query:  DEELNRRVEEFIERFNRQIRLQEI
        DEELNRRVEEFIERFNRQIRLQ+I
Subjt:  DEELNRRVEEFIERFNRQIRLQEI

A0A6J1CEX1 uncharacterized protein LOC1110108731.1e-3150.47Show/hide
Query:  AKNYLHLFAEGALSFIA-FLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNLKIVV
        A+ YL+L AEGA SF A FLLSIF Y+SV NLS     L +D KFWFCISNTLIFIIA D GAFSPPP                QI ++Q+    +KIV 
Subjt:  AKNYLHLFAEGALSFIA-FLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNLKIVV

Query:  IEPPNSGKPTEHEEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKT-TIRHEASMTKEKEEKKEFAEMTDEELNRRVEEFIERFNRQ
          P  S + T                  +S K+YRR+KSEK     EKE+KI M+RSKT TI+      +  +E  EF EMTDEELNRRVEEFIERFNR+
Subjt:  IEPPNSGKPTEHEEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKT-TIRHEASMTKEKEEKKEFAEMTDEELNRRVEEFIERFNRQ

Query:  IRLQEIGHGDEE
        IRLQ++ +GD +
Subjt:  IRLQEIGHGDEE

A0A6J1EH50 uncharacterized protein LOC1114341463.6e-3552.07Show/hide
Query:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL
        M MK  + +LHL+AE AL F AFLLS+FIY S FNLSLS   L   T FWFC+SNTLIFIIAA   AFS PP           N AA+ +     P +N 
Subjt:  MGMK-AKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL

Query:  KIVVIEPPNSGKPTEH--EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEF
                NSG PTE+  +++IP  TE+ I+   + +KSY R+KSEK  R   KE KI M+RSKT  R++A+ T+ EKEE K E AEM++EELN++VEEF
Subjt:  KIVVIEPPNSGKPTEH--EEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTK-EKEE-KKEFAEMTDEELNRRVEEF

Query:  IERFNRQIRLQEIGHGD
        IERFNRQ+RLQ IG  +
Subjt:  IERFNRQIRLQEIGHGD

A0A6J1KIQ0 uncharacterized protein LOC1114955971.8e-3451.14Show/hide
Query:  MGMKAKN-YLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL
        M MK ++  LHL+AE AL F AFLLS+FIY S FNLSLS   L     FWFC+SNTL+ IIAA   AFSPPP                          N 
Subjt:  MGMKAKN-YLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNL

Query:  KIVVIEPPN---SGKPTEH-EEEIPHTTEVFIH-PIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKKEFAEMTDEELNRRVEE
         I+   PPN   SG PTE+ E++IP  TE+ I   + +S+KSY R+KSEK  R   KE KI M+RSKT  R+E     E+E K E AEM++EELN+RVEE
Subjt:  KIVVIEPPN---SGKPTEH-EEEIPHTTEVFIH-PIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKKEFAEMTDEELNRRVEE

Query:  FIERFNRQIRLQEIGHGDE
        FIERFNRQIRLQ IG  +E
Subjt:  FIERFNRQIRLQEIGHGDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30190.1 unknown protein1.5e-0932.13Show/hide
Query:  SFIAFLLSIFIYISVFNL-SLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPH------------FHRRGDR---------RRNKAAAQIPNQ---
        S +   L IF YI +F++  +S   +  DTK  F ISNTLI IIAAD G+FS                    R D          R N    +I N    
Subjt:  SFIAFLLSIFIYISVFNL-SLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPH------------FHRRGDR---------RRNKAAAQIPNQ---

Query:  --QNPEDN-----LKIVVIEPPN------SGKPTEHE---EEIPHTTEVFI----------HPIPSSKKSYRRNKSEKVKRM-----AEKEKKITMKRSK
          +NPE+        I+ + PP       S K    +   EE    TE  +          H  P+  K Y R+KS+K +R       E  K+ +  R K
Subjt:  --QNPEDN-----LKIVVIEPPN------SGKPTEHE---EEIPHTTEVFI----------HPIPSSKKSYRRNKSEKVKRM-----AEKEKKITMKRSK

Query:  TTIRHEASMTKE----KEEKKEFAEMTDEELNRRVEEFIERFNRQIRLQ
        +       + ++    KEE +EF+++++EELN+RVEEFI+RFNRQIR Q
Subjt:  TTIRHEASMTKE----KEEKKEFAEMTDEELNRRVEEFIERFNRQIRLQ

AT2G34610.1 unknown protein7.4e-0929.39Show/hide
Query:  SFIAFLLSIFIYISVFN-LSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSP---------------------PPPHFHRRG------DRRRNK-----
        S +  +LSIF YI +F+ L +S   + NDTK  F ISN LI IIAAD GAF+                      P P     G       + R K     
Subjt:  SFIAFLLSIFIYISVFN-LSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSP---------------------PPPHFHRRG------DRRRNK-----

Query:  AAAQIPNQQNPEDNLKIV--VIEPPNSGKPTEHE-EEIPHTTEVFIHPIPS-----------SKKSYRRNKSEKVKRMA---EKEKKITMKRSKTTIRHE
         A  + +Q  P    K+   +I+  +  +P     E+    TE  +H I +           + K+Y R+KS+K +      E+ ++    R K+  R +
Subjt:  AAAQIPNQQNPEDNLKIV--VIEPPNSGKPTEHE-EEIPHTTEVFIHPIPS-----------SKKSYRRNKSEKVKRMA---EKEKKITMKRSKTTIRHE

Query:  ASMTK-----------------------EKEEKKEFAEMTDEELNRRVEEFIERFNRQIRLQ
        +  +K                        KEE +EF++M++EELNRRVE+FI+RFNR I+ Q
Subjt:  ASMTK-----------------------EKEEKKEFAEMTDEELNRRVEEFIERFNRQIRLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATGAAAGCCAAAAACTACCTCCATTTATTTGCAGAGGGAGCTCTTTCTTTCATCGCCTTTCTTCTCTCCATTTTCATTTACATCTCTGTTTTCAATCTCTCTCT
TTCAAACGATGATCTCTTAAACGACACAAAGTTCTGGTTTTGCATCTCCAACACTCTCATTTTCATCATTGCCGCCGATTGCGGCGCTTTCTCTCCTCCGCCGCCACACT
TCCACCGCCGAGGAGATCGCCGGAGGAACAAAGCTGCCGCCCAAATCCCAAACCAACAAAACCCAGAAGATAACTTGAAGATTGTCGTTATAGAACCTCCTAATTCAGGC
AAACCAACTGAACATGAAGAAGAAATCCCACATACCACAGAAGTTTTCATTCATCCCATACCTAGTTCGAAGAAATCTTATCGACGAAACAAGTCGGAGAAAGTGAAAAG
AATGGCGGAGAAAGAAAAGAAGATCACAATGAAGAGATCGAAGACGACGATAAGACACGAAGCGTCGATGACGAAGGAGAAAGAAGAGAAGAAGGAGTTTGCAGAGATGA
CAGATGAAGAACTCAACAGAAGAGTTGAAGAATTTATTGAGAGATTCAACAGACAGATAAGACTTCAAGAAATTGGACATGGAGATGAGGAGTAG
mRNA sequenceShow/hide mRNA sequence
GTCTTCCTTCTTTAGTTCACAACCAAACCATTAATGGGCATGAAAGCCAAAAACTACCTCCATTTATTTGCAGAGGGAGCTCTTTCTTTCATCGCCTTTCTTCTCTCCAT
TTTCATTTACATCTCTGTTTTCAATCTCTCTCTTTCAAACGATGATCTCTTAAACGACACAAAGTTCTGGTTTTGCATCTCCAACACTCTCATTTTCATCATTGCCGCCG
ATTGCGGCGCTTTCTCTCCTCCGCCGCCACACTTCCACCGCCGAGGAGATCGCCGGAGGAACAAAGCTGCCGCCCAAATCCCAAACCAACAAAACCCAGAAGATAACTTG
AAGATTGTCGTTATAGAACCTCCTAATTCAGGCAAACCAACTGAACATGAAGAAGAAATCCCACATACCACAGAAGTTTTCATTCATCCCATACCTAGTTCGAAGAAATC
TTATCGACGAAACAAGTCGGAGAAAGTGAAAAGAATGGCGGAGAAAGAAAAGAAGATCACAATGAAGAGATCGAAGACGACGATAAGACACGAAGCGTCGATGACGAAGG
AGAAAGAAGAGAAGAAGGAGTTTGCAGAGATGACAGATGAAGAACTCAACAGAAGAGTTGAAGAATTTATTGAGAGATTCAACAGACAGATAAGACTTCAAGAAATTGGA
CATGGAGATGAGGAGTAGAACTATACCAAAACAGAAGCTTTTGTTGTCTATACAATAATATGCATTAATTTCTTGTTTTGTTTTTTCTTGTGTGTGTTTGTCTTCTTCCT
CAGTTGTCTCTTTCCAAATAGCTACAGTCTTTTTTTAAAAAAAAAAAATTATTTCAACTTTTTTTTAATTTATGGAGCTCTCTTTATGATCTTACACATATAATCATCTC
CCCTTCTCCTTGTTATTGTTTTGATTCTTCATAGGTTTTTTGAAATTGACCAATATGATGCCTTTTTTAATTTGTGTCATATTGTACATATCAACCATGGTTGATTCTCT
TTATCTAAATTCAGTTAGG
Protein sequenceShow/hide protein sequence
MGMKAKNYLHLFAEGALSFIAFLLSIFIYISVFNLSLSNDDLLNDTKFWFCISNTLIFIIAADCGAFSPPPPHFHRRGDRRRNKAAAQIPNQQNPEDNLKIVVIEPPNSG
KPTEHEEEIPHTTEVFIHPIPSSKKSYRRNKSEKVKRMAEKEKKITMKRSKTTIRHEASMTKEKEEKKEFAEMTDEELNRRVEEFIERFNRQIRLQEIGHGDEE