CuGenDBv2

Tan0000362 (gene) of Snake gourd v1 genome

Gene ID: Tan0000362
Organism: Trichosanthes anguina (Snake gourd v1)
Description: SnoaL-like domain-containing protein
Genome location: LG11:5998221..6001692
RNA-Seq Expression: Tan0000362
Synteny: Tan0000362
Gene Ontology terms: NA
InterPro domains: IPR002075 - Nuclear transport factor 2
                  IPR032710 - NTF2-like domain superfamily
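
The genome location above is written as `seqid:start..end`. A minimal Python sketch for splitting such a string into its parts follows. It assumes the coordinates are 1-based and inclusive (this page does not state the convention), and `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (coordinates assumed 1-based, inclusive)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, so both end points are counted.
        return self.end - self.start + 1


# seqid, a colon, then start and end separated by two dots.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is after end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("LG11:5998221..6001692")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out as 3472 bp, which covers the whole gene model rather than the CDS alone.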


Homology
GenBank top hits | e value | % identity | Alignment
XP_004142769.1 uncharacterized protein LOC101216887 isoform X1 [Cucumis sativus] | 2.6e-91 | 80.09
Query:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA
        MNS+ S  S S  IFNH+ S  P    P S S T    P+   Y PLRV SSSSDNP VTVPSP TD PLDTLRSAS+VV+DFYDG+NRHDLASVE LIA
Subjt:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA

Query:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA
        +NCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDIST+D SA+GVLWHLEWKGKEFPFSKGCSFYRL   D K+QIIYARDSVEPAFKPGEMA
Subjt:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA

Query:  LTAIRGVTWLLEQFPQLADRI
        LTAIRGVTWLLEQFPQLADRI
Subjt:  LTAIRGVTWLLEQFPQLADRI

XP_008458855.1 PREDICTED: uncharacterized protein LOC103498136 isoform X1 [Cucumis melo] | 2.3e-92 | 80.54
Query:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA
        MNS+ S  S S  IFNH+ S PP    P S S T    PR T Y PLRV SSSSDNP VTVPSP TD PLDTLRSAS VV++FYDG+NRHDLASVE LIA
Subjt:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA

Query:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA
        +NCVYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDIST+D SA+GVLWHLEWKGKEFPFSKGCSFYRL   D K+QIIYARDSVEPAFKPGEMA
Subjt:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA

Query:  LTAIRGVTWLLEQFPQLADRI
        LTAIRGVTWLLEQFPQLADRI
Subjt:  LTAIRGVTWLLEQFPQLADRI

XP_022955016.1 uncharacterized protein LOC111457092 isoform X2 [Cucurbita moschata] | 3.5e-88 | 79.37
Query:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI
        M+SV SPPS  QS   FNH+ SAPPP     SQS T+TRTPRTTL FPLRVSS S          +++ P+DTL+SASDVV+ FYDG+NRHDLASVE LI
Subjt:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI

Query:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE
        A+NCVYEDLIFSRPFVGRKDIL+FFKKFNDSISKDLQFVIDDISTQD SAVGVLWHLEWKGKEFPFSKGCSFYRLVV DA KRQIIYARDSVEPA KPGE
Subjt:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE

Query:  MALTAIRGVTWLLEQFPQLADRI
        MALT IRGVTWLLE+FPQLADRI
Subjt:  MALTAIRGVTWLLEQFPQLADRI

XP_022991155.1 uncharacterized protein LOC111487838 isoform X2 [Cucurbita maxima] | 9.1e-89 | 79.82
Query:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI
        M SV SPPS  QS   FNH+ SAPPP     SQS T+TRTPRTTL FPLRVSSSS          +++ P+DTL+SASDVV+ FYDG+NRHDLASVE LI
Subjt:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI

Query:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE
        A+NCVYEDLIFSRPFVGRKDIL+FFKKFNDSISKDLQFVIDDISTQD SAVGVLWHLEWKGKEFPFSKGCSFYRLVV DA KRQIIYARDSVEPA KPGE
Subjt:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE

Query:  MALTAIRGVTWLLEQFPQLADRI
        MALT IRGVTWLLE+FPQLADRI
Subjt:  MALTAIRGVTWLLEQFPQLADRI

XP_023518153.1 uncharacterized protein LOC111781680 isoform X3 [Cucurbita pepo subsp. pepo] | 2.0e-88 | 79.37
Query:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI
        M+SV SPPS  QS   FNH+ SAPPP     SQS T+TRTPRTTL FPLR SSSS          +++ P+DTL+SASDVV+ FYDG+NRHDLASVE LI
Subjt:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI

Query:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE
        A+NCVYEDLIFSRPFVGRKDIL+FFKKFNDSISKDLQFVIDDISTQD SAVGVLWHLEWKGKEFPFSKGCSFYRLVV DA KRQIIYARDSVEPA KPGE
Subjt:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE

Query:  MALTAIRGVTWLLEQFPQLADRI
        MALT IRGVTWLLE+FPQLADRI
Subjt:  MALTAIRGVTWLLEQFPQLADRI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KQZ4 SnoaL-like domain-containing protein | 1.2e-91 | 80.09
Query:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA
        MNS+ S  S S  IFNH+ S  P    P S S T    P+   Y PLRV SSSSDNP VTVPSP TD PLDTLRSAS+VV+DFYDG+NRHDLASVE LIA
Subjt:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA

Query:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA
        +NCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDIST+D SA+GVLWHLEWKGKEFPFSKGCSFYRL   D K+QIIYARDSVEPAFKPGEMA
Subjt:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA

Query:  LTAIRGVTWLLEQFPQLADRI
        LTAIRGVTWLLEQFPQLADRI
Subjt:  LTAIRGVTWLLEQFPQLADRI

A0A1S3C8U3 uncharacterized protein LOC103498136 isoform X1 | 1.1e-92 | 80.54
Query:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA
        MNS+ S  S S  IFNH+ S PP    P S S T    PR T Y PLRV SSSSDNP VTVPSP TD PLDTLRSAS VV++FYDG+NRHDLASVE LIA
Subjt:  MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIA

Query:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA
        +NCVYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDIST+D SA+GVLWHLEWKGKEFPFSKGCSFYRL   D K+QIIYARDSVEPAFKPGEMA
Subjt:  DNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMA

Query:  LTAIRGVTWLLEQFPQLADRI
        LTAIRGVTWLLEQFPQLADRI
Subjt:  LTAIRGVTWLLEQFPQLADRI

A0A6J1BWM3 uncharacterized protein LOC111006367 isoform X1 | 8.3e-88 | 77.93
Query:  MNSVFSPPSQSTSIF-NHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI
        MNSV SPPS+S +IF N + SAPP  F       T +R P  T Y  LR+ SSSSDNP+V V SPTT+ PLD LRSASDVV+DFY GINR DL SV  LI
Subjt:  MNSVFSPPSQSTSIF-NHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRV-SSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI

Query:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEM
        ADNCVYEDLIFSRPFVGR+DILLFFKKFNDSISKDLQFVIDDIST+D SAVGVLWHLEWKGKEFPFSKGCSFYRLV+AD  RQIIY RDSVEPA KPGEM
Subjt:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEM

Query:  ALTAIRGVTWLLEQFPQLADRI
        ALTAI+GVTWLLEQFPQLADR+
Subjt:  ALTAIRGVTWLLEQFPQLADRI

A0A6J1GSJ5 uncharacterized protein LOC111457092 isoform X2 | 1.7e-88 | 79.37
Query:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI
        M+SV SPPS  QS   FNH+ SAPPP     SQS T+TRTPRTTL FPLRVSS S          +++ P+DTL+SASDVV+ FYDG+NRHDLASVE LI
Subjt:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI

Query:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE
        A+NCVYEDLIFSRPFVGRKDIL+FFKKFNDSISKDLQFVIDDISTQD SAVGVLWHLEWKGKEFPFSKGCSFYRLVV DA KRQIIYARDSVEPA KPGE
Subjt:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE

Query:  MALTAIRGVTWLLEQFPQLADRI
        MALT IRGVTWLLE+FPQLADRI
Subjt:  MALTAIRGVTWLLEQFPQLADRI

A0A6J1JS34 uncharacterized protein LOC111487838 isoform X2 | 4.4e-89 | 79.82
Query:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI
        M SV SPPS  QS   FNH+ SAPPP     SQS T+TRTPRTTL FPLRVSSSS          +++ P+DTL+SASDVV+ FYDG+NRHDLASVE LI
Subjt:  MNSVFSPPS--QSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLI

Query:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE
        A+NCVYEDLIFSRPFVGRKDIL+FFKKFNDSISKDLQFVIDDISTQD SAVGVLWHLEWKGKEFPFSKGCSFYRLVV DA KRQIIYARDSVEPA KPGE
Subjt:  ADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADA-KRQIIYARDSVEPAFKPGE

Query:  MALTAIRGVTWLLEQFPQLADRI
        MALT IRGVTWLLE+FPQLADRI
Subjt:  MALTAIRGVTWLLEQFPQLADRI

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein | 5.0e-61 | 59.51
Query:  NSAPPPRFSPPS----QSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIADNCVYEDLIFSRPFV
        N  PP  F P       SLT ++ PR +  +     +++ N VV   +PT         SAS+VV  FY  +N HDL+SV  LIA +CVYEDL+FS PFV
Subjt:  NSAPPPRFSPPS----QSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIADNCVYEDLIFSRPFV

Query:  GRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMALTAIRGVTWLLEQFP
        GRK IL FF KF +S S DLQFVIDDIST+D SAVGV WHLEWKGK FPFSKGCSFYRL V D KRQI+Y RD VEPA KPGE  L AI+GVTWLL++FP
Subjt:  GRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMALTAIRGVTWLLEQFP

Query:  QLADR
        QLAD+
Subjt:  QLADR

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein | 1.2e-25 | 32.41
Query:  SASDVVKDFYDGINRHDLASVEGLIADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRL
        S  D V  FY  IN  +   +   I+ +C  +D  F +PF G+++ + FF++   S+ ++++F ++++   D  +  V WHLEWKG++ PF++GCSFY  
Subjt:  SASDVVKDFYDGINRHDLASVEGLIADNCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRL

Query:  VVADAKRQIIYARDSVEPAFKPGEMALTAIRGVTWLLEQFPQLAD
        +    +  I  AR  +E   KPG + L+ ++ +T+L ++FP+ A+
Subjt:  VVADAKRQIIYARDSVEPAFKPGEMALTAIRGVTWLLEQFPQLAD


Sequences
CDS sequence
ATGAACTCCGTCTTCTCGCCTCCATCTCAATCTACTTCCATTTTCAACCACAGAAATTCCGCTCCTCCACCTCGTTTCTCTCCTCCATCTCAATCTCTCACTGTTACTCG
AACACCCAGAACCACACTCTATTTCCCTCTTCGAGTTTCTTCTTCTTCAGATAATCCGGTCGTCACCGTTCCATCTCCGACCACAGATATCCCTCTTGACACACTTCGAT
CAGCTTCGGATGTCGTAAAGGACTTTTACGATGGAATCAATCGCCACGACCTCGCCTCCGTCGAGGGCCTCATTGCTGACAATTGCGTTTACGAGGACCTTATCTTTTCT
CGCCCTTTCGTCGGTCGCAAGGACATTCTTCTTTTCTTCAAAAAGTTTAACGATTCCATCAGCAAGGATCTCCAGTTTGTTATTGACGATATATCCACCCAAGACCCATC
TGCTGTGGGTGTCCTTTGGCATCTAGAATGGAAAGGGAAAGAGTTTCCTTTTAGCAAGGGATGCAGCTTTTATCGCTTGGTTGTTGCTGATGCCAAGAGACAGATAATCT
ATGCACGAGACAGCGTTGAGCCTGCATTCAAGCCTGGAGAGATGGCTTTGACAGCCATTAGAGGTGTGACTTGGCTTCTGGAACAATTCCCTCAGCTAGCAGATCGGATA
TAA
mRNA sequence
AATAGCCAAATTGTAACTAACTCATCCATTGATTAATTTCTCTAGAAAAGCATACTATTGCCCATTGGAATGGAAGTTGGAACAACCACTCGTTCGTTCATGTATAGTAA
CAGAAAGAGAAGAGAATACGCCACCGATGAACTCCGTCTTCTCGCCTCCATCTCAATCTACTTCCATTTTCAACCACAGAAATTCCGCTCCTCCACCTCGTTTCTCTCCT
CCATCTCAATCTCTCACTGTTACTCGAACACCCAGAACCACACTCTATTTCCCTCTTCGAGTTTCTTCTTCTTCAGATAATCCGGTCGTCACCGTTCCATCTCCGACCAC
AGATATCCCTCTTGACACACTTCGATCAGCTTCGGATGTCGTAAAGGACTTTTACGATGGAATCAATCGCCACGACCTCGCCTCCGTCGAGGGCCTCATTGCTGACAATT
GCGTTTACGAGGACCTTATCTTTTCTCGCCCTTTCGTCGGTCGCAAGGACATTCTTCTTTTCTTCAAAAAGTTTAACGATTCCATCAGCAAGGATCTCCAGTTTGTTATT
GACGATATATCCACCCAAGACCCATCTGCTGTGGGTGTCCTTTGGCATCTAGAATGGAAAGGGAAAGAGTTTCCTTTTAGCAAGGGATGCAGCTTTTATCGCTTGGTTGT
TGCTGATGCCAAGAGACAGATAATCTATGCACGAGACAGCGTTGAGCCTGCATTCAAGCCTGGAGAGATGGCTTTGACAGCCATTAGAGGTGTGACTTGGCTTCTGGAAC
AATTCCCTCAGCTAGCAGATCGGATATAATCCTACTCTCTCGAGGAAGAATTTATTAATGTATTACAAATTTACAACCGATATTCTTAAGTACATATGTACATTTAATTT
AGCAATAGATCAAATTGTATAAATCCTTACAAAATCCACAAACAGTGATGTTCTGTCATTGCTGTATGTTTTTTTTCCTTGCTAGTTTATAATTTAAAAATAAAATGAAC
GTCAGATGAGGCTTTTTTTTTT
Protein sequence
MNSVFSPPSQSTSIFNHRNSAPPPRFSPPSQSLTVTRTPRTTLYFPLRVSSSSDNPVVTVPSPTTDIPLDTLRSASDVVKDFYDGINRHDLASVEGLIADNCVYEDLIFS
RPFVGRKDILLFFKKFNDSISKDLQFVIDDISTQDPSAVGVLWHLEWKGKEFPFSKGCSFYRLVVADAKRQIIYARDSVEPAFKPGEMALTAIRGVTWLLEQFPQLADRI
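
One way to check that the CDS and protein sequences listed above agree is to translate the CDS and compare the result with the protein. The Python sketch below is illustrative only: `translate` is a hypothetical helper, and it assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order so the amino-acid string lines up with the codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace (for example, line wrapping) is removed first. Raises
    ValueError if the length is not a multiple of three, and KeyError
    if a codon contains anything other than A, C, G or T.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First five codons of the CDS listed above.
print(translate("ATGAACTCCGTCTTC"))  # MNSVF
```

To compare the full record, join the wrapped CDS lines into one string, pass it to `translate`, and compare the result with the joined protein lines.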