; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019709 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019709
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionthymocyte nuclear protein 1
Genome locationchr5:44772523..44772999
RNA-Seq ExpressionLag0019709
SyntenyLag0019709
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR002740 - EVE domain
IPR015947 - PUA-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600275.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.3e-7691.08Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MA+AAAG KKQ+WLLKTEPA+WSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWYSE DG DAVVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
        AVDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

KAG7030934.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-7690.45Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MA+AAAG KKQ+WLLKTEPA+WSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGV++VAREWYSE DG DAVVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
        AVDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

XP_022941981.1 thymocyte nuclear protein 1 [Cucurbita moschata]6.4e-7690.45Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MA+AAAG KKQ+WLLKTEPA+WSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWY E DG DAVVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
        AVDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

XP_023530564.1 thymocyte nuclear protein 1 [Cucurbita pepo subsp. pepo]3.7e-7690.45Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MA AA G KKQ+WLLKTEPA+WSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARR+VGVV+VAREWYSE DG DAVVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
         VDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

XP_038905606.1 thymocyte nuclear protein 1 [Benincasa hispida]8.3e-7691.77Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MA   AG +KQYWLLKTEPA+WSWADQAAN GRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS AD  D VVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSEG
         VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEGGHGSEG
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSEG

TrEMBL top hitse value%identityAlignment
A0A1S4E0I6 thymocyte nuclear protein 18.4e-7488.54Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MAN  AG K+QYWLLKTEPA+WSWADQAAN GR+KWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVVAVAREWYS  +  + VVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
         VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEGGHGS+
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

A0A5D3E131 Thymocyte nuclear protein 18.4e-7488.54Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MAN  AG K+QYWLLKTEPA+WSWADQAAN GR+KWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVVAVAREWYS  +  + VVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
         VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEGGHGS+
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

A0A6J1FML1 thymocyte nuclear protein 13.1e-7690.45Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MA+AAAG KKQ+WLLKTEPA+WSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWY E DG DAVVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
        AVDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

A0A6J1JS35 thymocyte nuclear protein 12.2e-7487.9Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MA+AA G KKQ+WLLKTEPA+WSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARR+VGVV+VAREWYSE DG D VVDVE VGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
         VDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HG+E
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

E5GBH2 EVE domain-containing protein8.4e-7488.54Show/hide
Query:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE
        MAN  AG K+QYWLLKTEPA+WSWADQAAN GR+KWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVVAVAREWYS  +  + VVDVEAVGEMRE
Subjt:  MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMRE

Query:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
         VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEGGHGS+
Subjt:  AVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE

SwissProt top hitse value%identityAlignment
Q6P3E0 Thymocyte nuclear protein 15.2e-1233.54Show/hide
Query:  YWLLKTEPAD---------WSWAD-QAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SEAD
        YWL+K+EP           +S  D QA     T WDGV+N QA   L++MKL D  FFYHS  K   +VG++ + +E Y                  E +
Subjt:  YWLLKTEPAD---------WSWAD-QAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SEAD

Query:  GSDAVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL
           ++VDV+ V  M+  + L E+K      K   G +K+  LF + RLSV P+ +E ++ I  L
Subjt:  GSDAVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL

Q6PFL8 Thymocyte nuclear protein 11.4e-0930.67Show/hide
Query:  WLLKTEP---------ADWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY----------------SEADGS
        WL+K+EP           +   D  A   +T  WDGV+N QA+  ++ MK+G   FFYHS  K   + G++ + +E Y                S+AD  
Subjt:  WLLKTEP---------ADWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY----------------SEADGS

Query:  D-AVVDVEAVGEMREAVDLKEMKK-----GMEG--MKNFALFRQPRLSVVPVAKEIWEKICEL
           +VDV+    ++  + L E+KK      ++G  +K+ ALF + RLSV P+  E +E +  L
Subjt:  D-AVVDVEAVGEMREAVDLKEMKK-----GMEG--MKNFALFRQPRLSVVPVAKEIWEKICEL

Q90679 Thymocyte nuclear protein 15.2e-1232.32Show/hide
Query:  YWLLKTEP---------ADWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---SEADGSD-----------
        +WLLK+EP           +S  D  A   +T  W+GV+N QA+  L++MKLG   FFYHS  K   +VG+V + +E Y   ++ D  D           
Subjt:  YWLLKTEP---------ADWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---SEADGSD-----------

Query:  ---AVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL
           ++VDV+ V   +  + L E+K      K   G +KN  LF + RLS+ P+ +E ++ +  L
Subjt:  ---AVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL

Q91YJ3 Thymocyte nuclear protein 12.7e-1333.92Show/hide
Query:  GGK--KQYWLLKTEPAD---------WSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY--------------
        GGK    YWL+K+EP           +S  D  A   +T  WDGV+N QA+  L++MKL D  FFYHS  K   +VG++ + +E Y              
Subjt:  GGK--KQYWLLKTEPAD---------WSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY--------------

Query:  ---SEADGSDAVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL
            E D   ++VDV+ V  M+  + L+E+K      K   G +K+  LF + RLSV P+ +E ++ I  L
Subjt:  ---SEADGSDAVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL

Q9P016 Thymocyte nuclear protein 16.7e-1231.1Show/hide
Query:  YWLLKTEP---------ADWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SEAD
        +WL+K+EP           +S  D  A   +T  WDGV+N QA+  L++MKLG+  FFYHS  K   + G++ + +E Y                  E +
Subjt:  YWLLKTEP---------ADWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SEAD

Query:  GSDAVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL
           ++VDV+ V  M+  + L E+K      K   G +KN  LF + RLS+ P+ +E ++ +  L
Subjt:  GSDAVVDVEAVGEMREAVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWEKICEL

Arabidopsis top hitse value%identityAlignment
AT2G14660.1 unknown protein3.1e-5266.03Show/hide
Query:  GGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS-EADG--SDAVVDVEAVGEMREAVD
        G  K+YWLLKTEP +WSW+DQ +NGG +KWDGVKNKQAQK+LKSM LGDLCFFYHSG K+R VVGVV V+REWY+ +A+G   +  VDV+A+GEMR+ VD
Subjt:  GGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS-EADG--SDAVVDVEAVGEMREAVD

Query:  LKEMKKGMEGM--KNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE
        LKEM KG +G+  K F LFRQPRLSVVPV +++W  ICELG GF GDG E    S+
Subjt:  LKEMKKGMEGM--KNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGCCGCCGCCGGAGGCAAAAAACAGTACTGGCTTCTGAAGACGGAGCCGGCGGACTGGTCTTGGGCCGACCAAGCCGCGAACGGCGGGCGAACAAAGTGGGA
CGGCGTCAAGAACAAGCAAGCTCAGAAGCATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCGTCGTGGGCGTGGTCG
CCGTCGCACGAGAGTGGTACTCGGAGGCAGACGGCAGCGACGCCGTCGTCGACGTCGAGGCGGTGGGGGAAATGAGGGAAGCGGTGGATTTGAAAGAGATGAAGAAGGGG
ATGGAAGGGATGAAGAATTTCGCGCTGTTTCGGCAACCGAGGCTGTCGGTTGTGCCGGTCGCGAAGGAGATTTGGGAGAAGATCTGCGAATTGGGAGGCGGATTTGAAGG
GGATGGAACGGAGGGCGGCCATGGGAGTGAGGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGCCGCCGCCGGAGGCAAAAAACAGTACTGGCTTCTGAAGACGGAGCCGGCGGACTGGTCTTGGGCCGACCAAGCCGCGAACGGCGGGCGAACAAAGTGGGA
CGGCGTCAAGAACAAGCAAGCTCAGAAGCATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCGTCGTGGGCGTGGTCG
CCGTCGCACGAGAGTGGTACTCGGAGGCAGACGGCAGCGACGCCGTCGTCGACGTCGAGGCGGTGGGGGAAATGAGGGAAGCGGTGGATTTGAAAGAGATGAAGAAGGGG
ATGGAAGGGATGAAGAATTTCGCGCTGTTTCGGCAACCGAGGCTGTCGGTTGTGCCGGTCGCGAAGGAGATTTGGGAGAAGATCTGCGAATTGGGAGGCGGATTTGAAGG
GGATGGAACGGAGGGCGGCCATGGGAGTGAGGGATAG
Protein sequenceShow/hide protein sequence
MANAAAGGKKQYWLLKTEPADWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSEADGSDAVVDVEAVGEMREAVDLKEMKKG
MEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGGHGSEG