; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000271 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000271
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr4:2702376..2705458
RNA-Seq ExpressionLag0000271
SyntenyLag0000271
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR017956 - AT hook, DNA-binding motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8597368.1 hypothetical protein GDO81_002266 [Engystomops pustulosus]2.5e-1033.52Show/hide
Query:  RRGHPRGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPKGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSL
        R GH       LG S+ R  +FPGP                 S+ R R+FPGP   R  H        G S+ R R+FPGP     GH        G S+
Subjt:  RRGHPRGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPKGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSL

Query:  GRPRNFPGPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKRCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPRGAYTL
         R R+F GP   R GH       LG S+ R  +FPGP   R GH                 FPGP   R+GH  G   L
Subjt:  GRPRNFPGPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKRCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPRGAYTL

KAG9267222.1 hypothetical protein AMEX_G18039, partial [Astyanax mexicanus]3.3e-0731.76Show/hide
Query:  YRTQ-SSPIPCLMVGLSLDMRTSAFPAWFEKFEGSLGRLWDFPR-------PLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLG
        Y+TQ  SP+P           T+A PA   +    L +    PR       P  K  GHPR   T    +    R  P PL KH G PR  Y        
Subjt:  YRTQ-SSPIPCLMVGLSLDMRTSAFPAWFEKFEGSLGRLWDFPR-------PLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLG

Query:  RPRNFPGPLFKRRGHPKGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLGRPRNFPG----PLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKR
             P PL K+ GHP+  Y     S       P PL KH GHPR           RP   P     P  K  GHPR   T    +    R  P PL K 
Subjt:  RPRNFPGPLFKRRGHPKGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLGRPRNFPG----PLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKR

Query:  CGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPR
           PR  Y             P PL K+ GHPR
Subjt:  CGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPR

XP_015472393.2 basic proline-rich protein-like, partial [Parus major]4.1e-1338Show/hide
Query:  GSLGRLWDFPRPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPKGAYTLLGKSLGRPRNFPGPLFKH
        G  G     P P  K  G P       G + G P   PGPL    G    A   LGK+ G P    GPL    G P  A   LG   G P   PGP    
Subjt:  GSLGRLWDFPRPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPKGAYTLLGKSLGRPRNFPGPLFKH

Query:  CGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKRCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPRGAYTLLGKKLG
         G P  A  L G++   PR  PGPL K  G P       G   G P   PGPL K  G P      LGK+LG PR  PGPL K  G P     LLG   G
Subjt:  CGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKRCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPRGAYTLLGKKLG

TrEMBL top hitse value%identityAlignment
No hits found
SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCAAGAGAAGAGTTAGGGAAAAGGCTAGGGTCGGCCTCGGCCTCAAGCCCCGTTAGCCTAAAATTTGACTCCCTGCATCCATATGAGAATACCATTAAAGTTTG
TGCCTGGCCTCGGCAAGAGGTGCTCGGGCCAGAGGAGGTGGATAAACTGGACAAAGTAGAGTTGAGAGATTACAACAATGGCTTGGTCGGCCTCGGGAAGAGGCCGAGCA
CATCAATCTCTCTCTTTAGCTTATCTGGTCGGCCTCGGCCTCGGGAAGAGGCCGAGCACAGCAGCTTCCCGTTTTGGGTTATCTGGTCGGCCTTGGGAAAAGGCCGAGCA
CAGCAGCTAAACCCCTGTACTTGGTCATTCGGAACGCTTCATTCCAGATCCGGGGCTTTTGCAGCATCTCCCTGCGGCTTTGGAGCTAGGATGCTCGGCCTCGGCCTTGG
CATGAGGCTGAGCATTTCTTCTGCTTTGCTTGCTACTTTTCTCTGTTGTATCCATCCCGAGGGGGCGGAACTCGACCTTGGCCTCTGGATGGTCGGCCTCGGCCTTGGCA
TAAGGACGACCAGCTCCTCTGTGTTGTTCGCTTGCTTGTTTTCCATTCCAAAGTTCATTCTTGAAGCTGAGGCAGCCGGCCCTGACCTTGGCATGAGGTTGAGCCTCGCT
TCTCCATCACTTGTTTGTTTGCCTCATAGTTGGACTCATGTGCAGATACCTGGTCTGCGAAGTGACTACCGGACTCAGTCGAGCCCTATTCCTTGTCTAATGGTCGGCCT
TAGCCTTGACATGAGGACGAGCGCTTTTCCTGCATGGTTTGAGAAGTTTGAGGGAAGCTTGGGGAGGCTTTGGGATTTCCCAAGGCCTCTCTTCAAGCGTCGTGGTCATC
CCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCATTGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGG
AAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTCGTGGTCATCCCAAGGGTGCGTATACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTT
CCCAGGGCCTCTCTTCAAGCATTGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTCGTG
GTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTTGTGGTCATCCCAGGGGTGCGTACACTCTT
CTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTCGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAAGCTTGGGGAGGCCTG
GGAATTTCTCAAGCCTCTCTTCAAGCTGCATAGTCATCCCAGGGAGCATCTCCCTTTCGGCTTGAACCGCTTCGGCTACGATAACAGGTCAAGTGTTGTGGTCTTCCCAG
GGGTGCGTACACTCCTCTGGGGAAAAGCTTTGGGAGGCCTGGGAATCTCCCGAGTCTCTCTGCAAGCGCTGTCAAAACTTTTGAAACAAAATTCAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCAAGAGAAGAGTTAGGGAAAAGGCTAGGGTCGGCCTCGGCCTCAAGCCCCGTTAGCCTAAAATTTGACTCCCTGCATCCATATGAGAATACCATTAAAGTTTG
TGCCTGGCCTCGGCAAGAGGTGCTCGGGCCAGAGGAGGTGGATAAACTGGACAAAGTAGAGTTGAGAGATTACAACAATGGCTTGGTCGGCCTCGGGAAGAGGCCGAGCA
CATCAATCTCTCTCTTTAGCTTATCTGGTCGGCCTCGGCCTCGGGAAGAGGCCGAGCACAGCAGCTTCCCGTTTTGGGTTATCTGGTCGGCCTTGGGAAAAGGCCGAGCA
CAGCAGCTAAACCCCTGTACTTGGTCATTCGGAACGCTTCATTCCAGATCCGGGGCTTTTGCAGCATCTCCCTGCGGCTTTGGAGCTAGGATGCTCGGCCTCGGCCTTGG
CATGAGGCTGAGCATTTCTTCTGCTTTGCTTGCTACTTTTCTCTGTTGTATCCATCCCGAGGGGGCGGAACTCGACCTTGGCCTCTGGATGGTCGGCCTCGGCCTTGGCA
TAAGGACGACCAGCTCCTCTGTGTTGTTCGCTTGCTTGTTTTCCATTCCAAAGTTCATTCTTGAAGCTGAGGCAGCCGGCCCTGACCTTGGCATGAGGTTGAGCCTCGCT
TCTCCATCACTTGTTTGTTTGCCTCATAGTTGGACTCATGTGCAGATACCTGGTCTGCGAAGTGACTACCGGACTCAGTCGAGCCCTATTCCTTGTCTAATGGTCGGCCT
TAGCCTTGACATGAGGACGAGCGCTTTTCCTGCATGGTTTGAGAAGTTTGAGGGAAGCTTGGGGAGGCTTTGGGATTTCCCAAGGCCTCTCTTCAAGCGTCGTGGTCATC
CCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCATTGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGG
AAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTCGTGGTCATCCCAAGGGTGCGTATACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTT
CCCAGGGCCTCTCTTCAAGCATTGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTCGTG
GTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTTGTGGTCATCCCAGGGGTGCGTACACTCTT
CTGGGGAAAAGCTTGGGGAGGCCTAGGAATTTCCCAGGGCCTCTCTTCAAGCGTCGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAAGCTTGGGGAGGCCTG
GGAATTTCTCAAGCCTCTCTTCAAGCTGCATAGTCATCCCAGGGAGCATCTCCCTTTCGGCTTGAACCGCTTCGGCTACGATAACAGGTCAAGTGTTGTGGTCTTCCCAG
GGGTGCGTACACTCCTCTGGGGAAAAGCTTTGGGAGGCCTGGGAATCTCCCGAGTCTCTCTGCAAGCGCTGTCAAAACTTTTGAAACAAAATTCAGAGTAA
Protein sequenceShow/hide protein sequence
MSAREELGKRLGSASASSPVSLKFDSLHPYENTIKVCAWPRQEVLGPEEVDKLDKVELRDYNNGLVGLGKRPSTSISLFSLSGRPRPREEAEHSSFPFWVIWSALGKGRA
QQLNPCTWSFGTLHSRSGAFAASPCGFGARMLGLGLGMRLSISSALLATFLCCIHPEGAELDLGLWMVGLGLGIRTTSSSVLFACLFSIPKFILEAEAAGPDLGMRLSLA
SPSLVCLPHSWTHVQIPGLRSDYRTQSSPIPCLMVGLSLDMRTSAFPAWFEKFEGSLGRLWDFPRPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLG
KSLGRPRNFPGPLFKRRGHPKGAYTLLGKSLGRPRNFPGPLFKHCGHPRGAYTLLGKSLGRPRNFPGPLFKRRGHPRGAYTLLGKSLGRPRNFPGPLFKRCGHPRGAYTL
LGKSLGRPRNFPGPLFKRRGHPRGAYTLLGKKLGEAWEFLKPLFKLHSHPREHLPFGLNRFGYDNRSSVVVFPGVRTLLWGKALGGLGISRVSLQALSKLLKQNSE