; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030138 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030138
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionAdenine deaminase
Genome locationscaffold6:11133397..11142613
RNA-Seq ExpressionSpg030138
SyntenySpg030138
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136930.1 uncharacterized protein LOC111008504 [Momordica charantia]7.9e-11184.17Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        Q A VSNGVHVQEKLAKV  LDEAERHCSLEILPILFE+ASFPFQ+SS RDSSG  S EEFDNSPDCDPHLAFLS LEVTHPTKSRMSLE+SDTRLTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKLKTSK C+G FE+LKT+NTLVRIEKVLQRQSSLK+GVKLV YLLDHGLMLLKFS+K                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSV
          TERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ  DGSV
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSV

XP_022941601.1 uncharacterized protein LOC111446909 [Cucurbita moschata]2.1e-11184.67Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        QAAT+SNGVHVQEKLAKVT  DEAERH SLEILP LFE+ASFP Q+S ARDSSGF STEEFDNSPDCDPHLAFLSFLEVTHPT S+MSL +SD  LTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLL+FSSK                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
        S TERVHD PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

XP_022991750.1 uncharacterized protein LOC111488280 [Cucurbita maxima]2.3e-11083.91Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        QAA +SNGVHVQEKLAKVT  DE ERH SLEILP LF++ASFP Q+S ARDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT S+MSL +SD  LTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSK                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
        S TERVHD PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

XP_023531596.1 uncharacterized protein LOC111793785 [Cucurbita pepo subsp. pepo]8.8e-11083.91Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        QAAT+SNGVHVQEKLAKV   DE ERH SLEILP LFE+ASFP Q+S ARDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT S+MSL +SD  LTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTF +LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSK                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
        S TERVHD PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

XP_038903131.1 uncharacterized protein LOC120089804 isoform X1 [Benincasa hispida]5.7e-10982.38Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        Q A VSNGV VQEKLAKV+ L+E+ERHCSLEILP LFE+ASFPFQNSSARDSSGFLSTEEFDNSP+CDPHLAFLSFLEVTH TKSRMSLE+SD RLTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKL+ SKSCEG+ E++KTENTL+RIEKVLQRQSSLKMG KL  YLLDHGLMLLKFSSK                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
          TER  DMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ  DGSVAM
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

TrEMBL top hitse value%identityAlignment
A0A6J1C5A8 uncharacterized protein LOC1110085043.8e-11184.17Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        Q A VSNGVHVQEKLAKV  LDEAERHCSLEILPILFE+ASFPFQ+SS RDSSG  S EEFDNSPDCDPHLAFLS LEVTHPTKSRMSLE+SDTRLTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKLKTSK C+G FE+LKT+NTLVRIEKVLQRQSSLK+GVKLV YLLDHGLMLLKFS+K                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSV
          TERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ  DGSV
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSV

A0A6J1ENF8 uncharacterized protein LOC1114360863.5e-10479.17Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        Q A VSNGV VQEKLAKVTGLDE ERHCSLEILPILFE+ SFPFQNS A DSS FLSTE FDNSP+CDPHLAFLSFLEVTHPTK+RMSLE+SDTRLTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDS
        VIDIHVN GDA SSCIVNIDIDK   DKLKTSKS EG+FE+L+TE+TLVRIEKVLQRQSSLKMG KL+ YLLDHGLMLLKFSSK                
Subjt:  VIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDS

Query:  TEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
         EKS  ER  D  NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRV+QQ  DG VAM
Subjt:  TEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

A0A6J1FLJ9 uncharacterized protein LOC1114469091.0e-11184.67Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        QAAT+SNGVHVQEKLAKVT  DEAERH SLEILP LFE+ASFP Q+S ARDSSGF STEEFDNSPDCDPHLAFLSFLEVTHPT S+MSL +SD  LTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLL+FSSK                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
        S TERVHD PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

A0A6J1JCA6 uncharacterized protein LOC1114831621.7e-10680.68Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        Q A VSNGV VQEKLAKVT LDE ERHCSLEILPILFE+ SFPFQNS ARDSS FLSTE FDNSP+CDPHLAFLSFLEVTHPTK+RMSLE+SDTRLTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDS
        VIDIHVN GDA SSCIVNIDIDK   DKLKTSKSCEGTFE+L+TE+TLVRIEKVLQRQSSLKMG KLV YLLDHGLMLLKFSSK                
Subjt:  VIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDS

Query:  TEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
         EKS  E+  D  NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRVRQQ  DGSVAM
Subjt:  TEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

A0A6J1JTU6 uncharacterized protein LOC1114882801.1e-11083.91Show/hide
Query:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN
        QAA +SNGVHVQEKLAKVT  DE ERH SLEILP LF++ASFP Q+S ARDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT S+MSL +SD  LTCQN
Subjt:  QAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQN

Query:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK
        VIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSK                 EK
Subjt:  VIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEK

Query:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM
        S TERVHD PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  SATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSADGSVAM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G39900.1 unknown protein2.4e-3336.84Show/hide
Query:  KLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGF-------LSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQNVIDIHV
        KL K T   +  +H ++++ P+L ++A+FP +     D+S         +  +E +  P C  H   LSF++   P+K++M +++       QN I++ +
Subjt:  KLAKVTGLDEAERHCSLEILPILFEQASFPFQNSSARDSSGF-------LSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQNVIDIHV

Query:  NGGDAYSSCIVNIDIDK-DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEKSATER
         G D+Y SC+V+I+++K +  +T  S +    ++K+E+  V ++KVLQRQ+SL                                      ST+K+ +ER
Subjt:  NGGDAYSSCIVNIDIDK-DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEKSATER

Query:  VHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQ
         HD P NRWR+YKRAASFDSRKIVILFS+LSS+GTLILIYLTLRV+Q
Subjt:  VHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAGGAAGCGAACCCCGGAGGAGGGTGAGCCGAAGGGATTGGGCCGTCTTGGCCCGCTCATGTGGGCTGAGTCTCCTCCCCTCCACTCGTTCTCTGGTGCCCCTAG
ACGTCTCGGTTCCACTTGGTTCAGCCCGAATCGTCTCCGAAAGCCTAGAAACCCTAAAGGCAGGAAAAGGCCACGTCTTCCCCCATCTCATACAAATTCACTGCTGGTGT
CACATGAAGGCGCTGCTTTTGATGATGCATTTAACTTTTTCGTCTCTTTCCAGGCTGCAACTGTGTCGAATGGAGTTCACGTCCAAGAAAAACTAGCGAAAGTTACTGGA
CTTGACGAGGCGGAGCGCCATTGTTCTTTAGAAATCTTGCCGATTCTCTTTGAGCAGGCGTCGTTCCCCTTTCAAAATTCTTCGGCCCGTGATTCCTCTGGCTTTTTAAG
TACCGAGGAATTCGACAACAGTCCAGATTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAGGATGTCATTGGAAAGTTCAGATA
CCCGCTTGACTTGCCAGAACGTAATTGACATACATGTGAATGGTGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAAGCTCAAAACATCTAAA
TCCTGTGAAGGAACTTTTGAAAATTTGAAAACTGAGAATACATTGGTGCGCATAGAAAAGGTATTGCAGAGACAGTCCAGCCTTAAAATGGGGGTGAAACTTGTGCATTA
CTTGTTGGACCATGGACTAATGTTACTGAAGTTCTCGTCTAAAGATGACATTTCTTCTGATCATCATTTTCGGTTTCTTGTGTGTGATTCAACAGAAAAATCAGCAACCG
AGAGGGTTCACGATATGCCAAACAACAGGTGGAGAAAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGATTGTTATTCTCTTCTCAGTATTGTCAAGCTTGGGAACC
TTGATATTGATATATTTGACTCTGAGAGTAAGGCAGCAGAGTGCAGATGGATCTGTTGCTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCAGGAAGCGAACCCCGGAGGAGGGTGAGCCGAAGGGATTGGGCCGTCTTGGCCCGCTCATGTGGGCTGAGTCTCCTCCCCTCCACTCGTTCTCTGGTGCCCCTAG
ACGTCTCGGTTCCACTTGGTTCAGCCCGAATCGTCTCCGAAAGCCTAGAAACCCTAAAGGCAGGAAAAGGCCACGTCTTCCCCCATCTCATACAAATTCACTGCTGGTGT
CACATGAAGGCGCTGCTTTTGATGATGCATTTAACTTTTTCGTCTCTTTCCAGGCTGCAACTGTGTCGAATGGAGTTCACGTCCAAGAAAAACTAGCGAAAGTTACTGGA
CTTGACGAGGCGGAGCGCCATTGTTCTTTAGAAATCTTGCCGATTCTCTTTGAGCAGGCGTCGTTCCCCTTTCAAAATTCTTCGGCCCGTGATTCCTCTGGCTTTTTAAG
TACCGAGGAATTCGACAACAGTCCAGATTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAGGATGTCATTGGAAAGTTCAGATA
CCCGCTTGACTTGCCAGAACGTAATTGACATACATGTGAATGGTGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAAGCTCAAAACATCTAAA
TCCTGTGAAGGAACTTTTGAAAATTTGAAAACTGAGAATACATTGGTGCGCATAGAAAAGGTATTGCAGAGACAGTCCAGCCTTAAAATGGGGGTGAAACTTGTGCATTA
CTTGTTGGACCATGGACTAATGTTACTGAAGTTCTCGTCTAAAGATGACATTTCTTCTGATCATCATTTTCGGTTTCTTGTGTGTGATTCAACAGAAAAATCAGCAACCG
AGAGGGTTCACGATATGCCAAACAACAGGTGGAGAAAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGATTGTTATTCTCTTCTCAGTATTGTCAAGCTTGGGAACC
TTGATATTGATATATTTGACTCTGAGAGTAAGGCAGCAGAGTGCAGATGGATCTGTTGCTATGTGA
Protein sequenceShow/hide protein sequence
MIRKRTPEEGEPKGLGRLGPLMWAESPPLHSFSGAPRRLGSTWFSPNRLRKPRNPKGRKRPRLPPSHTNSLLVSHEGAAFDDAFNFFVSFQAATVSNGVHVQEKLAKVTG
LDEAERHCSLEILPILFEQASFPFQNSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSK
SCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKDDISSDHHFRFLVCDSTEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGT
LILIYLTLRVRQQSADGSVAM