CuGenDBv2

Spg033699 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg033699
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: zinc finger homeobox protein 4-like isoform X1
Genome location: scaffold13:38061433..38063407
RNA-Seq Expression: Spg033699
Synteny: Spg033699
Gene Ontology terms: NA
InterPro domains: NA
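
The genome location in the record is written as `scaffold:start..end`. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the assumption that the coordinates are 1-based and inclusive is not stated by the source.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(loc: str) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(loc)
    return end - start + 1

seqid, start, end = parse_location("scaffold13:38061433..38063407")
print(seqid, start, end, span_length("scaffold13:38061433..38063407"))
```

If the coordinates turn out to be 0-based or half-open, only the `+ 1` in `span_length` would need to change.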


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585540.1 hypothetical protein SDJN03_18273, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.0e-87; % identity: 65.22)
Query:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CA      F+  E +VAQ+LLE     +KS   LG IP W+LRRKRSAL SPP+S+   P PPSKK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG    KNESAIPEIGTSSSAM+VVK      S+        
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH

Query:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
                    +Q A MAEQ +N SQNFQ+P+G IP YDPSSLSPMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

KAG7020453.1 hypothetical protein SDJN02_17137, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.6e-85; % identity: 65.19)
Query:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CA      F+  E +VAQ+LLE     +KS   LG IP W+LRRKRSAL SPP+S+   P PPSKK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG    KNESAIPEIGTSSSAM+VVK      S+        
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH

Query:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP
                    +Q A MAEQ +N SQNFQ+P+G IP YDPSSLSPMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P
Subjt:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP

XP_022951578.1 uncharacterized protein LOC111454352 [Cucurbita moschata] (e value: 4.6e-88; % identity: 66)
Query:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CAS D   F+  E +VAQ+LLE     +KS   LG IP W+LRRKRSAL SPP+S+   P PPSKK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHH
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG    KNESAIPEIGTSSSAM+VVK  TV+S +  P+    
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHH

Query:  HHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
                          MAEQ +N SQNFQ+P+G IP YDPSSLSPMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

XP_023002465.1 uncharacterized protein LOC111496295 [Cucurbita maxima] (e value: 7.2e-89; % identity: 66.22)
Query:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+  QCAS D   FT  E +VAQ+LLE     +KS   LG IP W+LRRKRSAL SPP+S+   P PPSKK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH
        K SK K SL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG    KNESAIPEIGTSSSAM+VVK                
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH

Query:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
              ++ S + Q A MAEQ +N SQNFQ+P+G IP YDPSSLSPMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

XP_023537124.1 uncharacterized protein LOC111798295 [Cucurbita pepo subsp. pepo] (e value: 6.7e-87; % identity: 65.22)
Query:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CAS D   F   E +VAQ+LLE     +KS   LG IP W+LRRKRSAL SPP+S+   P PPSKK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH
        K +K KPS +K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG    KNESAIPEIGTSSSAM+VVK      S+        
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH

Query:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
                    +Q A MAEQ +N SQNFQ+P+G IP YDPSSLSPMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LRP1 Uncharacterized protein (e value: 8.6e-72; % identity: 56.72)
Query:  ASPRQCA-SSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSA-----VPAPPPP----SKKLKESSPTSPLVINSLPLSR
        +S  QC+ S DSDDF+  E  VAQ+L +LPLL+Q+S FSLGL P+W +RRKRSA+DSPPD++      P PPPP    S++ KESSPT+PL ++SLPLSR
Subjt:  ASPRQCA-SSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSA-----VPAPPPP----SKKLKESSPTSPLVINSLPLSR

Query:  SESDENTN-AKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGCKNESAIPEIGTSSS-AMKVVKFTVKSPSS
        SESDENT  AK SK K  ++K SQ+LE I++LT Q Q L+GD+EAMK+ + +LKTINSELKA+KQE++ G  N S  P+ GTS+S AM++ K TVKS  S
Subjt:  SESDENTN-AKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGCKNESAIPEIGTSSS-AMKVVKFTVKSPSS

Query:  TPEIHHHHHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK-----NNGA
          E      + H + +PSM NQ   +AE QSN  QN+Q+P+G IPLYDP SL PMGIPDLN+SLE+I  +NY++ +AA+ARQNRIQI K K     NNGA
Subjt:  TPEIHHHHHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK-----NNGA

Query:  ARVQS
         ++QS
Subjt:  ARVQS

A0A1S3BAR4 uncharacterized protein LOC103488049 (e value: 7.7e-73; % identity: 58.75)
Query:  ASPRQCA-SSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPSKKLKESSPTSPLVINSLPLSR
        +S  QC+ S DSDDF+  EL VAQ+L +LPLL+QKS FSLGL P+W +RRKRSA+DSPPD+          P P PP S++ KESSPT+PL +NSLPLSR
Subjt:  ASPRQCA-SSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPSKKLKESSPTSPLVINSLPLSR

Query:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGCKNESAIPEIGTSSS-AMKVVKFTVKSPSS
        SESDEN T AK SK K  ++K SQ+LE ID+LT Q Q L+GD+EAMK+ + +LKTINSELKA+KQE++ G  N S  PEIGTSSS AM+V K TVKS  S
Subjt:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGCKNESAIPEIGTSSS-AMKVVKFTVKSPSS

Query:  TPEIHHHHHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAAR
          E      + H + +PSM NQ    AE Q N ++N+Q+P+G IPLYDP SL PMGIPDLN+SLE+I  ++Y++ +AARARQNRIQI K K   NNGA +
Subjt:  TPEIHHHHHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAAR

Query:  VQS
        +QS
Subjt:  VQS

A0A5A7VHE1 Uncharacterized protein (e value: 7.7e-73; % identity: 58.75)
Query:  ASPRQCA-SSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPSKKLKESSPTSPLVINSLPLSR
        +S  QC+ S DSDDF+  EL VAQ+L +LPLL+QKS FSLGL P+W +RRKRSA+DSPPD+          P P PP S++ KESSPT+PL +NSLPLSR
Subjt:  ASPRQCA-SSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPSKKLKESSPTSPLVINSLPLSR

Query:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGCKNESAIPEIGTSSS-AMKVVKFTVKSPSS
        SESDEN T AK SK K  ++K SQ+LE ID+LT Q Q L+GD+EAMK+ + +LKTINSELKA+KQE++ G  N S  PEIGTSSS AM+V K TVKS  S
Subjt:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGCKNESAIPEIGTSSS-AMKVVKFTVKSPSS

Query:  TPEIHHHHHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAAR
          E      + H + +PSM NQ    AE Q N ++N+Q+P+G IPLYDP SL PMGIPDLN+SLE+I  ++Y++ +AARARQNRIQI K K   NNGA +
Subjt:  TPEIHHHHHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAAR

Query:  VQS
        +QS
Subjt:  VQS

A0A6J1GI34 uncharacterized protein LOC111454352 (e value: 2.2e-88; % identity: 66)
Query:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CAS D   F+  E +VAQ+LLE     +KS   LG IP W+LRRKRSAL SPP+S+   P PPSKK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHH
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG    KNESAIPEIGTSSSAM+VVK  TV+S +  P+    
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHH

Query:  HHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
                          MAEQ +N SQNFQ+P+G IP YDPSSLSPMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

A0A6J1KP15 uncharacterized protein LOC111496295 (e value: 3.5e-89; % identity: 66.22)
Query:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+  QCAS D   FT  E +VAQ+LLE     +KS   LG IP W+LRRKRSAL SPP+S+   P PPSKK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH
        K SK K SL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG    KNESAIPEIGTSSSAM+VVK                
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---CKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHH

Query:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
              ++ S + Q A MAEQ +N SQNFQ+P+G IP YDPSSLSPMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHRHHQIQPSMNNQAAAMAEQQSNKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCGTCTCCTCGTCAATGCGCCTCTTCCGATTCCGACGACTTCACCTCTGCAGAACTTCAGGTCGCTCAAGTCCTCCTCGAATTACCTCTCCTCGTTCAGAAATCCGA
GTTTTCTCTCGGCTTAATCCCCGCCTGGTCCCTCCGACGCAAGAGATCCGCCCTAGATTCGCCGCCGGACTCCGCCGTCCCCGCACCTCCGCCGCCCTCCAAGAAGCTCA
AGGAGTCCAGCCCTACCTCTCCTCTCGTCATCAACTCCTTGCCCCTGTCGCGGAGTGAATCCGACGAGAATACCAACGCCAAACGCTCCAAGAACAAACCCTCTCTCAAC
AAGATATCTCAGCATTTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGTTTTGAAAGGGGATGTTGAGGCTATGAAGCAACGTTACAATCATTTGAAAACTATCAA
TTCAGAGCTGAAGGCTCAAAAGCAAGAGATGATTCTGGGTTGTAAGAACGAATCGGCCATTCCAGAAATAGGGACCTCAAGTTCAGCCATGAAAGTCGTGAAGTTCACTG
TGAAATCTCCATCTTCAACTCCCGAAATTCACCACCACCACCACCATCGTCATCATCAAATTCAACCGTCGATGAACAATCAGGCGGCGGCCATGGCGGAACAACAGAGT
AACAAGAGTCAGAATTTCCAAGTCCCAATGGGGGCGATTCCTCTGTATGATCCTTCTTCACTGAGCCCAATGGGGATTCCGGATTTGAACGTGTCTCTTGAAGAAATCAG
TCAGAGGAATTACTCGAGAATCATGGCCGCTCGAGCAAGACAGAACAGGATTCAAATCTGCAAGACCAAGAACAACGGAGCCGCCAGAGTCCAGAGTCCTAATCCTTGTA
TGTGA
mRNA sequence
ATGGCGTCTCCTCGTCAATGCGCCTCTTCCGATTCCGACGACTTCACCTCTGCAGAACTTCAGGTCGCTCAAGTCCTCCTCGAATTACCTCTCCTCGTTCAGAAATCCGA
GTTTTCTCTCGGCTTAATCCCCGCCTGGTCCCTCCGACGCAAGAGATCCGCCCTAGATTCGCCGCCGGACTCCGCCGTCCCCGCACCTCCGCCGCCCTCCAAGAAGCTCA
AGGAGTCCAGCCCTACCTCTCCTCTCGTCATCAACTCCTTGCCCCTGTCGCGGAGTGAATCCGACGAGAATACCAACGCCAAACGCTCCAAGAACAAACCCTCTCTCAAC
AAGATATCTCAGCATTTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGTTTTGAAAGGGGATGTTGAGGCTATGAAGCAACGTTACAATCATTTGAAAACTATCAA
TTCAGAGCTGAAGGCTCAAAAGCAAGAGATGATTCTGGGTTGTAAGAACGAATCGGCCATTCCAGAAATAGGGACCTCAAGTTCAGCCATGAAAGTCGTGAAGTTCACTG
TGAAATCTCCATCTTCAACTCCCGAAATTCACCACCACCACCACCATCGTCATCATCAAATTCAACCGTCGATGAACAATCAGGCGGCGGCCATGGCGGAACAACAGAGT
AACAAGAGTCAGAATTTCCAAGTCCCAATGGGGGCGATTCCTCTGTATGATCCTTCTTCACTGAGCCCAATGGGGATTCCGGATTTGAACGTGTCTCTTGAAGAAATCAG
TCAGAGGAATTACTCGAGAATCATGGCCGCTCGAGCAAGACAGAACAGGATTCAAATCTGCAAGACCAAGAACAACGGAGCCGCCAGAGTCCAGAGTCCTAATCCTTGTA
TGTGA
Protein sequence
MASPRQCASSDSDDFTSAELQVAQVLLELPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPSKKLKESSPTSPLVINSLPLSRSESDENTNAKRSKNKPSLN
KISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGCKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHHHHHRHHQIQPSMNNQAAAMAEQQS
NKSQNFQVPMGAIPLYDPSSLSPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSPNPCM
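
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. As a minimal sketch, the snippet below translates a CDS string and checks the first ten codons of this record against the start of the listed protein. The function name `translate` is hypothetical, and the use of the standard genetic code (NCBI translation table 1) is an assumption, since the record does not say which table was used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
# ASSUMPTION: the record does not say which table applies to this CDS.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed in this record.
print(translate("ATGGCGTCTCCTCGTCAATGCGCCTCTTCC"))  # MASPRQCASS
```

Passing the full CDS, with its line breaks, to `translate` should allow the complete protein to be compared against the listed sequence in the same way.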