; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004470 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004470
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzinc finger homeobox protein 4-like isoform X1
Genome locationchr6:4217034..4218393
RNA-Seq ExpressionLag0004470
SyntenyLag0004470
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585540.1 hypothetical protein SDJN03_18273, partial [Cucurbita argyrosperma subsp. sororia]6.6e-8766.78Show/hide
Query:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CA      F+  E +VA+ILLEF    +KS   LG IP W+LRRKRSAL SPP+S+   P PP+KK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHQ
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG   SKNESAIPEIGTSSSAM+VVK  TV+S +  P     
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHQ

Query:  HHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
                  A MAEQ +N SQNFQ +P+G IP YDPSSL PMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

KAG7020453.1 hypothetical protein SDJN02_17137, partial [Cucurbita argyrosperma subsp. argyrosperma]2.8e-8566.78Show/hide
Query:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CA      F+  E +VA+ILLEF    +KS   LG IP W+LRRKRSAL SPP+S+   P PP+KK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHQ
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG   SKNESAIPEIGTSSSAM+VVK  TV+S +  P     
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHQ

Query:  HHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP
                  A MAEQ +N SQNFQ +P+G IP YDPSSL PMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P
Subjt:  HHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP

XP_022951578.1 uncharacterized protein LOC111454352 [Cucurbita moschata]1.0e-8767.35Show/hide
Query:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CAS D   F+  E +VA+ILLEF    +KS   LG IP W+LRRKRSAL SPP+S+   P PP+KK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG   SKNESAIPEIGTSSSAM+VVK      S+        
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH

Query:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
           HQ +P   MAEQ +N SQNFQ +P+G IP YDPSSL PMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

XP_023002465.1 uncharacterized protein LOC111496295 [Cucurbita maxima]6.6e-8767.35Show/hide
Query:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+  QCAS D   F   E +VA+ILLEF    +KS   LG IP W+LRRKRSAL SPP+S+   P PP+KK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH
        K SK K SL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG   SKNESAIPEIGTSSSAM+VVK      S+        
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH

Query:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
          H Q  P   MAEQ +N SQNFQ +P+G IP YDPSSL PMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

XP_023537124.1 uncharacterized protein LOC111798295 [Cucurbita pepo subsp. pepo]1.1e-8666.78Show/hide
Query:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CAS D   F   E +VA+ILLEF    +KS   LG IP W+LRRKRSAL SPP+S+   P PP+KK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHQ
        K +K KPS +K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG   SKNESAIPEIGTSSSAM+VVK  TV+S +  P     
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVK-FTVKSPSSTPEIHHQ

Query:  HHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
                  A MAEQ +N SQNFQ +P+G IP YDPSSL PMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

TrEMBL top hitse value%identityAlignment
A0A0A0LRP1 Uncharacterized protein2.4e-6655Show/hide
Query:  ASPRQCA-SSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSA-----VPAPPPP----AKKLKESSPTSPLVINSLPLSR
        +S  QC+ S DS  F+  E  VA+IL + PLL+Q+S FSLGL P+W +RRKRSA+DSPPD++      P PPPP    +++ KESSPT+PL ++SLPLSR
Subjt:  ASPRQCA-SSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSA-----VPAPPPP----AKKLKESSPTSPLVINSLPLSR

Query:  SESDENTN-AKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGSKNESAIPEIGTSSS-AMKVVKFTVKSPSS
        SESDENT  AK SK K  ++K SQ+LE I++LT Q Q L+GD+EAMK+ + +LKTINSELKA+KQE++ G  N S  P+ GTS+S AM++ K TVKS  S
Subjt:  SESDENTN-AKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGSKNESAIPEIGTSSS-AMKVVKFTVKSPSS

Query:  TPEIHHQHHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK-----NNGAARVQS
          E +H        N    +AE QSN  QN+Q +P+G IP+YDP SLGPMGIPDLN+SLE+I  +NY++ +AA+ARQNRIQI K K     NNGA ++QS
Subjt:  TPEIHHQHHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK-----NNGAARVQS

A0A1S3BAR4 uncharacterized protein LOC1034880492.1e-6757.05Show/hide
Query:  ASPRQCA-SSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPAKKLKESSPTSPLVINSLPLSR
        +S  QC+ S DS  F+  EL VA+IL + PLL+QKS FSLGL P+W +RRKRSA+DSPPD+          P P PP +++ KESSPT+PL +NSLPLSR
Subjt:  ASPRQCA-SSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPAKKLKESSPTSPLVINSLPLSR

Query:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGSKNESAIPEIGTSSS-AMKVVKFTVKSPSS
        SESDEN T AK SK K  ++K SQ+LE ID+LT Q Q L+GD+EAMK+ + +LKTINSELKA+KQE++ G  N S  PEIGTSSS AM+V K TVKS  S
Subjt:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGSKNESAIPEIGTSSS-AMKVVKFTVKSPSS

Query:  TPEIHHQHHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAARVQS
          E +H        N     AE Q N ++N+Q +P+G IP+YDP SLGPMGIPDLN+SLE+I  ++Y++ +AARARQNRIQI K K   NNGA ++QS
Subjt:  TPEIHHQHHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAARVQS

A0A5A7VHE1 Uncharacterized protein2.1e-6757.05Show/hide
Query:  ASPRQCA-SSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPAKKLKESSPTSPLVINSLPLSR
        +S  QC+ S DS  F+  EL VA+IL + PLL+QKS FSLGL P+W +RRKRSA+DSPPD+          P P PP +++ KESSPT+PL +NSLPLSR
Subjt:  ASPRQCA-SSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDS--------AVPAP-PPPAKKLKESSPTSPLVINSLPLSR

Query:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGSKNESAIPEIGTSSS-AMKVVKFTVKSPSS
        SESDEN T AK SK K  ++K SQ+LE ID+LT Q Q L+GD+EAMK+ + +LKTINSELKA+KQE++ G  N S  PEIGTSSS AM+V K TVKS  S
Subjt:  SESDEN-TNAKRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGSKNESAIPEIGTSSS-AMKVVKFTVKSPSS

Query:  TPEIHHQHHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAARVQS
          E +H        N     AE Q N ++N+Q +P+G IP+YDP SLGPMGIPDLN+SLE+I  ++Y++ +AARARQNRIQI K K   NNGA ++QS
Subjt:  TPEIHHQHHRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTK---NNGAARVQS

A0A6J1GI34 uncharacterized protein LOC1114543524.9e-8867.35Show/hide
Query:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+   CAS D   F+  E +VA+ILLEF    +KS   LG IP W+LRRKRSAL SPP+S+   P PP+KK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH
        K SK KPSL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG   SKNESAIPEIGTSSSAM+VVK      S+        
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH

Query:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
           HQ +P   MAEQ +N SQNFQ +P+G IP YDPSSL PMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

A0A6J1KP15 uncharacterized protein LOC1114962953.2e-8767.35Show/hide
Query:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA
        MA+  QCAS D   F   E +VA+ILLEF    +KS   LG IP W+LRRKRSAL SPP+S+   P PP+KK+KESSPTSPLV+NSLPLSRSESDE+TNA
Subjt:  MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNA

Query:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH
        K SK K SL+K SQ +EAIDELTKQNQ LKG+ EAMKQ YNHLK INSELKA+KQEMILG   SKNESAIPEIGTSSSAM+VVK      S+        
Subjt:  KRSKNKPSLNKISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILG---SKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQH

Query:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM
          H Q  P   MAEQ +N SQNFQ +P+G IP YDPSSL PMGIPDLN+SLEEI+QRNYSR MAARAR+NRIQICK KNNG  ++Q+P  NPCM
Subjt:  HRHHQNNPAAAMAEQQSNKSQNFQVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSP--NPCM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTCCTCGTCAATGCGCCTCTTCCGATTCCGGCCACTTCAACTCTTCAGAACTCCAGGTCGCGAAAATCCTCCTCGAATTCCCTCTCCTCGTTCAGAAATCCGA
GTTTTCTCTCGGTTTAATCCCCGCCTGGTCCCTCCGTCGCAAGAGATCCGCCCTAGATTCGCCGCCGGACTCCGCCGTCCCCGCGCCTCCGCCGCCCGCCAAGAAGCTCA
AGGAGTCCAGCCCTACCTCTCCTCTCGTCATCAACTCCTTGCCCCTGTCGCGGAGTGAATCCGACGAGAACACCAACGCCAAACGCTCCAAGAACAAACCCTCTCTCAAT
AAGATTTCTCAGCATTTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGTTTTGAAAGGGGATGTTGAGGCTATGAAGCAACGTTACAATCATTTGAAAACTATCAA
TTCAGAGCTGAAGGCTCAAAAGCAAGAGATGATTCTGGGTTCTAAGAACGAATCGGCCATTCCAGAAATAGGGACCTCAAGTTCAGCCATGAAAGTCGTGAAGTTCACTG
TGAAATCTCCATCTTCAACTCCAGAAATTCACCACCAGCACCACCGTCATCATCAAAACAATCCGGCGGCGGCGATGGCGGAACAACAGAGTAACAAGAGTCAGAATTTC
CAAGTAGTCCCAATGGGGGCGATTCCTGTGTATGATCCTTCTTCACTGGGGCCAATGGGGATTCCGGATTTGAACGTGTCTCTTGAAGAAATAAGTCAGAGGAATTACTC
GAGAATCATGGCCGCTCGAGCAAGACAGAACAGGATTCAAATCTGCAAGACCAAGAACAACGGAGCCGCCAGAGTCCAGAGTCCTAATCCCTGTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTCCTCGTCAATGCGCCTCTTCCGATTCCGGCCACTTCAACTCTTCAGAACTCCAGGTCGCGAAAATCCTCCTCGAATTCCCTCTCCTCGTTCAGAAATCCGA
GTTTTCTCTCGGTTTAATCCCCGCCTGGTCCCTCCGTCGCAAGAGATCCGCCCTAGATTCGCCGCCGGACTCCGCCGTCCCCGCGCCTCCGCCGCCCGCCAAGAAGCTCA
AGGAGTCCAGCCCTACCTCTCCTCTCGTCATCAACTCCTTGCCCCTGTCGCGGAGTGAATCCGACGAGAACACCAACGCCAAACGCTCCAAGAACAAACCCTCTCTCAAT
AAGATTTCTCAGCATTTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGTTTTGAAAGGGGATGTTGAGGCTATGAAGCAACGTTACAATCATTTGAAAACTATCAA
TTCAGAGCTGAAGGCTCAAAAGCAAGAGATGATTCTGGGTTCTAAGAACGAATCGGCCATTCCAGAAATAGGGACCTCAAGTTCAGCCATGAAAGTCGTGAAGTTCACTG
TGAAATCTCCATCTTCAACTCCAGAAATTCACCACCAGCACCACCGTCATCATCAAAACAATCCGGCGGCGGCGATGGCGGAACAACAGAGTAACAAGAGTCAGAATTTC
CAAGTAGTCCCAATGGGGGCGATTCCTGTGTATGATCCTTCTTCACTGGGGCCAATGGGGATTCCGGATTTGAACGTGTCTCTTGAAGAAATAAGTCAGAGGAATTACTC
GAGAATCATGGCCGCTCGAGCAAGACAGAACAGGATTCAAATCTGCAAGACCAAGAACAACGGAGCCGCCAGAGTCCAGAGTCCTAATCCCTGTATGTGA
Protein sequenceShow/hide protein sequence
MASPRQCASSDSGHFNSSELQVAKILLEFPLLVQKSEFSLGLIPAWSLRRKRSALDSPPDSAVPAPPPPAKKLKESSPTSPLVINSLPLSRSESDENTNAKRSKNKPSLN
KISQHLEAIDELTKQNQVLKGDVEAMKQRYNHLKTINSELKAQKQEMILGSKNESAIPEIGTSSSAMKVVKFTVKSPSSTPEIHHQHHRHHQNNPAAAMAEQQSNKSQNF
QVVPMGAIPVYDPSSLGPMGIPDLNVSLEEISQRNYSRIMAARARQNRIQICKTKNNGAARVQSPNPCM