CuGenDBv2

Lag0024626 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024626
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CHANNEL_COLICIN domain-containing protein
Genome location: chr10:4471650..4473750
RNA-Seq Expression: Lag0024626
Synteny: Lag0024626
Gene Ontology terms: NA
InterPro domains: NA
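
The genome location above uses a `sequence:start..end` notation. A minimal Python sketch for splitting such a string into its parts follows. The function names `parse_location` and `span_length` are hypothetical, and treating the coordinates as inclusive is an assumption, since this page does not state the convention.

```python
import re

def parse_location(location):
    """Split a string like 'chr10:4471650..4473750' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)

def span_length(start, end):
    # Assumption: both coordinates are inclusive.
    return end - start + 1

name, start, end = parse_location("chr10:4471650..4473750")
print(name, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2101 bases on chr10.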


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573128.1 hypothetical protein SDJN03_27015, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.0e-119 | % identity: 90.61
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSD NPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       D GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        KSPLKVHKDVVMKPALP RDSF DLPKDSD+YLAKR+EAFRKYEI
Subjt:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

XP_022137036.1 uncharacterized protein LOC111008597 isoform X1 [Momordica charantia] | e value: 2.5e-113 | % identity: 88.7
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE T +GE+PKPSD NPSDQSDHSQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  QGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV
        Q   GDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQ+P+    KSPLKV
Subjt:  QGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV

Query:  HKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        HKDV+MKPALP RDSF DLPKDSD+YLAKRKEAFRKYEI
Subjt:  HKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

XP_022955309.1 uncharacterized protein LOC111457310 [Cucurbita moschata] | e value: 8.9e-119 | % identity: 90.2
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSD NPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       D G+QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        KSPLKVHKDVVMKPALP RDSF DLPKDSD+YLAKR+EAFRKYEI
Subjt:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

XP_022994449.1 uncharacterized protein LOC111490167 [Cucurbita maxima] | e value: 5.2e-119 | % identity: 90.61
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSD NPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       D GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSP 
Subjt:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        KSPLKVHKDVVMKPALP RDSF DLPKDSD+YLAKRKEAFRKYEI
Subjt:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

XP_023542742.1 uncharacterized protein LOC111802563 [Cucurbita pepo subsp. pepo] | e value: 1.4e-119 | % identity: 91.02
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSD NPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       D GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        KSPLKVHKDVVMKPALP RDSF DLPKDSD+YLAKRKEAFRKYEI
Subjt:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E072 uncharacterized protein LOC107991313 | e value: 2.7e-113 | % identity: 87.85
Query:  MSNPAEPTPEGEE------PKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
        MSNP E T EGEE      PKPSD NPSD  D SQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
Subjt:  MSNPAEPTPEGEE------PKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK

Query:  EEI--QGDQGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS
        E+I  Q D+ D GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLP+  
Subjt:  EEI--QGDQGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS

Query:  PQKSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
          KSPLKVHKDV MKPALP RDSF DLPKDSDIYLAKRKEAFRKYEI
Subjt:  PQKSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

A0A5D3BJS0 Uncharacterized protein | e value: 2.7e-113 | % identity: 87.85
Query:  MSNPAEPTPEGEE------PKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
        MSNP E T EGEE      PKPSD NPSD  D SQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
Subjt:  MSNPAEPTPEGEE------PKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK

Query:  EEI--QGDQGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS
        E+I  Q D+ D GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLP+  
Subjt:  EEI--QGDQGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS

Query:  PQKSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
          KSPLKVHKDV MKPALP RDSF DLPKDSDIYLAKRKEAFRKYEI
Subjt:  PQKSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

A0A6J1C586 uncharacterized protein LOC111008597 isoform X1 | e value: 1.2e-113 | % identity: 88.7
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE T +GE+PKPSD NPSDQSDHSQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  QGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV
        Q   GDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQ+P+    KSPLKV
Subjt:  QGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV

Query:  HKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        HKDV+MKPALP RDSF DLPKDSD+YLAKRKEAFRKYEI
Subjt:  HKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

A0A6J1GVV9 uncharacterized protein LOC111457310 | e value: 4.3e-119 | % identity: 90.2
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSD NPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       D G+QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        KSPLKVHKDVVMKPALP RDSF DLPKDSD+YLAKR+EAFRKYEI
Subjt:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

A0A6J1K2W0 uncharacterized protein LOC111490167 | e value: 2.5e-119 | % identity: 90.61
Query:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSD NPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       D GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSP 
Subjt:  Q------GDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
        KSPLKVHKDVVMKPALP RDSF DLPKDSD+YLAKRKEAFRKYEI
Subjt:  KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G50410.1 unknown protein | e value: 4.3e-55 | % identity: 49.4
Query:  MSNPAEPTPEGE-------EPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMR---L
        MSN ++ T E E       E      N + +++ SQEWE MARAW+ +FP+AKA  S  EVE WI +N  SLP +L+ MPRS++  RL+SIQ+ MR    
Subjt:  MSNPAEPTPEGE-------EPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMR---L

Query:  STQGKEEIQGDQGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLP
        S Q ++ ++ DQ  P        ARFQRTDQW+PVYSWLESL   E+VKSK+IS+WL  NP ++ +L SRHSRYHL HY+KKCHLKILKRKEKK + +L 
Subjt:  STQGKEEIQGDQGDPGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLP

Query:  EKSPQKSPLKVHKDVVMK-PALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
            + + ++VHK+   K  A  + D   ++PKDSD+Y  K+KEA R++EI
Subjt:  EKSPQKSPLKVHKDVVMK-PALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI
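
The unlabeled middle row of each alignment block above can be tallied to see how a segment contributes to the reported % identity. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), which this page does not spell out. The helper names are hypothetical, and the example strings are the last segment of the first GenBank hit.

```python
def count_identities(midline):
    """Count alignment columns whose midline shows a residue letter."""
    return sum(1 for ch in midline if ch.isalpha())

def count_positives(midline):
    """Count identical columns plus conservative substitutions ('+')."""
    return sum(1 for ch in midline if ch.isalpha() or ch == "+")

# Last segment of the first GenBank hit (query row and midline row).
query = "KSPLKVHKDVVMKPALPTRDSFCDLPKDSDIYLAKRKEAFRKYEI"
midline = "KSPLKVHKDVVMKPALP RDSF DLPKDSD+YLAKR+EAFRKYEI"

identities = count_identities(midline)
positives = count_positives(midline)
print(f"{identities}/{len(query)} identical, {positives}/{len(query)} positive")
```

A per-segment figure like this will not generally equal the % identity listed for the hit, which covers the whole aligned region.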


Sequences
CDS sequence
ATGTCGAATCCGGCGGAACCAACCCCAGAAGGAGAAGAGCCTAAGCCATCTGATCCAAACCCTAGCGACCAAAGCGATCACTCTCAAGAATGGGAAATCATGGCGCGGGC
TTGGCTCTGCTCTTTCCCCGAGGCCAAAGCTGGGTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAACCATGGCTCTCTACCTGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCTATCCAAAATTTAATGAGACTGTCCACTCAGGGAAAGGAAGAGATTCAGGGTGATCAGGGTGATCCGGGCGATCAGGGCGACCTT
CCACATGCTCGATTTCAACGCACTGACCAATGGATACCAGTTTATTCTTGGTTAGAATCTCTACAGCAAGATGAGGTTGTTAAGTCAAAGGAGATATCTGATTGGTTAAC
TGAAAATCCCACCATCAGAGATCAGTTGTGTTCAAGACATTCTCGTTATCATTTAATGCACTACATCAAAAAGTGTCATTTGAAGATATTGAAAAGAAAAGAAAAGAAAA
AGGTTTCTCAGCTGCCTGAAAAGTCTCCTCAAAAGTCTCCTCTCAAAGTTCACAAGGATGTTGTGATGAAACCAGCGTTGCCTACACGTGATTCATTTTGCGATCTACCA
AAAGACAGTGACATATACTTGGCAAAACGAAAGGAAGCCTTTCGAAAATATGAAATTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCGAAGTCTCAAGGAGTCAAAT
AACTGACGAAACTTTGGGGACTGCCATTACTTTGAGTATGGGACATGGGACCAAGTTTTTAGCCATTGTCAGGTTTTCAACTTGTTCACTTGTGAATCTTGTGGCCCTAA
TGCCCTGCTTTCTCACTCCAGAATAG
mRNA sequence
ATGTCGAATCCGGCGGAACCAACCCCAGAAGGAGAAGAGCCTAAGCCATCTGATCCAAACCCTAGCGACCAAAGCGATCACTCTCAAGAATGGGAAATCATGGCGCGGGC
TTGGCTCTGCTCTTTCCCCGAGGCCAAAGCTGGGTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAACCATGGCTCTCTACCTGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCTATCCAAAATTTAATGAGACTGTCCACTCAGGGAAAGGAAGAGATTCAGGGTGATCAGGGTGATCCGGGCGATCAGGGCGACCTT
CCACATGCTCGATTTCAACGCACTGACCAATGGATACCAGTTTATTCTTGGTTAGAATCTCTACAGCAAGATGAGGTTGTTAAGTCAAAGGAGATATCTGATTGGTTAAC
TGAAAATCCCACCATCAGAGATCAGTTGTGTTCAAGACATTCTCGTTATCATTTAATGCACTACATCAAAAAGTGTCATTTGAAGATATTGAAAAGAAAAGAAAAGAAAA
AGGTTTCTCAGCTGCCTGAAAAGTCTCCTCAAAAGTCTCCTCTCAAAGTTCACAAGGATGTTGTGATGAAACCAGCGTTGCCTACACGTGATTCATTTTGCGATCTACCA
AAAGACAGTGACATATACTTGGCAAAACGAAAGGAAGCCTTTCGAAAATATGAAATTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCGAAGTCTCAAGGAGTCAAAT
AACTGACGAAACTTTGGGGACTGCCATTACTTTGAGTATGGGACATGGGACCAAGTTTTTAGCCATTGTCAGGTTTTCAACTTGTTCACTTGTGAATCTTGTGGCCCTAA
TGCCCTGCTTTCTCACTCCAGAATAG
Protein sequence
MSNPAEPTPEGEEPKPSDPNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGDQGDPGDQGDL
PHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKVHKDVVMKPALPTRDSFCDLP
KDSDIYLAKRKEAFRKYEIGVGEVACPQLFEVSRSQITDETLGTAITLSMGHGTKFLAIVRFSTCSLVNLVALMPCFLTPE
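
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal Python sketch follows, assuming the standard genetic code applies to this gene. The function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGTCGAATCCGGCGGAACCAACCCCAGAA"))
```

The result can be compared with the first ten residues of the protein sequence; passing the full CDS (line breaks included) would allow the same comparison over the whole protein.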