CuGenDBv2

Lag0005946 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005946
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function, DUF601
Genome location: chr6:34193612..34197185
RNA-Seq Expression: Lag0005946
Synteny: Lag0005946
Gene Ontology terms: NA
InterPro domains: NA
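
The genome location in this record is written as `chromosome:start..end`. A minimal sketch of parsing that string and deriving the span of the gene follows. The helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something the record states.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr6:34193612..34197185")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```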


Homology
GenBank top hits (e-value, % identity, alignment)
KGN66039.2 hypothetical protein Csa_020117 [Cucumis sativus] | e-value 2.7e-19 | identity 36.53%
Query:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR
        +D+    T  TS      RR A A RRT   +            I  F     Y S  +EVE L + F  W+GL     EGE G +DP+QG        R
Subjt:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR

Query:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI
         AFT+ + S   +  ++N L LN VPR EL++A  RL EA   L   +A   S   +++  KAQL EAK+ L  A  L E+F KT EF  MQ++I   G+
Subjt:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI

Query:  KWAQKRYSKHHPTVDGSFI
         W+ ++ S  HP +D SF+
Subjt:  KWAQKRYSKHHPTVDGSFI

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e-value 3.5e-27 | identity 40.64%
Query:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        + SD        +  S +    L  LRR + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +    
Subjt:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLINGPTSVKKWKNGWFFVSGNWLEKTEDG-CFFGVPMRFG
         L+ +    S    L  VD  L+     R       FY  + K  G ++ GPTS+K W   WF+ SG WL K E G  FF VP RFG
Subjt:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLINGPTSVKKWKNGWFFVSGNWLEKTEDG-CFFGVPMRFG

XP_031737075.1 uncharacterized protein LOC105435920 isoform X1 [Cucumis sativus] | e-value 2.7e-19 | identity 36.53%
Query:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR
        +D+    T  TS      RR A A RRT   +            I  F     Y S  +EVE L + F  W+GL     EGE G +DP+QG        R
Subjt:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR

Query:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI
         AFT+ + S   +  ++N L LN VPR EL++A  RL EA   L   +A   S   +++  KAQL EAK+ L  A  L E+F KT EF  MQ++I   G+
Subjt:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI

Query:  KWAQKRYSKHHPTVDGSFI
         W+ ++ S  HP +D SF+
Subjt:  KWAQKRYSKHHPTVDGSFI

XP_031737083.1 uncharacterized protein LOC105435920 isoform X3 [Cucumis sativus] | e-value 2.7e-19 | identity 36.53%
Query:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR
        +D+    T  TS      RR A A RRT   +            I  F     Y S  +EVE L + F  W+GL     EGE G +DP+QG        R
Subjt:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR

Query:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI
         AFT+ + S   +  ++N L LN VPR EL++A  RL EA   L   +A   S   +++  KAQL EAK+ L  A  L E+F KT EF  MQ++I   G+
Subjt:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI

Query:  KWAQKRYSKHHPTVDGSFI
         W+ ++ S  HP +D SF+
Subjt:  KWAQKRYSKHHPTVDGSFI

XP_031737089.1 uncharacterized protein LOC105435920 isoform X5 [Cucumis sativus] | e-value 2.7e-19 | identity 36.53%
Query:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR
        +D+    T  TS      RR A A RRT   +            I  F     Y S  +EVE L + F  W+GL     EGE G +DP+QG        R
Subjt:  EDEAPRVTAETS------RRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLR

Query:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI
         AFT+ + S   +  ++N L LN VPR EL++A  RL EA   L   +A   S   +++  KAQL EAK+ L  A  L E+F KT EF  MQ++I   G+
Subjt:  DAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEAKAHLGSADFLTEEFKKTSEFYAMQDEIWNDGI

Query:  KWAQKRYSKHHPTVDGSFI
         W+ ++ S  HP +D SF+
Subjt:  KWAQKRYSKHHPTVDGSFI

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9FHJ6 Uncharacterized protein | e-value 6.2e-22 | identity 33.59%
Query:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL
        D  PR C ++ +   +  LSF            MA EGS    V S D+ E  SD    +      P IS SS+G  P         LS  + AD ++  
Subjt:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  +P+R
Subjt:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR

A0A2N9H8T4 Uncharacterized protein | e-value 2.4e-21 | identity 33.2%
Query:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL
        D  P  C ++ +   +  LSF            MA EGS    V S D+ E  SD    +      P IS SS+G  P         LS  + AD ++  
Subjt:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  +P+R
Subjt:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR

A0A2N9HSS2 Uncharacterized protein | e-value 6.2e-22 | identity 33.59%
Query:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL
        D  PR C ++ +   +  LSF            MA EGS    V S D+ E  SD    +      P IS SS+G  P         LS  + AD ++  
Subjt:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  +P+R
Subjt:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR

A0A2N9IMR5 Uncharacterized protein | e-value 6.2e-22 | identity 33.59%
Query:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL
        D  PR C ++ +   +  LSF            MA EGS    V S D+ E  SD    +      P IS SS+G  P         LS  + AD ++  
Subjt:  DGTPRSCHVKLLVSAILLLSFAT---------HMASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  +P+R
Subjt:  NPAFGDLFYYASTKKG--TLINGPTSVKKWKNGWFFVSG-NW--LEKTEDGCFFGVPMR

A0A6J1DXS5 uncharacterized protein LOC111025502 | e-value 1.7e-27 | identity 40.64%
Query:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        + SD        +  S +    L  LRR + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +    
Subjt:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLINGPTSVKKWKNGWFFVSGNWLEKTEDG-CFFGVPMRFG
         L+ +    S    L  VD  L+     R       FY  + K  G ++ GPTS+K W   WF+ SG WL K E G  FF VP RFG
Subjt:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLINGPTSVKKWKNGWFFVSGNWLEKTEDG-CFFGVPMRFG

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G46696.1 Protein of unknown function, DUF601 | e-value 3.4e-04 | identity 34.83%
Query:  DGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPN
        DG+   + + LS   T+ RL  LR  + IP  + L  P      ENPP G    +   F   G+  PLP  L D +   G+A  QL PN
Subjt:  DGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPN

AT1G51172.1 unknown protein | e-value 3.8e-08 | identity 27.84%
Query:  SSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        S DG+   + + LS   T  RL  LR  + IP  + L  P    + E+PP G    +   F + G+  PLP  L D +   G+A  QL PN    ++   
Subjt:  SSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLI--NGPTSVKKWKNGWFFVSGNWLEKTEDGCFF
        TL      G  + + DFL L+ + ++    +  ++ S +KG  +  + P   + W+  +FF   N L   E    F
Subjt:  TLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLI--NGPTSVKKWKNGWFFVSGNWLEKTEDGCFF


Sequences
CDS sequence
ATGCCAAAATCTGGCCAAGCGCTCACCTGCCCTTCACGTGACACCAACAATGAATTTGTTGAGATATGGAGAAGACGTGACCTGAAAAGAAAGAAACCTTTTATACCTCT
CTCGCTAGGGTTTTTCCCTTTCCGCGTCGTCCACCGCACGTTGCCTCCTGCTACTGGGCCCAAGGAGATTGACCCGGGCAACCCAAGTGGCCAGTGGGGCGGCGCACCTG
ACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCGGCAAAAAGCCGAGGTCGATGTCACCTAACGGGACCCCACGGTCCTGTCATGTCAAGGAGAGGAGA
GAGAAAAATACGTGGGCTGAGGCTCATCTGCCTCGGTCTGGCGTGTCAGGCGGCGCACCTGACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCGGCAA
AAAGCCGAGGCCGATGTCACCTGACGGGACCCCACGGTCCTGTCATGTCAAGGAGAGGAGAGAGAAAAATACGTGGGCCGAGGCTCATCTGCCTCGGTCTGGCGTGTCAG
GCGGCGCACCTGACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCAGCAAAAAGCCAAGGCCGATGTCACCTGACGGGACCCCACGGTCCTGTCATGTC
AAGCTTCTAGTGAGTGCTATCCTTTTACTTTCTTTTGCTACGCATATGGCCAGCGAAGGCTCTGTTACCTCTCCGGACGTGGAGGAATCTTATTCCGATGACGGTCCTTC
AAGCTCGGGCTGCTTTGTGGACCCGGAGATTTCGGATAGCAGTGATGGGGAGCCCCCTGCACACTCATCGGACTTATCATCCTCGTTGACCGCAGACCGCTTAGAGTTCT
TGCGGCGCAAGTATGATATTCCCGACGATGTGCATCTTCGGCTCCCCAATGCTGACGAGAACTTTGAGAATCCCCCGGATGGAGAGGTTGCTTTTTACCATGCCATGTTT
AAGTTTGGGGTTCGCTTGCCACTGCCATTGTTTTTGCAAGACTTCCTGGTCTGTACGGGTCTAGCCCCTGCCCAGCTTGCTCCAAACGGGTGGTGCCACCTCATCGGTTG
CTTTACCCTTTGGGCGATGCACGGTGGGGGATCTCTAATGACCGTTGACGATTTCTTGTCCTTGCATACCATCAATCGCAATCCCGCTTTTGGTGACCTTTTTTATTACG
CAAGTACCAAAAAAGGCACCTTAATCAACGGACCCACTTCCGTAAAAAAGTGGAAAAATGGTTGGTTCTTTGTTAGTGGCAATTGGCTTGAAAAAACGGAAGACGGTTGT
TTTTTCGGGGTTCCAATGAGGTTTGGAGAATATGGTGAGCTCTCTATTCTCCTCCCCATTGGAATCCAATCTTTGTTGCTTTTCTTTTCCTTTGCTGACTTTTCTCTTTG
CTTTTCAGTGCCTCGCAACGTTCGACGCTCCCCGACCGCTAAGAAGTTTGCCAAATACGTCCTGACCCTCGAAAAGATTAACCGCCACGGTCCCTTTTTAGTCGATCAAA
GTGTCCTCGAAGCATCTGGGCTAGCCAGGCGCCGCACCATCAACTCTGAAGGTAAACCCCCAACGCCTCTTCCTTCTGGTCGACCTTGGCACGGTCGGCCCCGGCCTGGT
CGTCCTCGGCACGGTCGGCCTCGGCTTGGTCACCAAATTTTTTTTTCCAAAAGTTACATTGCTTTTCCACATTCTCTCACAGAAATGGCCTTCCGTGGAATGTACGATTC
TCAGCGGAAGAGACGTGAAGCACGTAACAGGGCTGGAACCTCTCGGGCCTCTGTGGACCTAATCGAGGATGAGGCTCCACGGGTTACTGCCGAGACCTCTCGTCGTCCTG
CAGCTGCTACCCGCAGAACCCGGTATCAGACGCGCTCCTCGGTCATCGAGACAGATCTTAGCACAGGCATCCCGGTCTTTGCCCTTCTCGAGGACTACGGAAGCGGCGGC
AATGAGGTAGAGGTCCTAACCCAAAACTTCATGTGCTGGCAAGGGTTGCAATCTCGGAGGCCAGAAGGTGAGCTAGGAGTTGAGGATCCTGCCCAAGGAATGCAAGAATT
CCAGAGGCACCTTCGTGATGCTTTCACGAAGGCCACTGCCTCTACGATTAGCCTGACAAAAAAAATCAACGAGCTTCAACTGAACAGCGTCCCACGGAGTGAGCTCATTC
AAGCCCAAGAGAGGCTCAACGAGGCCAACCGCCTGCTAGAGGAAGTGCGAGCGAAGCTCAAATCTAGGGATGCTGAGCTGGAGTCCACAAAAGCTCAACTCATGGAGGCT
AAAGCCCATTTGGGCAGCGCTGATTTTCTAACTGAAGAATTCAAGAAAACCAGCGAGTTCTATGCCATGCAAGACGAGATATGGAACGATGGCATCAAGTGGGCACAAAA
GAGATACAGCAAACACCACCCCACCGTGGATGGTTCCTTCATCCAAGAAGATCTTGCTGCTCTCGCCGCCAATCCTGATGCCTTTGCCTCTTCTGATGACTCCTCCGGCG
GTAGAGACCATATGGACCTCTGA
mRNA sequence
ATGCCAAAATCTGGCCAAGCGCTCACCTGCCCTTCACGTGACACCAACAATGAATTTGTTGAGATATGGAGAAGACGTGACCTGAAAAGAAAGAAACCTTTTATACCTCT
CTCGCTAGGGTTTTTCCCTTTCCGCGTCGTCCACCGCACGTTGCCTCCTGCTACTGGGCCCAAGGAGATTGACCCGGGCAACCCAAGTGGCCAGTGGGGCGGCGCACCTG
ACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCGGCAAAAAGCCGAGGTCGATGTCACCTAACGGGACCCCACGGTCCTGTCATGTCAAGGAGAGGAGA
GAGAAAAATACGTGGGCTGAGGCTCATCTGCCTCGGTCTGGCGTGTCAGGCGGCGCACCTGACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCGGCAA
AAAGCCGAGGCCGATGTCACCTGACGGGACCCCACGGTCCTGTCATGTCAAGGAGAGGAGAGAGAAAAATACGTGGGCCGAGGCTCATCTGCCTCGGTCTGGCGTGTCAG
GCGGCGCACCTGACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCAGCAAAAAGCCAAGGCCGATGTCACCTGACGGGACCCCACGGTCCTGTCATGTC
AAGCTTCTAGTGAGTGCTATCCTTTTACTTTCTTTTGCTACGCATATGGCCAGCGAAGGCTCTGTTACCTCTCCGGACGTGGAGGAATCTTATTCCGATGACGGTCCTTC
AAGCTCGGGCTGCTTTGTGGACCCGGAGATTTCGGATAGCAGTGATGGGGAGCCCCCTGCACACTCATCGGACTTATCATCCTCGTTGACCGCAGACCGCTTAGAGTTCT
TGCGGCGCAAGTATGATATTCCCGACGATGTGCATCTTCGGCTCCCCAATGCTGACGAGAACTTTGAGAATCCCCCGGATGGAGAGGTTGCTTTTTACCATGCCATGTTT
AAGTTTGGGGTTCGCTTGCCACTGCCATTGTTTTTGCAAGACTTCCTGGTCTGTACGGGTCTAGCCCCTGCCCAGCTTGCTCCAAACGGGTGGTGCCACCTCATCGGTTG
CTTTACCCTTTGGGCGATGCACGGTGGGGGATCTCTAATGACCGTTGACGATTTCTTGTCCTTGCATACCATCAATCGCAATCCCGCTTTTGGTGACCTTTTTTATTACG
CAAGTACCAAAAAAGGCACCTTAATCAACGGACCCACTTCCGTAAAAAAGTGGAAAAATGGTTGGTTCTTTGTTAGTGGCAATTGGCTTGAAAAAACGGAAGACGGTTGT
TTTTTCGGGGTTCCAATGAGGTTTGGAGAATATGGTGAGCTCTCTATTCTCCTCCCCATTGGAATCCAATCTTTGTTGCTTTTCTTTTCCTTTGCTGACTTTTCTCTTTG
CTTTTCAGTGCCTCGCAACGTTCGACGCTCCCCGACCGCTAAGAAGTTTGCCAAATACGTCCTGACCCTCGAAAAGATTAACCGCCACGGTCCCTTTTTAGTCGATCAAA
GTGTCCTCGAAGCATCTGGGCTAGCCAGGCGCCGCACCATCAACTCTGAAGGTAAACCCCCAACGCCTCTTCCTTCTGGTCGACCTTGGCACGGTCGGCCCCGGCCTGGT
CGTCCTCGGCACGGTCGGCCTCGGCTTGGTCACCAAATTTTTTTTTCCAAAAGTTACATTGCTTTTCCACATTCTCTCACAGAAATGGCCTTCCGTGGAATGTACGATTC
TCAGCGGAAGAGACGTGAAGCACGTAACAGGGCTGGAACCTCTCGGGCCTCTGTGGACCTAATCGAGGATGAGGCTCCACGGGTTACTGCCGAGACCTCTCGTCGTCCTG
CAGCTGCTACCCGCAGAACCCGGTATCAGACGCGCTCCTCGGTCATCGAGACAGATCTTAGCACAGGCATCCCGGTCTTTGCCCTTCTCGAGGACTACGGAAGCGGCGGC
AATGAGGTAGAGGTCCTAACCCAAAACTTCATGTGCTGGCAAGGGTTGCAATCTCGGAGGCCAGAAGGTGAGCTAGGAGTTGAGGATCCTGCCCAAGGAATGCAAGAATT
CCAGAGGCACCTTCGTGATGCTTTCACGAAGGCCACTGCCTCTACGATTAGCCTGACAAAAAAAATCAACGAGCTTCAACTGAACAGCGTCCCACGGAGTGAGCTCATTC
AAGCCCAAGAGAGGCTCAACGAGGCCAACCGCCTGCTAGAGGAAGTGCGAGCGAAGCTCAAATCTAGGGATGCTGAGCTGGAGTCCACAAAAGCTCAACTCATGGAGGCT
AAAGCCCATTTGGGCAGCGCTGATTTTCTAACTGAAGAATTCAAGAAAACCAGCGAGTTCTATGCCATGCAAGACGAGATATGGAACGATGGCATCAAGTGGGCACAAAA
GAGATACAGCAAACACCACCCCACCGTGGATGGTTCCTTCATCCAAGAAGATCTTGCTGCTCTCGCCGCCAATCCTGATGCCTTTGCCTCTTCTGATGACTCCTCCGGCG
GTAGAGACCATATGGACCTCTGA
Protein sequence
MPKSGQALTCPSRDTNNEFVEIWRRRDLKRKKPFIPLSLGFFPFRVVHRTLPPATGPKEIDPGNPSGQWGGAPDTSGVPVWLQARLGLGKKPRSMSPNGTPRSCHVKERR
EKNTWAEAHLPRSGVSGGAPDTSGVPVWLQARLGLGKKPRPMSPDGTPRSCHVKERREKNTWAEAHLPRSGVSGGAPDTSGVPVWLQARLGLSKKPRPMSPDGTPRSCHV
KLLVSAILLLSFATHMASEGSVTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMF
KFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLINGPTSVKKWKNGWFFVSGNWLEKTEDGC
FFGVPMRFGEYGELSILLPIGIQSLLLFFSFADFSLCFSVPRNVRRSPTAKKFAKYVLTLEKINRHGPFLVDQSVLEASGLARRRTINSEGKPPTPLPSGRPWHGRPRPG
RPRHGRPRLGHQIFFSKSYIAFPHSLTEMAFRGMYDSQRKRREARNRAGTSRASVDLIEDEAPRVTAETSRRPAAATRRTRYQTRSSVIETDLSTGIPVFALLEDYGSGG
NEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMQEFQRHLRDAFTKATASTISLTKKINELQLNSVPRSELIQAQERLNEANRLLEEVRAKLKSRDAELESTKAQLMEA
KAHLGSADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKRYSKHHPTVDGSFIQEDLAALAANPDAFASSDDSSGGRDHMDL
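
As a quick consistency check between the CDS and the protein sequence in this record, the following minimal sketch translates the first 30 nucleotides of the CDS. The function name `translate` is a hypothetical helper and not part of CuGenDB. Using the standard nuclear genetic code is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
cds_prefix = "ATGCCAAAATCTGGCCAAGCGCTCACCTGC"
print(translate(cds_prefix))  # MPKSGQALTC, the first ten residues of the protein sequence
```

Passing the full CDS, with line breaks removed, to the same function would allow a comparison against the complete protein sequence.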