; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016717 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016717
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr12:40606909..40613509
RNA-Seq ExpressionLag0016717
SyntenyLag0016717
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG53676.1 hypothetical protein EZV62_018932 [Acer yangbiense]8.7e-1838.32Show/hide
Query:  LEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSLMTVDDFLSLH
        +E  R KY IP+++ LRLP   +   +PP + EVA     F+FGV LP   FL+  L     APAQL PN W  LI  + +W       L T  +F++L+
Subjt:  LEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSLMTVDDFLSLH

Query:  TINRNPAFGDLFYYVSA---KKGTLISGPTSVKKWKNGWFFVSGNWLEKTDEGCF-FGVPMRFGEYV
         +   P +   +YYVSA   K+  +   P+S K WKN WFF SG+W  +  E  F   +P RF   V
Subjt:  TINRNPAFGDLFYYVSA---KKGTLISGPTSVKKWKNGWFFVSGNWLEKTDEGCF-FGVPMRFGEYV

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.6e-1831.9Show/hide
Query:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKN
        MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L+ VD  L+     R       FY  + K  G ++ GPTS+K W  
Subjt:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKN

Query:  GWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISR--------HDPLLVDQSVLESSGLAR----RRTIGLEEMAFR---
         WF+ SG WL K + G  FF VP RFG  V        T   F       E+  R         D LL++  +L+ +   R     R      M  R   
Subjt:  GWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISR--------HDPLLVDQSVLESSGLAR----RRTIGLEEMAFR---

Query:  GMYDSQQKRREARNRAGTSR----AAFVDLTEDEAPRVVAETSRRPAPSTRRTRYQTHSSVTETDLSTGIPVFALPEDY
        G+    + R  A   A +S+    A     +ED AP +  E+S  P+   +R R QT +   +T+ +  +P      DY
Subjt:  GMYDSQQKRREARNRAGTSR----AAFVDLTEDEAPRVVAETSRRPAPSTRRTRYQTHSSVTETDLSTGIPVFALPEDY

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]1.1e-1738.01Show/hide
Query:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKN
        MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L+ VD  L+     R       FY  + K  G ++ GPTS+K W  
Subjt:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKN

Query:  GWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISRH---DPLLVDQSVLES
         WF+ SG WL K + G  FF VP RFG  V        T   F       E+  R      L+ D+ +LES
Subjt:  GWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISRH---DPLLVDQSVLES

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]1.5e-1738.07Show/hide
Query:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYVSAKKGT--LISGPTSVKKWK
        MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L+ VD  L+     R       F Y+ A+KG   ++ GPTS+K W 
Subjt:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYVSAKKGT--LISGPTSVKKWK

Query:  NGWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISRHDP-------LLVDQSVLES
          WF+ SG WL K + G  FF VP RFG  V  ++R  P   + +    T++    H P       L+ D+ +LES
Subjt:  NGWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISRHDP-------LLVDQSVLES

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]6.6e-2634.8Show/hide
Query:  LRHKYDIPDDVHLRLPNADESFENPPDGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLH
        LR  + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L  VD  L+  
Subjt:  LRHKYDIPDDVHLRLPNADESFENPPDGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLH

Query:  TINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKNGWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISR--------HD
           R       FY  + K  G ++ GPTS+K W   WF+ SG WL K + G  FF VP RFG  V        T   F       E+  R         D
Subjt:  TINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKNGWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISR--------HD

Query:  PLLVDQSVLESSGLAR--RRTIGLEEMAFRGMYDSQQKRREARNRAGTSRAAFVDLTEDEAPRVVAETSRRPA
         LL++  +L+ +   R    +    E+A    + S  KR+ ++ RA    AA    ++   P VV   S  PA
Subjt:  PLLVDQSVLESSGLAR--RRTIGLEEMAFRGMYDSQQKRREARNRAGTSRAAFVDLTEDEAPRVVAETSRRPA

TrEMBL top hitse value%identityAlignment
A0A2N9E4I4 Uncharacterized protein4.2e-1837.58Show/hide
Query:  DRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTVDDFL
        D ++ LR +Y IPDDV +R+P+ADE    P  +G+VAFY    K G+R P+  F+++ L    LAP Q+ PNGW  +I C  +W +   G   +TVD+FL
Subjt:  DRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTVDDFL

Query:  SLH-TINRNPAFGDLFYYVSAKKGTLISG-PTSVKKWKNGWFFVSG-NWLEKTDEG--CFFGVPM
          +  +    + G   +        ++ G P+S + WK+G+FFV G NW     EG   F GVP+
Subjt:  SLH-TINRNPAFGDLFYYVSAKKGTLISG-PTSVKKWKNGWFFVSG-NWLEKTDEG--CFFGVPM

A0A2N9HAM1 Uncharacterized protein5.5e-1835.8Show/hide
Query:  SLSADRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTV
        S++ D ++ +R +Y IPDDV LR+P++DE    P   G+VAFY    K G+R P+  F+++ L   GLAP Q+ PNGW  +I C  +W +   GS  ++V
Subjt:  SLSADRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTV

Query:  DDFLSLHTINRNPAFGDLFYYVSAKKGT-LISG-PTSVKKWKNGWFFVSG-NW--LEKTDEGCFFGVPMRFGEYVP
        D+FL  +  ++       + + +    + ++ G PTS + WK+ +FFV G NW  L + D   F GV   +G   P
Subjt:  DDFLSLHTINRNPAFGDLFYYVSAKKGT-LISG-PTSVKKWKNGWFFVSG-NW--LEKTDEGCFFGVPMRFGEYVP

A0A2N9HNJ5 Uncharacterized protein8.4e-1933.33Show/hide
Query:  LSADRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTVD
        ++ D +  +R +Y IPDDV LR+P++DE    P   G+VAFY    K G+R P+  F+++FL   GLAP Q+ PNGW  +I C  +W +   GS  +TVD
Subjt:  LSADRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTVD

Query:  DFLSLHTINRNPAFGDLFYYVSAKKGT-LISG-PTSVKKWKNGWFFVSG-NW--LEKTDEGCFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKI-SR
        +FL  +   +       + + +    T ++ G P+S + WK+ +FFV G NW  L + D   F GV   +G   P  + R      + + +L I  I  R
Subjt:  DFLSLHTINRNPAFGDLFYYVSAKKGT-LISG-PTSVKKWKNGWFFVSG-NW--LEKTDEGCFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKI-SR

Query:  HDPLLVDQSVLESSGL
           + ++  +L S  L
Subjt:  HDPLLVDQSVLESSGL

A0A2N9I4S2 Uncharacterized protein4.2e-1837.58Show/hide
Query:  DRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTVDDFL
        D ++ LR +Y IPDDV +R+P+ADE    P  +G+VAFY    K G+R P+  F+++ L    LAP Q+ PNGW  +I C  +W +   G   +TVD+FL
Subjt:  DRLEFLRHKYDIPDDVHLRLPNADESFENPP-DGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSL-MTVDDFL

Query:  SLH-TINRNPAFGDLFYYVSAKKGTLISG-PTSVKKWKNGWFFVSG-NWLEKTDEG--CFFGVPM
          +  +    + G   +        ++ G P+S + WK+G+FFV G NW     EG   F GVP+
Subjt:  SLH-TINRNPAFGDLFYYVSAKKGTLISG-PTSVKKWKNGWFFVSG-NWLEKTDEG--CFFGVPM

A0A6J1DXS5 uncharacterized protein LOC1110255023.2e-2634.8Show/hide
Query:  LRHKYDIPDDVHLRLPNADESFENPPDGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLH
        LR  + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L  VD  L+  
Subjt:  LRHKYDIPDDVHLRLPNADESFENPPDGEVAFYHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGS----LMTVDDFLSLH

Query:  TINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKNGWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISR--------HD
           R       FY  + K  G ++ GPTS+K W   WF+ SG WL K + G  FF VP RFG  V        T   F       E+  R         D
Subjt:  TINRNPAFGDLFYYVSAK-KGTLISGPTSVKKWKNGWFFVSGNWLEKTDEG-CFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISR--------HD

Query:  PLLVDQSVLESSGLAR--RRTIGLEEMAFRGMYDSQQKRREARNRAGTSRAAFVDLTEDEAPRVVAETSRRPA
         LL++  +L+ +   R    +    E+A    + S  KR+ ++ RA    AA    ++   P VV   S  PA
Subjt:  PLLVDQSVLESSGLAR--RRTIGLEEMAFRGMYDSQQKRREARNRAGTSRAAFVDLTEDEAPRVVAETSRRPA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G46696.1 Protein of unknown function, DUF6019.0e-0521.08Show/hide
Query:  SADRLEFLRHKYDIPDDVHLRLPNADESFENPPDGEVAFYHTMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSLMTVDDF
        ++ RL  LR  + IP  + L  P      ENPP G    +   F   G+  PLP  L D +   G+A  QL PN          L+A+  G +  T    
Subjt:  SADRLEFLRHKYDIPDDVHLRLPNADESFENPPDGEVAFYHTMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSLMTVDDF

Query:  LSLHTINRNPAFGDLFYYVSAKKGTLI--SGPTSVKKWKNGWFFVSGNWLEKTDEGCFFGVPMRFGEYVPRNVRRS--PTARKFAKYVLTIEKISRHDPL
                        +++S +KG  +    P   ++W+  +FF   N L   ++   F       E+  R V+ S  P +  FA +        R D  
Subjt:  LSLHTINRNPAFGDLFYYVSAKKGTLI--SGPTSVKKWKNGWFFVSGNWLEKTDEGCFFGVPMRFGEYVPRNVRRS--PTARKFAKYVLTIEKISRHDPL

Query:  LVDQSVLESSGLARRRTIGLEEMAFRGMYDSQQKRREARNRAGTSRAAFVDLTEDEAPRVVAETSRRPAPSTRRTRYQTHSSVTETDLSTGIPVFALPED
          D++ ++S       T  LE+   R +  ++ +  ++ +R  T  +       +       +  + PA      R          ++S   P  + P  
Subjt:  LVDQSVLESSGLARRRTIGLEEMAFRGMYDSQQKRREARNRAGTSRAAFVDLTEDEAPRVVAETSRRPAPSTRRTRYQTHSSVTETDLSTGIPVFALPED

Query:  YGSGG---NEVDILTQNFMCWQGLQSRRPEGE
           G      V+   QN +     +S   E E
Subjt:  YGSGG---NEVDILTQNFMCWQGLQSRRPEGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTCCAAACGGACAATTAATGGACACGGGCTTCAAGGGAGACAAATTCACTTGAAGAAGGGGAAAACTTGGAAAATTTGCATAAAGGAAAAGCTTGACAGGTTCCT
AGTTAACCTGGAGATGGGGCAAAAGTTCAAGACCATAAAGATTGAGCATCTGAATTTTCATTCCTCTCACCATAGAGCCATTGTGGCCTCTTTTGATGGTTCTGGGCAGG
AGAAAAAAGGGGTGTTCATCTTCAGATCGAAAACCGACATGACCGATCAAAACCGACGTCGAAATTTTCTAAATCAGAGTAACAACAGTCCAGATTTAACCGCAATCAAG
AGGAAAATTGAGAGCATACCAGAAAAAGTCGATCAAATGTCGGATGTTCACCTGTTTAGTCAGAATTCGACTCCTCCAACTCCAAGTAGATCAAAGAGTCTTTCGAGTCA
CGCAAGCTGGAAGCCTCCGTTGGAAAATTCGTGGAAGTTAAACGTTGAAGCGACAAGTCTAGGGGTTTCAAAGGGCAGCATTAATTCGCCGTCTGCAGCAGCCTTGGGAA
GACCATCAGCCAGCGCTGATGAAGTTTTGACTTTTACAGTCGACATGAAGGTTGAGAACAACAAAAGTCGTTTGGTTCAGGAAGCCAAGGTTGATTCTCTTTTTAATGAA
GCAATTTTAACTCCGGGTTTCCCCTCCTTGCGTCGTCCATCGCACGTGATCTCCCGCTGCTGGGCCCAAGGGATCGATCTAGGCGGCCCCGATAGCCAATGGGCCAATGG
CCTGGTCGGCCTCGGCATTGGGCCGAGGCCGACAAGACCCTCGACCTCGGCTTTGGGCCGAGGCCGAGGTGGAAATCCACTCTTGCTGGTTTCGCTAACCCAGTTAGAGA
GCACAACGCTCAATTTTGGTAGCTTGGCCAGAGAACGTGCAACAAATTTGAATCCCCCCTCAAAGTTTGTCCACACGCGCGCCTCGGCCTCGGCAAAAAGCCAAGGCCGA
CGTGACTTGACGGGACCCTCCGGTCCCGTCACATCAGCCGCTCCCTCCAATAATTCTAAAATTCAAAATTTGAATTTTGAATCTCCTCGCTATTCCGACCAATTTCCGCC
TATAAAAGCCCCCTTTCCTCCCTCATTTTTCTCCACCAACACTTCCATTTTTTGCTTCACCAAGGGAGCTCCTGCACACTCCTCGGACTTATCATTCTCGTTGTCTGCGG
ACCGCCTAGAGTTTTTGCGGCACAAGTATGATATTCCCGACGATGTGCATCTTCGGCTTCCCAACGCTGACGAAAGCTTTGAGAATCCCCCTGATGGAGAGGTTGCGTTT
TACCACACCATGTTTAAGTTTGGGGTTCGCCTGCCATTGCCACTATTTTTGCAAGATTTCTTAGTCTGCACAGGTTTAGCCCCTGCCCAGCTCGCCCCGAATGGGTGGTG
CCACCTCATCGACTGCTTCACTCTTTGGGCGATGCACGGTGGGGGGTCTCTTATGACTGTTGACGATTTTTTATCTTTACATACCATCAATCGCAACCCTGCTTTTGGTG
ACCTTTTTTATTACGTAAGTGCCAAAAAAGGCACCTTAATCAGCGGACCCACTTCCGTTAAAAAGTGGAAAAACGGTTGGTTCTTTGTTAGTGGCAACTGGCTGGAAAAA
ACTGACGAGGGCTGCTTTTTTGGGGTTCCAATGAGGTTTGGAGAATATGTGCCTCGCAACGTTCGACGCTCCCCAACAGCCAGGAAGTTTGCCAAATACGTCCTAACCAT
TGAAAAGATTAGCCGCCACGACCCTTTACTAGTTGATCAAAGCGTCCTTGAATCATCTGGGTTAGCGAGGCGTCGCACCATTGGCTTAGAAGAAATGGCTTTCCGAGGAA
TGTATGACTCCCAGCAGAAGAGGCGTGAAGCACGCAACAGAGCTGGAACCTCCCGGGCGGCCTTTGTGGACTTAACCGAGGATGAGGCTCCACGGGTTGTTGCTGAGACT
TCTCGCCGACCTGCCCCTTCTACTCGTAGAACTCGGTATCAGACGCACTCCTCGGTAACCGAGACAGACCTTAGCACAGGCATCCCGGTTTTTGCCCTTCCCGAGGACTA
CGGGAGTGGCGGCAACGAGGTAGACATCCTAACCCAGAACTTCATGTGCTGGCAAGGGTTGCAATCCCGGAGGCCAGAAGGTGAGTTAGGGGTTGAGGATCCTGCCCAAG
GTATGCGAGAATTCCAGAGGCACCTCCGAGATGAGAGGCTCAACGAGGCCAATCGCCTGCTGGAGGAGGTGCGAACAAAGCTCAAATCTAGGGATGCTGAGTTGGAGTCC
ACAAAAGCTCAACTTATGGAGGCTAAAGCCCATTTGGCCAGCGCTGACAACCTAGCTGAAGAGTTCAAGAAAATCGGCGAATTCTATGCCATGCAGGACGAAATATGGAA
CGATGGCATCAAGTGGGCGCAAAAGAGATACAGCAAGCGTCATCCCATCGTGGATGGTTCATTCATTCAAGAGGATCTCGCTACTCTCGCCACTAACCCTGATGCCTTTG
TCTCTTCTGATGAGTCCTCTGGCGGTAGAGACCATATGGACCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATTCCAAACGGACAATTAATGGACACGGGCTTCAAGGGAGACAAATTCACTTGAAGAAGGGGAAAACTTGGAAAATTTGCATAAAGGAAAAGCTTGACAGGTTCCT
AGTTAACCTGGAGATGGGGCAAAAGTTCAAGACCATAAAGATTGAGCATCTGAATTTTCATTCCTCTCACCATAGAGCCATTGTGGCCTCTTTTGATGGTTCTGGGCAGG
AGAAAAAAGGGGTGTTCATCTTCAGATCGAAAACCGACATGACCGATCAAAACCGACGTCGAAATTTTCTAAATCAGAGTAACAACAGTCCAGATTTAACCGCAATCAAG
AGGAAAATTGAGAGCATACCAGAAAAAGTCGATCAAATGTCGGATGTTCACCTGTTTAGTCAGAATTCGACTCCTCCAACTCCAAGTAGATCAAAGAGTCTTTCGAGTCA
CGCAAGCTGGAAGCCTCCGTTGGAAAATTCGTGGAAGTTAAACGTTGAAGCGACAAGTCTAGGGGTTTCAAAGGGCAGCATTAATTCGCCGTCTGCAGCAGCCTTGGGAA
GACCATCAGCCAGCGCTGATGAAGTTTTGACTTTTACAGTCGACATGAAGGTTGAGAACAACAAAAGTCGTTTGGTTCAGGAAGCCAAGGTTGATTCTCTTTTTAATGAA
GCAATTTTAACTCCGGGTTTCCCCTCCTTGCGTCGTCCATCGCACGTGATCTCCCGCTGCTGGGCCCAAGGGATCGATCTAGGCGGCCCCGATAGCCAATGGGCCAATGG
CCTGGTCGGCCTCGGCATTGGGCCGAGGCCGACAAGACCCTCGACCTCGGCTTTGGGCCGAGGCCGAGGTGGAAATCCACTCTTGCTGGTTTCGCTAACCCAGTTAGAGA
GCACAACGCTCAATTTTGGTAGCTTGGCCAGAGAACGTGCAACAAATTTGAATCCCCCCTCAAAGTTTGTCCACACGCGCGCCTCGGCCTCGGCAAAAAGCCAAGGCCGA
CGTGACTTGACGGGACCCTCCGGTCCCGTCACATCAGCCGCTCCCTCCAATAATTCTAAAATTCAAAATTTGAATTTTGAATCTCCTCGCTATTCCGACCAATTTCCGCC
TATAAAAGCCCCCTTTCCTCCCTCATTTTTCTCCACCAACACTTCCATTTTTTGCTTCACCAAGGGAGCTCCTGCACACTCCTCGGACTTATCATTCTCGTTGTCTGCGG
ACCGCCTAGAGTTTTTGCGGCACAAGTATGATATTCCCGACGATGTGCATCTTCGGCTTCCCAACGCTGACGAAAGCTTTGAGAATCCCCCTGATGGAGAGGTTGCGTTT
TACCACACCATGTTTAAGTTTGGGGTTCGCCTGCCATTGCCACTATTTTTGCAAGATTTCTTAGTCTGCACAGGTTTAGCCCCTGCCCAGCTCGCCCCGAATGGGTGGTG
CCACCTCATCGACTGCTTCACTCTTTGGGCGATGCACGGTGGGGGGTCTCTTATGACTGTTGACGATTTTTTATCTTTACATACCATCAATCGCAACCCTGCTTTTGGTG
ACCTTTTTTATTACGTAAGTGCCAAAAAAGGCACCTTAATCAGCGGACCCACTTCCGTTAAAAAGTGGAAAAACGGTTGGTTCTTTGTTAGTGGCAACTGGCTGGAAAAA
ACTGACGAGGGCTGCTTTTTTGGGGTTCCAATGAGGTTTGGAGAATATGTGCCTCGCAACGTTCGACGCTCCCCAACAGCCAGGAAGTTTGCCAAATACGTCCTAACCAT
TGAAAAGATTAGCCGCCACGACCCTTTACTAGTTGATCAAAGCGTCCTTGAATCATCTGGGTTAGCGAGGCGTCGCACCATTGGCTTAGAAGAAATGGCTTTCCGAGGAA
TGTATGACTCCCAGCAGAAGAGGCGTGAAGCACGCAACAGAGCTGGAACCTCCCGGGCGGCCTTTGTGGACTTAACCGAGGATGAGGCTCCACGGGTTGTTGCTGAGACT
TCTCGCCGACCTGCCCCTTCTACTCGTAGAACTCGGTATCAGACGCACTCCTCGGTAACCGAGACAGACCTTAGCACAGGCATCCCGGTTTTTGCCCTTCCCGAGGACTA
CGGGAGTGGCGGCAACGAGGTAGACATCCTAACCCAGAACTTCATGTGCTGGCAAGGGTTGCAATCCCGGAGGCCAGAAGGTGAGTTAGGGGTTGAGGATCCTGCCCAAG
GTATGCGAGAATTCCAGAGGCACCTCCGAGATGAGAGGCTCAACGAGGCCAATCGCCTGCTGGAGGAGGTGCGAACAAAGCTCAAATCTAGGGATGCTGAGTTGGAGTCC
ACAAAAGCTCAACTTATGGAGGCTAAAGCCCATTTGGCCAGCGCTGACAACCTAGCTGAAGAGTTCAAGAAAATCGGCGAATTCTATGCCATGCAGGACGAAATATGGAA
CGATGGCATCAAGTGGGCGCAAAAGAGATACAGCAAGCGTCATCCCATCGTGGATGGTTCATTCATTCAAGAGGATCTCGCTACTCTCGCCACTAACCCTGATGCCTTTG
TCTCTTCTGATGAGTCCTCTGGCGGTAGAGACCATATGGACCTCTGA
Protein sequenceShow/hide protein sequence
MYSKRTINGHGLQGRQIHLKKGKTWKICIKEKLDRFLVNLEMGQKFKTIKIEHLNFHSSHHRAIVASFDGSGQEKKGVFIFRSKTDMTDQNRRRNFLNQSNNSPDLTAIK
RKIESIPEKVDQMSDVHLFSQNSTPPTPSRSKSLSSHASWKPPLENSWKLNVEATSLGVSKGSINSPSAAALGRPSASADEVLTFTVDMKVENNKSRLVQEAKVDSLFNE
AILTPGFPSLRRPSHVISRCWAQGIDLGGPDSQWANGLVGLGIGPRPTRPSTSALGRGRGGNPLLLVSLTQLESTTLNFGSLARERATNLNPPSKFVHTRASASAKSQGR
RDLTGPSGPVTSAAPSNNSKIQNLNFESPRYSDQFPPIKAPFPPSFFSTNTSIFCFTKGAPAHSSDLSFSLSADRLEFLRHKYDIPDDVHLRLPNADESFENPPDGEVAF
YHTMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIDCFTLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYVSAKKGTLISGPTSVKKWKNGWFFVSGNWLEK
TDEGCFFGVPMRFGEYVPRNVRRSPTARKFAKYVLTIEKISRHDPLLVDQSVLESSGLARRRTIGLEEMAFRGMYDSQQKRREARNRAGTSRAAFVDLTEDEAPRVVAET
SRRPAPSTRRTRYQTHSSVTETDLSTGIPVFALPEDYGSGGNEVDILTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDERLNEANRLLEEVRTKLKSRDAELES
TKAQLMEAKAHLASADNLAEEFKKIGEFYAMQDEIWNDGIKWAQKRYSKRHPIVDGSFIQEDLATLATNPDAFVSSDESSGGRDHMDL