CuGenDBv2

Lag0021675 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021675
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr7:10555616..10557535
RNA-Seq Expression: Lag0021675
Synteny: Lag0021675
Gene Ontology terms: NA
InterPro domains:
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
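
The genome location string in this record can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr7:10555616..10557535")

# Assumption: 1-based, inclusive coordinates, so both end points count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 1920 bp.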


Homology
GenBank top hits (e value, % identity, alignment)
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] (e value: 1.2e-53; identity: 40.16%)
Query:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP
        M +S+LL +W    LTSEE+ ++V  D  A+E  G  L+L L+ KL   R +   V++   + AWK+D     +D +G N+F+F F    DR R+ R GP
Subjt:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP

Query:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW
        W F+R L+++  P+   +P+D  F +V+ W+H F+L +   N++MA RLGNA+G+FED ++      WGS LR++VR D+ +PL RGI++  DGP+ G W
Subjt:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW

Query:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTG
        +PI+YERLP+   +CG + H  + C+          L  QYG WLR+ G
Subjt:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] (e value: 2.3e-60; identity: 44.8%)
Query:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPW
        MD  +LL DW +  LTSEE+++++  D +AV+ A   L   L+GKL   R + A+V+ R    AWK++  L ++ +G N+F+F F  + D  RV + GPW
Subjt:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPW

Query:  LFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDAD-NRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW
         F++ L+VL  P       +  F+ VAFW+H+F+LP+ W N++MA RLGNA+G F D D N  GF  WG+SLR++V +D+ +PLRRGI+I  DGP+ G W
Subjt:  LFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDAD-NRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW

Query:  VPIRYERLPEVCSYCGLITHSSRGC-AMFLQSENRARLFQQYGDWLRYTG
        +PI+YERLP+ C +CG+I HSS  C A +L +++ +R   +YG WLR+ G
Subjt:  VPIRYERLPEVCSYCGLITHSSRGC-AMFLQSENRARLFQQYGDWLRYTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 3.1e-52; identity: 34.69%)
Query:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP
        M +  LL +W    LTSEEE+ ++  D  A    G  L+  L+GKL+  RP+   VM+   R+AWK++ +  ++  LG N+F+F F   +DR ++++ GP
Subjt:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP

Query:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW
        W F+R L+++  P+  + P +  F+ +  W+  F+LP+    + MA RLGNALG FE+AD  +    WGS+LR++V LD+++PLRRGI++  DGP+ G W
Subjt:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW

Query:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTGRGMTLPPVLIQEDNSVHPPNVAQGQRISMDVVPTSASGPTVNELPRNRFNG
        +PI+YERLP+ C +CGL               + +R   QYG WLRY G      P + Q    +   + +     S    P  A    V   P      
Subjt:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTGRGMTLPPVLIQEDNSVHPPNVAQGQRISMDVVPTSASGPTVNELPRNRFNG

Query:  IRICEPVAAPPQQSNHSTLQ
        I +  PV   P++    + Q
Subjt:  IRICEPVAAPPQQSNHSTLQ

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] (e value: 3.8e-42; identity: 38.52%)
Query:  SSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLF
        + SLL+    LSLTSEE+ V  +         G S D+CL+GKL   RP   E M+    S W+    +Q+  +G N+F+F FG+ +D+ RV   GPW F
Subjt:  SSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLF

Query:  ERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPI
        ++ LL+L      ++P D   + V FW+H+  LP+   N+ + E +GNA+G F D D  +G + WG ++R++V LD+ +PLRRG+++        +WV  
Subjt:  ERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPI

Query:  RYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQ-QYGDWLR
        +YERLP  C +CG + HS R C   L S + +R+   QYG WLR
Subjt:  RYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQ-QYGDWLR

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis] (e value: 3.5e-40; identity: 36.89%)
Query:  SSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLF
        + SL++    LSLTSEE+ V  +         G S D+CL+GKL   RP   E M+    S W+    +Q+  +G N+F+F FG+ +D+ RV   GPW F
Subjt:  SSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLF

Query:  ERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPI
        ++ LL+L      ++P D   + V FW+H+  LP+   N+ + + +GNA+G F D D  +G + WG ++R++V +D+ +PLRRG+++        +WV  
Subjt:  ERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPI

Query:  RYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQ-QYGDWLR
        +YERLP  C +CG + HS R C   L   +  R+   QYG WLR
Subjt:  RYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQ-QYGDWLR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BSZ1 uncharacterized protein LOC111005481 (e value: 6.0e-54; identity: 40.16%)
Query:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP
        M +S+LL +W    LTSEE+ ++V  D  A+E  G  L+L L+ KL   R +   V++   + AWK+D     +D +G N+F+F F    DR R+ R GP
Subjt:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP

Query:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW
        W F+R L+++  P+   +P+D  F +V+ W+H F+L +   N++MA RLGNA+G+FED ++      WGS LR++VR D+ +PL RGI++  DGP+ G W
Subjt:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW

Query:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTG
        +PI+YERLP+   +CG + H  + C+          L  QYG WLR+ G
Subjt:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTG

A0A6J1D765 uncharacterized protein LOC111017902 (e value: 1.9e-36; identity: 33.33%)
Query:  WSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLFERFLLVL
        W     T++E + +V  DR        ++ LC++ KL+  + + AE +R   +S W++ +  + + LGMN+++  F +  +++RV   GPW F + LLVL
Subjt:  WSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLFERFLLVL

Query:  VFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPIRYERLPE
          P    +P+D  F+  AFW+ I  +P +  +  MA  LG  LG  E+ +      + G  +R++V++D+++PLRRGI++  +     +W P+RYE+LP+
Subjt:  VFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPIRYERLPE

Query:  VCSYCGLITHSSRGCAMFLQSENRARLF-----QQYGDWLRYT
         C  CG I HS R C      E R+++      +QYGDWLR T
Subjt:  VCSYCGLITHSSRGCAMFLQSENRARLF-----QQYGDWLRYT

A0A6J1DU55 uncharacterized protein LOC111023135 (e value: 1.1e-60; identity: 44.8%)
Query:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPW
        MD  +LL DW +  LTSEE+++++  D +AV+ A   L   L+GKL   R + A+V+ R    AWK++  L ++ +G N+F+F F  + D  RV + GPW
Subjt:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPW

Query:  LFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDAD-NRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW
         F++ L+VL  P       +  F+ VAFW+H+F+LP+ W N++MA RLGNA+G F D D N  GF  WG+SLR++V +D+ +PLRRGI+I  DGP+ G W
Subjt:  LFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDAD-NRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW

Query:  VPIRYERLPEVCSYCGLITHSSRGC-AMFLQSENRARLFQQYGDWLRYTG
        +PI+YERLP+ C +CG+I HSS  C A +L +++ +R   +YG WLR+ G
Subjt:  VPIRYERLPEVCSYCGLITHSSRGC-AMFLQSENRARLFQQYGDWLRYTG

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 1.5e-52; identity: 34.69%)
Query:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP
        M +  LL +W    LTSEEE+ ++  D  A    G  L+  L+GKL+  RP+   VM+   R+AWK++ +  ++  LG N+F+F F   +DR ++++ GP
Subjt:  MDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKID-SVLQMDRLGMNVFIFRFGNDIDRARVFRQGP

Query:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW
        W F+R L+++  P+  + P +  F+ +  W+  F+LP+    + MA RLGNALG FE+AD  +    WGS+LR++V LD+++PLRRGI++  DGP+ G W
Subjt:  WLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLW

Query:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTGRGMTLPPVLIQEDNSVHPPNVAQGQRISMDVVPTSASGPTVNELPRNRFNG
        +PI+YERLP+ C +CGL               + +R   QYG WLRY G      P + Q    +   + +     S    P  A    V   P      
Subjt:  VPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTGRGMTLPPVLIQEDNSVHPPNVAQGQRISMDVVPTSASGPTVNELPRNRFNG

Query:  IRICEPVAAPPQQSNHSTLQ
        I +  PV   P++    + Q
Subjt:  IRICEPVAAPPQQSNHSTLQ

A0A6P6S3G2 uncharacterized protein LOC113687293 (e value: 4.6e-38; identity: 31%)
Query:  LSLTSEEEDVSVV-ADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLFERFLLVLVF
        L L  +EE+V ++  D + ++ + LS +LC+LGKL+  +    E +    +  W     L    LG N+F+F+F + +D+ +VF  GPW F+  LLV+  
Subjt:  LSLTSEEEDVSVV-ADREAVERAGLSLDLCLLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLFERFLLVLVF

Query:  PIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPIRYERLPEVC
         I  ++  +    + +FW+ ++ LP+ W N   AE LGN LG++E  + R     WG  LR++V++ L  PL+R + +F +G +    V  +YERLP +C
Subjt:  PIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPIRYERLPEVC

Query:  SYCGLITHSSRGCAMFLQSENRARLFQQYGDWLR------YTGRGMTLPPVLIQEDNSVHPPNVAQGQRISMDVVPT---------SASGPTVNELPRNR
         YCG I H  R C + L +        QYG WLR      ++G+  + P V I ++ +       +G+R +    P          + S P  N+L R++
Subjt:  SYCGLITHSSRGCAMFLQSENRARLFQQYGDWLR------YTGRGMTLPPVLIQEDNSVHPPNVAQGQRISMDVVPT---------SASGPTVNELPRNR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G31430.1 unknown protein (e value: 9.6e-12; identity: 28.57%)
Query:  FIFRFGNDIDRARVFRQGPWLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLN
        FIF     ++   V R+GPW F  ++++L    +   P    F  + FW+ I  +P  + N+ + E +G ALG   D D     +      R+ +  D+ 
Subjt:  FIFRFGNDIDRARVFRQGPWLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLDLN

Query:  RPLRRGIRIFPDGPLSGLWVPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRA
         PLR   R F         +  RYERL   C  CG++TH    C +    E +A
Subjt:  RPLRRGIRIFPDGPLSGLWVPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRA

AT3G47920.1 unknown protein (e value: 7.1e-07; identity: 24.66%)
Query:  FRFGNDIDRARVFRQGPWLFERFLLVLVFPIRGLR----PVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLD
        F F N++D   V R+  WLF  +       +   R    PV +  +++  W+ +  +P+ +  +  A  + + +G     D  +  +   + +R++VR+ 
Subjt:  FRFGNDIDRARVFRQGPWLFERFLLVLVFPIRGLR----PVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNALGIFEDADNRNGFLFWGSSLRLKVRLD

Query:  LNRPLRRGIRIFPDGPLSGLWVPIRYERLPEVCSYCGLITHSSRGC
        +   LR   RI  D   + L +  +YERL  +CS C  +TH    C
Subjt:  LNRPLRRGIRIFPDGPLSGLWVPIRYERLPEVCSYCGLITHSSRGC


Sequences
CDS sequence
ATGGCCGCGATTTCGAGCTTGGATCTGGTCACTGTTATCTTTTTTCGTCCGTTTTGGATTCGCTTGCTCAATCTTGGCGGATTTTTAGCCATTGTGTTGATATATCTGTG
TCGTGGTGGTTGCAGAACCCCACCATCTCCAGGTTTGGGTTTTTCTATCTCCTTCGTCTACTGTTCTGCCTTTTTTCGGTCAGTCGTTCAAAAGTTCATGGATTCTTCTT
CTTTACTTAACGATTGGTCCCGATTGAGTTTGACTTCGGAGGAGGAGGATGTTTCTGTGGTTGCTGATAGGGAGGCTGTGGAGAGAGCGGGACTCTCTTTAGATCTGTGT
TTACTTGGCAAGTTATATTGCCATCGACCACTTGGTGCTGAGGTCATGCGGCGGAATTTTCGTTCGGCATGGAAAATTGACTCTGTTTTGCAGATGGATCGGCTGGGCAT
GAACGTGTTCATTTTCCGCTTTGGAAATGACATTGATCGTGCTCGAGTTTTTCGGCAGGGACCGTGGCTGTTTGAAAGATTTCTCCTCGTTTTGGTTTTTCCAATCCGGG
GGTTGAGGCCGGTTGATCATCCTTTTTCTTCGGTTGCGTTTTGGTTGCATATTTTTGAACTCCCAATTGATTGGTTCAACCAAAGCATGGCGGAACGACTCGGCAATGCC
CTTGGTATTTTCGAAGATGCCGACAACAGAAATGGATTTCTTTTTTGGGGATCCAGCCTTCGTTTGAAGGTTCGTTTGGATCTGAATCGTCCTTTACGCAGAGGGATCCG
GATTTTTCCAGATGGTCCGTTGAGCGGTCTTTGGGTTCCTATTCGGTATGAAAGGTTGCCGGAAGTGTGCTCATATTGTGGGCTCATTACTCATTCCTCGCGGGGTTGTG
CTATGTTTTTGCAGTCGGAAAATAGAGCTCGTCTTTTTCAACAGTACGGGGACTGGCTTAGGTATACAGGGAGAGGAATGACTCTGCCTCCAGTTCTGATTCAGGAAGAC
AACTCTGTTCATCCTCCGAATGTCGCTCAGGGTCAAAGGATTTCAATGGATGTAGTTCCAACTTCAGCCTCTGGCCCGACCGTAAATGAACTACCAAGGAATCGTTTCAA
CGGGATTCGTATCTGTGAACCTGTTGCTGCACCACCTCAACAATCGAATCATTCAACCTTGCAAGACTCTCGGCGACTAAAAGGTAAGGAAAAAATCTTGGAAGCACAGA
CCAAAGGGAAAATGGCAATGGAGGATTCCGATTCACGGTGGCGGCCGAGAAGCTCGAACGGGAGCCTTCCTTCGTCGGAGATCCACAGCGAGAAACGGCTGAATTTTTTC
AACGGTCAGGGCATTAATTTGCAACATTCAAAGGGTAATACGGTTTCAAACGGCCTTCTTAATGGAGAGGAAGTCGTGGAGGGCAGTAATCGTCGCTTGTCCAGGAATTT
GGTTACGGATTTTCAGCGCTCTGATCCAATCCCTTTGGGCCTCCGGAGGGTTAAAGCCCATTTGAAGGAGGCTGATTCGGAGATTGAAAGTGATCCTGAGATGCTTGAAG
AATTCGGGTCTGATTCTGATGGATTGGGCAAATCAAGTGCAAATCAGGATGGATGTGGGCATGCTGCTGTAGCAGATCAGGTGCAGCTACAAAGTCAAGTGGGCTTTATG
CAGGGATCAATGGAAGATGGGCCCATGGAGTTTGTGTTCGGGTCAAGTCAGTGTCTTCAGTCGGACGAATCTGAGGGGGTACGGAACTCGATGGACCCTTTGATTTTTAA
AGCTAAATCTCCCAAGCTTAGTCCTCCAAATTTGCAAGGTATTGGTTGGAAGAAGCGTGCTCGAGCTGGGATGGTTCCTCGTGGAATGAAGGCGGTTGAGGAACTTCAAA
AGAGAAAGGATGGGCCCATTTTGTTTTCTCCGGGCCAAGTCAAACGTTAA
mRNA sequence
ATGGCCGCGATTTCGAGCTTGGATCTGGTCACTGTTATCTTTTTTCGTCCGTTTTGGATTCGCTTGCTCAATCTTGGCGGATTTTTAGCCATTGTGTTGATATATCTGTG
TCGTGGTGGTTGCAGAACCCCACCATCTCCAGGTTTGGGTTTTTCTATCTCCTTCGTCTACTGTTCTGCCTTTTTTCGGTCAGTCGTTCAAAAGTTCATGGATTCTTCTT
CTTTACTTAACGATTGGTCCCGATTGAGTTTGACTTCGGAGGAGGAGGATGTTTCTGTGGTTGCTGATAGGGAGGCTGTGGAGAGAGCGGGACTCTCTTTAGATCTGTGT
TTACTTGGCAAGTTATATTGCCATCGACCACTTGGTGCTGAGGTCATGCGGCGGAATTTTCGTTCGGCATGGAAAATTGACTCTGTTTTGCAGATGGATCGGCTGGGCAT
GAACGTGTTCATTTTCCGCTTTGGAAATGACATTGATCGTGCTCGAGTTTTTCGGCAGGGACCGTGGCTGTTTGAAAGATTTCTCCTCGTTTTGGTTTTTCCAATCCGGG
GGTTGAGGCCGGTTGATCATCCTTTTTCTTCGGTTGCGTTTTGGTTGCATATTTTTGAACTCCCAATTGATTGGTTCAACCAAAGCATGGCGGAACGACTCGGCAATGCC
CTTGGTATTTTCGAAGATGCCGACAACAGAAATGGATTTCTTTTTTGGGGATCCAGCCTTCGTTTGAAGGTTCGTTTGGATCTGAATCGTCCTTTACGCAGAGGGATCCG
GATTTTTCCAGATGGTCCGTTGAGCGGTCTTTGGGTTCCTATTCGGTATGAAAGGTTGCCGGAAGTGTGCTCATATTGTGGGCTCATTACTCATTCCTCGCGGGGTTGTG
CTATGTTTTTGCAGTCGGAAAATAGAGCTCGTCTTTTTCAACAGTACGGGGACTGGCTTAGGTATACAGGGAGAGGAATGACTCTGCCTCCAGTTCTGATTCAGGAAGAC
AACTCTGTTCATCCTCCGAATGTCGCTCAGGGTCAAAGGATTTCAATGGATGTAGTTCCAACTTCAGCCTCTGGCCCGACCGTAAATGAACTACCAAGGAATCGTTTCAA
CGGGATTCGTATCTGTGAACCTGTTGCTGCACCACCTCAACAATCGAATCATTCAACCTTGCAAGACTCTCGGCGACTAAAAGGTAAGGAAAAAATCTTGGAAGCACAGA
CCAAAGGGAAAATGGCAATGGAGGATTCCGATTCACGGTGGCGGCCGAGAAGCTCGAACGGGAGCCTTCCTTCGTCGGAGATCCACAGCGAGAAACGGCTGAATTTTTTC
AACGGTCAGGGCATTAATTTGCAACATTCAAAGGGTAATACGGTTTCAAACGGCCTTCTTAATGGAGAGGAAGTCGTGGAGGGCAGTAATCGTCGCTTGTCCAGGAATTT
GGTTACGGATTTTCAGCGCTCTGATCCAATCCCTTTGGGCCTCCGGAGGGTTAAAGCCCATTTGAAGGAGGCTGATTCGGAGATTGAAAGTGATCCTGAGATGCTTGAAG
AATTCGGGTCTGATTCTGATGGATTGGGCAAATCAAGTGCAAATCAGGATGGATGTGGGCATGCTGCTGTAGCAGATCAGGTGCAGCTACAAAGTCAAGTGGGCTTTATG
CAGGGATCAATGGAAGATGGGCCCATGGAGTTTGTGTTCGGGTCAAGTCAGTGTCTTCAGTCGGACGAATCTGAGGGGGTACGGAACTCGATGGACCCTTTGATTTTTAA
AGCTAAATCTCCCAAGCTTAGTCCTCCAAATTTGCAAGGTATTGGTTGGAAGAAGCGTGCTCGAGCTGGGATGGTTCCTCGTGGAATGAAGGCGGTTGAGGAACTTCAAA
AGAGAAAGGATGGGCCCATTTTGTTTTCTCCGGGCCAAGTCAAACGTTAA
Protein sequence
MAAISSLDLVTVIFFRPFWIRLLNLGGFLAIVLIYLCRGGCRTPPSPGLGFSISFVYCSAFFRSVVQKFMDSSSLLNDWSRLSLTSEEEDVSVVADREAVERAGLSLDLC
LLGKLYCHRPLGAEVMRRNFRSAWKIDSVLQMDRLGMNVFIFRFGNDIDRARVFRQGPWLFERFLLVLVFPIRGLRPVDHPFSSVAFWLHIFELPIDWFNQSMAERLGNA
LGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPIRYERLPEVCSYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTGRGMTLPPVLIQED
NSVHPPNVAQGQRISMDVVPTSASGPTVNELPRNRFNGIRICEPVAAPPQQSNHSTLQDSRRLKGKEKILEAQTKGKMAMEDSDSRWRPRSSNGSLPSSEIHSEKRLNFF
NGQGINLQHSKGNTVSNGLLNGEEVVEGSNRRLSRNLVTDFQRSDPIPLGLRRVKAHLKEADSEIESDPEMLEEFGSDSDGLGKSSANQDGCGHAAVADQVQLQSQVGFM
QGSMEDGPMEFVFGSSQCLQSDESEGVRNSMDPLIFKAKSPKLSPPNLQGIGWKKRARAGMVPRGMKAVEELQKRKDGPILFSPGQVKR
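
The InterPro annotation IPR025836 (Zinc knuckle CX2CX4HX4C) listed for this gene can be related to the protein sequence with a simple pattern search. The sketch below reads the pattern name literally as a regular expression, which is an assumption made for illustration and not how InterPro assigns domains; `ZINC_KNUCKLE` and `find_zinc_knuckles` are hypothetical names. The sequence fragment is one line copied from the protein sequence above.

```python
import re

# One line of the Lag0021675 protein sequence, copied from this record.
fragment = (
    "LGIFEDADNRNGFLFWGSSLRLKVRLDLNRPLRRGIRIFPDGPLSGLWVPIRYERLPEVC"
    "SYCGLITHSSRGCAMFLQSENRARLFQQYGDWLRYTGRGMTLPPVLIQED"
)

# Literal reading of "CX2CX4HX4C": C, any 2 residues, C, any 4, H, any 4, C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

def find_zinc_knuckles(seq):
    """Return (1-based start within seq, matched residues) per match."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(seq)]

hits = find_zinc_knuckles(fragment)
for pos, motif in hits:
    print(pos, motif)
```

The positions reported are relative to the fragment, not to the full protein. The matched stretch, CSYCGLITHSSRGC, is the same region that aligns with the cysteine- and histidine-containing segments in the homology hits above.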