CuGenDBv2

Lag0013427 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0013427
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ribonuclease H
Genome location: chr1:50138832..50143836
RNA-Seq Expression: Lag0013427
Synteny: Lag0013427
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]; e value: 2.7e-34; % identity: 37.05
Query:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------
        S K   RAFVTQF+G R R +P    LT+KQ   ESL+D + RF  E LQVEG  D V+L A +SG++DE L  S  K +P   +EA  R          
Subjt:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------

Query:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ
                                      R   + ++   +F++YTP T   EQVL  I+D  L K PE++++   +R+K +YC+FH DHGHAT++C  
Subjt:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ

Query:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSE
        L++E+E LIR+GY K +V     + P     G+   +P  EI+TI+GG  E
Subjt:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSE

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]; e value: 1.2e-29; % identity: 34.51
Query:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------
        S K   RAFVTQF+G R R +P    LT+KQ   ESL D + RF  E LQ+E   D V+L A +SG++DE L  S  K +P   +E   R          
Subjt:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------

Query:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ
                                      R   + ++   +F++YTP T   EQVL  I+D  L K PE+++    +R+K +YC+FH DH HAT++   
Subjt:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ

Query:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGA
        L++E+E LIR+GY + +V     + P     G+   +P  EI+TI+GG  E   A
Subjt:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGA

XP_030955724.1 uncharacterized protein LOC115977839 [Quercus lobata]; e value: 3.6e-31; % identity: 33.98
Query:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQ-----------------
        + FV  F+G +  ++   + LT++QG  ESL+  I RF  E L V+  DD + L A  +G+  +  ++ + +  P   AE  +                 
Subjt:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQ-----------------

Query:  -RPRSHRAVRNQE---------ARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFV
         RP+  R    +E          R   YTPL A   QVL  I+D    K PEK++ DP++RNKNKYC FH DHGH T EC  L+ +IE LIR+G  K FV
Subjt:  -RPRSHRAVRNQE---------ARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFV

Query:  GNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNI
        G D++   L     +    PL EI+ I+GG+        G+   S++ Y  A+ N+
Subjt:  GNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNI

XP_030958631.1 uncharacterized protein LOC115980538 [Quercus lobata]; e value: 1.3e-28; % identity: 32.23
Query:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAE----------------ANQR
        + FV  F+G +  ++   + LT++QG  ESL+  I RF  E L V+  DD + L A  +G+  +  ++ + +  P   AE                A +R
Subjt:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAE----------------ANQR

Query:  PR-----SHRAVRNQEA-----------------------RFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQL
         R     +H A  +++A                       R   YTPL A   QVL  I+D    K PEK++ DP++RNKNKYC FH DHGH T EC  L
Subjt:  PR-----SHRAVRNQEA-----------------------RFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQL

Query:  RDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNI
        + +IE LIR+G  K FVG D++         +    PL EI+ I+GG+        G+   S++ Y  A+ N+
Subjt:  RDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNI

XP_030970531.1 uncharacterized protein LOC115990904 [Quercus lobata]; e value: 1.3e-28; % identity: 32.27
Query:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAE----------------ANQR
        + FV  F+G +  ++   + LT++QG  ESL+  I RF  E L V+  DD + L A  +G+  +  ++ + +  P   AE                A +R
Subjt:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAE----------------ANQR

Query:  PRSHRAVRN---------------QEARFDR-------------YTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQL
         RS R   N                E R DR             YTPL A  EQVL  I+D    K P+K++ DP++RN+NKYC FH DHGH T EC  L
Subjt:  PRSHRAVRN---------------QEARFDR-------------YTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQL

Query:  RDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNIDRKAQATSA
        + +IE LIR+G  +GF+G D+          +    PL EI+ I+GGSS       G+   S++ Y   + +I    ++  A
Subjt:  RDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNIDRKAQATSA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FPT8 Reverse transcriptase domain-containing protein; e value: 2.4e-28; % identity: 35.14
Query:  GSGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQRPRSHRAVRN
        GS      +F   F+G +   +P  + L V+Q   E+L+  + RF  E L V+G DD V LTA ISGLQ    L S+ K  P   +E  +R   +R  R 
Subjt:  GSGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQRPRSHRAVRN

Query:  QEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFVGND---------KSKRPLPA
           RF+ +TPL A  +++   I+D    + P KL ++PD+R K+KYC FH DHGH T +C  L+ +IE LI++G  + FV  D         + +RP   
Subjt:  QEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFVGND---------KSKRPLPA

Query:  DQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGN--IDRKAQATSACGD
        D+ +    P+ EI  I GG      AA G  + SR+ Y   + N  + +KA  T    D
Subjt:  DQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGN--IDRKAQATSACGD

A0A2N9IAR9 Ribonuclease H; e value: 1.4e-28; % identity: 36.02
Query:  GSGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAE-ANQ-RPRSHRAV
        GS      +F   F+G +   +P  + L V+Q   E+L+  + RF  E L V+G DD V LTA ISGLQ    L S+ K  P   +E  NQ + R  R  
Subjt:  GSGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAE-ANQ-RPRSHRAV

Query:  RNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFVGND---------KSKRPL
        R    RF+ +TPL A  +++   I+D    + P KL ++PDRR K+KYC FH DHGH T +C  L+ +IE LI++G  + FV  D         + +RP 
Subjt:  RNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFVGND---------KSKRPL

Query:  PADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGN--IDRKAQATSACGD
          D+ +    P+ EI  I GG      AA G  + SR+ Y   + N  + +KA  T    D
Subjt:  PADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGN--IDRKAQATSACGD

A0A2N9IUL6 Uncharacterized protein; e value: 1.8e-28; % identity: 34.77
Query:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSP--------------WPKAEANQRPR
        RAF+  F+G R   +P    L ++Q   ESL+  + RF  E +Q++  ++ VALTA  +GL     L  + K  P              +PK E      
Subjt:  RAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSP--------------WPKAEANQRPR

Query:  SHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFVGNDK-----SKRP
        S+R    +E R   +TPL  S +QVL  IQD    + P KLRSDP++R+KN YC FH DHGH T +C  L+++IEALIR+G    FV  DK       RP
Subjt:  SHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQLRDEIEALIRKGYPKGFVGNDK-----SKRP

Query:  LPADQGKGGANP-----LFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNIDRK
           D+ K          + EI+TI+GG +  G +    +  +R+ + + +    RK
Subjt:  LPADQGKGGANP-----LFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNIDRK

A0A6J1DWY0 uncharacterized protein LOC111025293; e value: 1.3e-34; % identity: 37.05
Query:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------
        S K   RAFVTQF+G R R +P    LT+KQ   ESL+D + RF  E LQVEG  D V+L A +SG++DE L  S  K +P   +EA  R          
Subjt:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------

Query:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ
                                      R   + ++   +F++YTP T   EQVL  I+D  L K PE++++   +R+K +YC+FH DHGHAT++C  
Subjt:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ

Query:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSE
        L++E+E LIR+GY K +V     + P     G+   +P  EI+TI+GG  E
Subjt:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSE

A0A6J1DYL6 uncharacterized protein LOC111025785; e value: 5.6e-30; % identity: 34.51
Query:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------
        S K   RAFVTQF+G R R +P    LT+KQ   ESL D + RF  E LQ+E   D V+L A +SG++DE L  S  K +P   +E   R          
Subjt:  SGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKDCINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQR----------

Query:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ
                                      R   + ++   +F++YTP T   EQVL  I+D  L K PE+++    +R+K +YC+FH DH HAT++   
Subjt:  -----------------------------PRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFHGDHGHATRECIQ

Query:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGA
        L++E+E LIR+GY + +V     + P     G+   +P  EI+TI+GG  E   A
Subjt:  LRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTTCCGAGTTCTTTTTGGGTGTCCAGAAGCTCGGTCGTCATAGGGAGATCATCTTTTCTAGCCTCGGCCTCGGCTGCGCTGGCCACGGGCTTAGCAGCCCTGGCTT
CCGCTTCGACAGTTTTGGCACTTCTGCACTTATCCTCGATCAGATTCCTCTGGTCTATCTTGGTATGACCGTAGCTTGGTGTTGGGCTCCTCAGAATTGCTCGACCTGTA
GGTCTGAGCCCCCTGACCGTGCTGCTCATGCTGTCAGTCTTCTTTCGATTTCTGCGGTTGATGGTTATTTTCAGAAGGGTTTGGGTTTGGATTTCGTCTTCCCTTCTCCA
TCCTCTTACTCATGGAGTGACCGATGTGCAGATTTGGCAAGCTTTCTTCCCCACAGACGGCGCCAACTGTTGATGGTCGATTTTGATGAGGGGGAAGACGTGGCCTGCAA
GACAGTAAACCTGCACACCGATGTGGTGCTCGCCACACCGACTCCGATGCTTAAGTCAGTAAACAGAACGGTAGGGAGTGGAAAAGTCCAACCAGGCAAAACCGGAGCTG
TCGGAGGCGGTAGGGGCCTAACGGCATCGGACGGACTCGGCCCGCGCGAGCGGGCCGAGGGTTGGCATGGCCGAGGCCGAGCAGGGGGTCGGGCCAAAAACACGACCCCT
TCGGTCTTGGCCCGTCCCACTTGTCGGTTTTGCCTCCTTGGTCCATCTTTCAGCCCGATTTCTCCCCGATTGTCCTCAATTGATGGTCATGGGAGTGAAGACAAACAGCT
GGTACCTCTCAGAAGTTGCTGCGGACTTCCATGGATGAAGATAGATAACCAAAAGCTGCAACATAAGTGGCACAGCAAATCCCAACCATTGACAACGTACGAGGAGCTGA
CAGGACAACCGGGAGGAGATAGGACCAGGAAAGGGACCCAGAGGGAGACCATACCGACGGGCCGGGCCAACGTGGCCCGACCTGTACGGTCGGCCTCGGCCTTGGGCCTC
GGCCGACCACTCGGCCCGTTTGCACGGGCCGAGTCCGTTTGCCTCCGCTCGGCCCCTACCGCTTCCAGCTGCCTCGGTCCAGCCTGCTTCGTCCCAGAACGCCTCCAAAC
CCTAGGAGTCCGAGCAGCTGCTCAGTTTCCTAACTTAGGCATCGGAGGCGGTGTGGCCTACACCACACCGGTGTCCAGCGATTCTTGTTGGTCTTGCAGGTCACGTCTTC
CCCAGCTTCTACAAATTCATTGTTGGTGTCACGTGAAGGGCAGGTGGATTGCTTGGCCAAATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGAATGCTTGCCAGT
TCGGATCGCACATCGGTGGTGGTCTGCATGATGGATCGGGTTCAGAAACCAGAGGGAAGCCTCGGGTATGAGCAGAAAGCGCCGAGCCAGACTGGGGTCGACCTCGGTAT
GGAAAAGGCCGACCCTAACTTCGGAGACTGTTGTAAGGTTTCGTGCGTCCACGATGCTGTGGACATGACCCAAGAGGATGACCAGCGGAAGAACTGGAGAGGCTCGGGCA
AAGTCCGAGGCCGAGCATTTGTTACGCAATTCCTAGGAGCCCGGAACCGACAGAAGCCTCAGATCAACTCGTTGACAGTAAAGCAGGGGCTTCGAGAAAGCTTGAAGGAT
TGTATTAACAGATTTTGTAATGAAGTTTTGCAGGTAGAAGGCCATGACGATGGAGTTGCCTTGACTGCTATGATTTCAGGTTTGCAGGATGAAAGACTGCTCAACTCGAT
CAGCAAGAGCAGTCCCTGGCCAAAGGCCGAGGCCAACCAGAGGCCAAGGAGCCACAGGGCTGTGCGGAACCAAGAAGCCAGATTCGACAGGTATACACCACTAACAGCTT
CCTTTGAACAGGTCTTGGCCGCAATACAGGACACAAATCTGTTTAAACGCCCAGAAAAGTTGAGATCAGACCCCGATAGGAGGAACAAAAACAAATACTGCATGTTCCAC
GGAGACCACGGTCATGCAACTCGGGAATGCATACAGTTGAGGGACGAGATAGAAGCCCTAATCCGAAAAGGTTACCCCAAGGGGTTCGTTGGGAACGACAAAAGCAAGAG
GCCACTGCCGGCAGATCAAGGTAAGGGCGGTGCCAACCCGCTATTCGAGATACAAACAATTTTAGGAGGATCATCCGAAGAAGGTGGAGCAGCGCATGGCGAGCAAAAAA
TGTCGAGGGAATGTTACTTTATGGCGCTCGGGAACATCGACAGGAAGGCTCAAGCAACGTCGGCCTGTGGAGATGGCCGAGGCCGAGCCTTTGAGGGGTCAAGCTATCAT
CTTCCAATGAAACGTTGA
mRNA sequence
ATGATTTCCGAGTTCTTTTTGGGTGTCCAGAAGCTCGGTCGTCATAGGGAGATCATCTTTTCTAGCCTCGGCCTCGGCTGCGCTGGCCACGGGCTTAGCAGCCCTGGCTT
CCGCTTCGACAGTTTTGGCACTTCTGCACTTATCCTCGATCAGATTCCTCTGGTCTATCTTGGTATGACCGTAGCTTGGTGTTGGGCTCCTCAGAATTGCTCGACCTGTA
GGTCTGAGCCCCCTGACCGTGCTGCTCATGCTGTCAGTCTTCTTTCGATTTCTGCGGTTGATGGTTATTTTCAGAAGGGTTTGGGTTTGGATTTCGTCTTCCCTTCTCCA
TCCTCTTACTCATGGAGTGACCGATGTGCAGATTTGGCAAGCTTTCTTCCCCACAGACGGCGCCAACTGTTGATGGTCGATTTTGATGAGGGGGAAGACGTGGCCTGCAA
GACAGTAAACCTGCACACCGATGTGGTGCTCGCCACACCGACTCCGATGCTTAAGTCAGTAAACAGAACGGTAGGGAGTGGAAAAGTCCAACCAGGCAAAACCGGAGCTG
TCGGAGGCGGTAGGGGCCTAACGGCATCGGACGGACTCGGCCCGCGCGAGCGGGCCGAGGGTTGGCATGGCCGAGGCCGAGCAGGGGGTCGGGCCAAAAACACGACCCCT
TCGGTCTTGGCCCGTCCCACTTGTCGGTTTTGCCTCCTTGGTCCATCTTTCAGCCCGATTTCTCCCCGATTGTCCTCAATTGATGGTCATGGGAGTGAAGACAAACAGCT
GGTACCTCTCAGAAGTTGCTGCGGACTTCCATGGATGAAGATAGATAACCAAAAGCTGCAACATAAGTGGCACAGCAAATCCCAACCATTGACAACGTACGAGGAGCTGA
CAGGACAACCGGGAGGAGATAGGACCAGGAAAGGGACCCAGAGGGAGACCATACCGACGGGCCGGGCCAACGTGGCCCGACCTGTACGGTCGGCCTCGGCCTTGGGCCTC
GGCCGACCACTCGGCCCGTTTGCACGGGCCGAGTCCGTTTGCCTCCGCTCGGCCCCTACCGCTTCCAGCTGCCTCGGTCCAGCCTGCTTCGTCCCAGAACGCCTCCAAAC
CCTAGGAGTCCGAGCAGCTGCTCAGTTTCCTAACTTAGGCATCGGAGGCGGTGTGGCCTACACCACACCGGTGTCCAGCGATTCTTGTTGGTCTTGCAGGTCACGTCTTC
CCCAGCTTCTACAAATTCATTGTTGGTGTCACGTGAAGGGCAGGTGGATTGCTTGGCCAAATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGAATGCTTGCCAGT
TCGGATCGCACATCGGTGGTGGTCTGCATGATGGATCGGGTTCAGAAACCAGAGGGAAGCCTCGGGTATGAGCAGAAAGCGCCGAGCCAGACTGGGGTCGACCTCGGTAT
GGAAAAGGCCGACCCTAACTTCGGAGACTGTTGTAAGGTTTCGTGCGTCCACGATGCTGTGGACATGACCCAAGAGGATGACCAGCGGAAGAACTGGAGAGGCTCGGGCA
AAGTCCGAGGCCGAGCATTTGTTACGCAATTCCTAGGAGCCCGGAACCGACAGAAGCCTCAGATCAACTCGTTGACAGTAAAGCAGGGGCTTCGAGAAAGCTTGAAGGAT
TGTATTAACAGATTTTGTAATGAAGTTTTGCAGGTAGAAGGCCATGACGATGGAGTTGCCTTGACTGCTATGATTTCAGGTTTGCAGGATGAAAGACTGCTCAACTCGAT
CAGCAAGAGCAGTCCCTGGCCAAAGGCCGAGGCCAACCAGAGGCCAAGGAGCCACAGGGCTGTGCGGAACCAAGAAGCCAGATTCGACAGGTATACACCACTAACAGCTT
CCTTTGAACAGGTCTTGGCCGCAATACAGGACACAAATCTGTTTAAACGCCCAGAAAAGTTGAGATCAGACCCCGATAGGAGGAACAAAAACAAATACTGCATGTTCCAC
GGAGACCACGGTCATGCAACTCGGGAATGCATACAGTTGAGGGACGAGATAGAAGCCCTAATCCGAAAAGGTTACCCCAAGGGGTTCGTTGGGAACGACAAAAGCAAGAG
GCCACTGCCGGCAGATCAAGGTAAGGGCGGTGCCAACCCGCTATTCGAGATACAAACAATTTTAGGAGGATCATCCGAAGAAGGTGGAGCAGCGCATGGCGAGCAAAAAA
TGTCGAGGGAATGTTACTTTATGGCGCTCGGGAACATCGACAGGAAGGCTCAAGCAACGTCGGCCTGTGGAGATGGCCGAGGCCGAGCCTTTGAGGGGTCAAGCTATCAT
CTTCCAATGAAACGTTGA
Protein sequence
MISEFFLGVQKLGRHREIIFSSLGLGCAGHGLSSPGFRFDSFGTSALILDQIPLVYLGMTVAWCWAPQNCSTCRSEPPDRAAHAVSLLSISAVDGYFQKGLGLDFVFPSP
SSYSWSDRCADLASFLPHRRRQLLMVDFDEGEDVACKTVNLHTDVVLATPTPMLKSVNRTVGSGKVQPGKTGAVGGGRGLTASDGLGPRERAEGWHGRGRAGGRAKNTTP
SVLARPTCRFCLLGPSFSPISPRLSSIDGHGSEDKQLVPLRSCCGLPWMKIDNQKLQHKWHSKSQPLTTYEELTGQPGGDRTRKGTQRETIPTGRANVARPVRSASALGL
GRPLGPFARAESVCLRSAPTASSCLGPACFVPERLQTLGVRAAAQFPNLGIGGGVAYTTPVSSDSCWSCRSRLPQLLQIHCWCHVKGRWIAWPNFGINSWRRLWGRMLAS
SDRTSVVVCMMDRVQKPEGSLGYEQKAPSQTGVDLGMEKADPNFGDCCKVSCVHDAVDMTQEDDQRKNWRGSGKVRGRAFVTQFLGARNRQKPQINSLTVKQGLRESLKD
CINRFCNEVLQVEGHDDGVALTAMISGLQDERLLNSISKSSPWPKAEANQRPRSHRAVRNQEARFDRYTPLTASFEQVLAAIQDTNLFKRPEKLRSDPDRRNKNKYCMFH
GDHGHATRECIQLRDEIEALIRKGYPKGFVGNDKSKRPLPADQGKGGANPLFEIQTILGGSSEEGGAAHGEQKMSRECYFMALGNIDRKAQATSACGDGRGRAFEGSSYH
LPMKR