; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037607 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037607
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold2:50743912..50750281
RNA-Seq ExpressionSpg037607
SyntenySpg037607
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4362342.1 hypothetical protein F8388_008226 [Cannabis sativa]3.3e-2232.63Show/hide
Query:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNS--RISPET
        FTW+ + +G   ++ERLDRF       +LF S+ + + +  +S H P++A L   V   K  K KR  RF  + +  EEC +I+   W        S ++
Subjt:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNS--RISPET

Query:  YTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLIN-SEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK
               C  +L +WN+++  GSI   ++  +  +  L++ +      ++++  E  L+ LL  EE Y K RSR DWL  GD NTK+FH+KA+  KKKN 
Subjt:  YTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLIN-SEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK

Query:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPS
        I  ++   GQ +T EE++    +R     +SS  PS
Subjt:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPS

KAF4366593.1 hypothetical protein F8388_004257 [Cannabis sativa]5.6e-2232.27Show/hide
Query:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNS--RISPET
        FTW+ +  G   ++ERLDR+       ELF S+ + + +  +S H PI+A L       K  K KR  RF  + +   EC+EII + W S      + ++
Subjt:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNS--RISPET

Query:  YTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLIN-SEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK
              SC  +L  WN+ +  GS    ++  +  +  L++ S      E+++  E +L+ L   EE Y K RSR DWL  GD NTK+FH+KA+  KKKN 
Subjt:  YTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLIN-SEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK

Query:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV
        I  ++   G+ ++ EE++    +R     +SS  P+L  VE+   +V + V
Subjt:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV

KAF4399953.1 hypothetical protein G4B88_021167 [Cannabis sativa]1.0e-2328.44Show/hide
Query:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETYT
        FTW+ + +G   ++ERLDR+        LF S+ + + +  +S H PI+A+L   V + +  K KR  RF  + +   ECR+I+ + W S     P  + 
Subjt:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETYT

Query:  SKIIS----CLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSE-EFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKK
          I+     C  +L +WN+ +  GSI   ++  +  +  L++ +      ++++  E +L+ LL  EE Y + RSR DWL  GD NTK+FH+KA+  KKK
Subjt:  SKIIS----CLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSE-EFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKK

Query:  NKIEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAVLIAEDPWMTKLGSRVPCWVQDEWKNKKVSDLLEKNGGWKEECIKDIF
        N I  ++   G+  T EE++    +      +SS  PS   VE    ++ + +   E      +    P  V        V+DL+  +G W    +   F
Subjt:  NKIEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAVLIAEDPWMTKLGSRVPCWVQDEWKNKKVSDLLEKNGGWKEECIKDIF

Query:  IPMEAEEILSIPREDWTTKD
        + ++ + ILS  +  W  +D
Subjt:  IPMEAEEILSIPREDWTTKD

XP_024044123.1 uncharacterized protein LOC112100121 [Citrus clementina]2.5e-2233.48Show/hide
Query:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVL---SPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRIS--
        +TW  R  G   I+E+LDRF      T+ F++    +L   +S H+PIL  +   S KV   +  K+     +     +YE C+EII   W  N   S  
Subjt:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVL---SPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRIS--

Query:  --PETYTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWK
           +T+       L++L +W+RE   G  +   +  + + K   + ++  + E++++ E+++   L ++EIY K RSR DWL+ GD NTK+FHSKAS  K
Subjt:  --PETYTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWK

Query:  KKNKIEGLVDTAGQWVTDEEELGK
        +KNKI G+ +  G W  D+EE+ K
Subjt:  KKNKIEGLVDTAGQWVTDEEELGK

XP_030505522.1 uncharacterized protein LOC115720515 [Cannabis sativa]1.8e-2835.46Show/hide
Query:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPT-RFVGNCVNYEECREIILKHWDSNSRISPET
        +FTW K T G ++IKERLD  +     TE+F   ++ HL++++S H  ++  +  + +       KRP  RF        EC+ II   W S+S  S   
Subjt:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPT-RFVGNCVNYEECREIILKHWDSNSRISPET

Query:  YTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEF-TNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK
         TS I  C + LS W+R +  GS+   IK   D I  L N ++  T  E+L   E++L+ LL +EE+Y K RSR  WL+ GD NTK+FH KA   +K N 
Subjt:  YTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEF-TNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK

Query:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV
        I+GL++   +W +  E++ K         +++  PSLP +E+L + V+  V
Subjt:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV

TrEMBL top hitse value%identityAlignment
A0A2N9GII4 Uncharacterized protein5.0e-2432.27Show/hide
Query:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETY
        KFTW         +KERLDR +A    T LFN IS+ HL    S H PIL   + +    +    +R TRF      + +C  +I   W+         +
Subjt:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETY

Query:  --TSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK
          T KI  C   L+ W+++   GS Q  I+ + + ++ L N +   N   +   ++E++SLL  +EI+ K RSR  WL+ GD NTK+FH+ A+  ++ NK
Subjt:  --TSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK

Query:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV
        IEGL++  GQW+T+  +L    ++     ++S  P  P +E+  + V   V
Subjt:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV

A0A2N9HE04 Reverse transcriptase domain-containing protein5.0e-2432.27Show/hide
Query:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETY
        KFTW         +KERLDR +A    T LFN IS+ HL    S H PIL   + +    +    +R TRF      + +C  +I   W+         +
Subjt:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETY

Query:  --TSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK
          T KI  C   L+ W+++   GS Q  I+ + + ++ L N +   N   +   ++E++SLL  +EI+ K RSR  WL+ GD NTK+FH+ A+  ++ NK
Subjt:  --TSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK

Query:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV
        IEGL++  GQW+T+  +L    ++     ++S  P  P +E+  + V   V
Subjt:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV

A0A2N9IJF6 Uncharacterized protein5.0e-2432.27Show/hide
Query:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETY
        KFTW         +KERLDR +A    T LFN IS+ HL    S H PIL   + +    +    +R TRF      + +C  +I   W+         +
Subjt:  KFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETY

Query:  --TSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK
          T KI  C   L+ W+++   GS Q  I+ + + ++ L N +   N   +   ++E++SLL  +EI+ K RSR  WL+ GD NTK+FH+ A+  ++ NK
Subjt:  --TSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNK

Query:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV
        IEGL++  GQW+T+  +L    ++     ++S  P  P +E+  + V   V
Subjt:  IEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAV

A0A7J6HXD3 CCHC-type domain-containing protein5.0e-2428.44Show/hide
Query:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETYT
        FTW+ + +G   ++ERLDR+        LF S+ + + +  +S H PI+A+L   V + +  K KR  RF  + +   ECR+I+ + W S     P  + 
Subjt:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETYT

Query:  SKIIS----CLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSE-EFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKK
          I+     C  +L +WN+ +  GSI   ++  +  +  L++ +      ++++  E +L+ LL  EE Y + RSR DWL  GD NTK+FH+KA+  KKK
Subjt:  SKIIS----CLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSE-EFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKK

Query:  NKIEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAVLIAEDPWMTKLGSRVPCWVQDEWKNKKVSDLLEKNGGWKEECIKDIF
        N I  ++   G+  T EE++    +      +SS  PS   VE    ++ + +   E      +    P  V        V+DL+  +G W    +   F
Subjt:  NKIEGLVDTAGQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAVLIAEDPWMTKLGSRVPCWVQDEWKNKKVSDLLEKNGGWKEECIKDIF

Query:  IPMEAEEILSIPREDWTTKD
        + ++ + ILS  +  W  +D
Subjt:  IPMEAEEILSIPREDWTTKD

A0A803MS51 Uncharacterized protein1.4e-2335.48Show/hide
Query:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETYT
        FTW     GD  I+ERLDR+LA +   +LF S  ++HL    S H PIL  +   + T K  K K+  RF    +  E C EI+   W+       E   
Subjt:  FTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRISPETYT

Query:  SKIISCLSKLSSWNRERLKGSIQ--MAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNKI
        SKI    S LS+W+RE+    ++   A K K + + G   +EE     ++R  +  +D L + EE+Y K RSR+DWL+ GD NT +FH+K      +N+I
Subjt:  SKIISCLSKLSSWNRERLKGSIQ--MAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNKI

Query:  EGLVDTAGQWVTDEEEL
        +   + AG    DEE++
Subjt:  EGLVDTAGQWVTDEEEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.9e-0825Show/hide
Query:  ILWSLWNARNGCNQTNKTPDAQQIRRLIINSFEECEEAKRIYPVT--VQSENLPSHQHWIPPEQLCW-KLNVDAAWSEKRGDGGLVWAIRDSSGSLVGAG
        +LW LW +RN      K  DA ++ R  +  FEE    + +       Q E   S Q   PP Q  W K N DA W  +    G+ W +R+ SG ++  G
Subjt:  ILWSLWNARNGCNQTNKTPDAQQIRRLIINSFEECEEAKRIYPVT--VQSENLPSHQHWIPPEQLCW-KLNVDAAWSEKRGDGGLVWAIRDSSGSLVGAG

Query:  CVQIRRKWSINCLEGKAMIEGLVAFTNQFRHEDSRFKALEVESDSSDVVCVLNRKLEDMSELAIIAEEIKNLGLEARVVSFSKCPHSCNTLVHDLAQAAS
           + R  ++   E +A+   ++  +         +K +  ESD+  +V +LN   +    L    E+I+ L      V F   P   N +   +A+ + 
Subjt:  CVQIRRKWSINCLEGKAMIEGLVAFTNQFRHEDSRFKALEVESDSSDVVCVLNRKLEDMSELAIIAEEIKNLGLEARVVSFSKCPHSCNTLVHDLAQAAS

Query:  FHGDFQHFLFAL
           ++   LF++
Subjt:  FHGDFQHFLFAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGCAGAGAGGCAATTGTAGTAGAAGAAGGAATGAACAAATTCACATGGGTTAAAAGAACTAGGGGTGACCACATTATCAAGGAGAGGCTGGACCGTTTCCTAGC
TTTAAATGGGCTAACGGAATTGTTCAACAGCATAAGCATTAACCACTTGAATCACCATAATTCTTATCACAGCCCCATCCTAGCAGTCCTATCGCCAAAAGTAGAGACAA
TGAAAGGTGGCAAATGGAAAAGGCCCACCCGTTTCGTAGGGAACTGTGTCAATTACGAGGAATGCAGGGAAATTATCTTAAAGCACTGGGATTCGAACTCTAGGATTAGC
CCTGAAACTTACACGAGCAAGATTATCTCTTGTCTATCCAAATTGAGCTCTTGGAATAGGGAGAGATTAAAAGGATCCATTCAAATGGCTATTAAGAGAAAGGAAGACGT
GATTAAAGGCTTGATAAACAGCGAGGAGTTCACTAATGAAGAGAAGCTCAGGGTAGCAGAGAAAGAGTTAGATTCCCTTCTCAAAGAAGAAGAGATATACAGGAAATTCA
GATCTAGGGAGGATTGGTTGAGATGGGGCGACGGAAACACTAAATGGTTTCACTCGAAAGCAAGTCACTGGAAAAAGAAAAACAAAATCGAGGGTCTCGTAGACACGGCT
GGTCAATGGGTAACCGATGAGGAAGAGCTAGGGAAGGGCCGACAGCGTCGTGACACTGTGACGTATTCTTCGTGTTGCCCAAGTCTTCCCAGTGTCGAGAAGCTAAGGGA
CAGCGTCAGATCCGCGGTGCTTATTGCTGAGGACCCTTGGATGACCAAGCTTGGAAGCAGAGTCCCTTGCTGGGTTCAAGACGAGTGGAAAAACAAAAAAGTCAGTGATC
TCCTTGAGAAGAATGGAGGCTGGAAAGAGGAATGCATTAAGGATATTTTCATCCCTATGGAGGCCGAAGAAATTTTGTCCATTCCAAGGGAGGACTGGACGACGAAGGAC
TATTGGGAAGGGATTCGCAAGCTAGTGAATGGGGATCACATGGTGAAAGTGGTGCTAATCTTGTGGTCTCTATGGAATGCTAGAAACGGATGCAATCAAACCAACAAAAC
TCCAGATGCCCAACAGATTCGCAGATTGATTATCAACAGTTTCGAAGAATGCGAAGAAGCGAAAAGAATTTACCCGGTTACAGTGCAGTCGGAGAACCTGCCGAGTCATC
AACATTGGATCCCTCCGGAGCAGCTCTGCTGGAAATTGAACGTCGACGCCGCTTGGAGTGAGAAAAGGGGAGATGGAGGGCTTGTGTGGGCTATTCGTGACTCTTCAGGA
TCTCTGGTCGGAGCCGGATGTGTGCAAATTAGAAGAAAATGGTCGATCAATTGTCTAGAAGGGAAAGCGATGATTGAAGGTTTAGTGGCTTTCACTAACCAATTCAGGCA
TGAAGATAGTCGATTCAAGGCTCTCGAAGTTGAATCGGACTCGAGCGACGTCGTGTGCGTCCTCAACAGAAAGCTGGAAGATATGTCGGAGCTAGCGATCATTGCAGAAG
AGATAAAAAATTTGGGGCTCGAAGCGAGGGTGGTCTCCTTCTCCAAATGCCCGCATAGCTGTAACACTCTTGTGCACGATCTTGCTCAGGCTGCTTCCTTCCATGGCGAT
TTCCAGCATTTTTTGTTCGCTCTCTTCTCCCTGATAGGGAAGATGAAGCGTTTTGGAGGGAGGTCTGTCTCCCTTTTTGGTTTGCTTCAGTTATTACTGAAGGGTGTAAG
GGATGAAGTGACAGCCTTGAACGAGAGGGATGAATCTCTCGAGATGGTGAGATGGCGAAGCAATGAAGGAGGAGAAGAAGAACGATGGTGTGACAGTGTAGAAGAGGGAG
GACGAGAAGAAGAGGGGGTGAAACTACGAAAAGAGGGAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGCAGAGAGGCAATTGTAGTAGAAGAAGGAATGAACAAATTCACATGGGTTAAAAGAACTAGGGGTGACCACATTATCAAGGAGAGGCTGGACCGTTTCCTAGC
TTTAAATGGGCTAACGGAATTGTTCAACAGCATAAGCATTAACCACTTGAATCACCATAATTCTTATCACAGCCCCATCCTAGCAGTCCTATCGCCAAAAGTAGAGACAA
TGAAAGGTGGCAAATGGAAAAGGCCCACCCGTTTCGTAGGGAACTGTGTCAATTACGAGGAATGCAGGGAAATTATCTTAAAGCACTGGGATTCGAACTCTAGGATTAGC
CCTGAAACTTACACGAGCAAGATTATCTCTTGTCTATCCAAATTGAGCTCTTGGAATAGGGAGAGATTAAAAGGATCCATTCAAATGGCTATTAAGAGAAAGGAAGACGT
GATTAAAGGCTTGATAAACAGCGAGGAGTTCACTAATGAAGAGAAGCTCAGGGTAGCAGAGAAAGAGTTAGATTCCCTTCTCAAAGAAGAAGAGATATACAGGAAATTCA
GATCTAGGGAGGATTGGTTGAGATGGGGCGACGGAAACACTAAATGGTTTCACTCGAAAGCAAGTCACTGGAAAAAGAAAAACAAAATCGAGGGTCTCGTAGACACGGCT
GGTCAATGGGTAACCGATGAGGAAGAGCTAGGGAAGGGCCGACAGCGTCGTGACACTGTGACGTATTCTTCGTGTTGCCCAAGTCTTCCCAGTGTCGAGAAGCTAAGGGA
CAGCGTCAGATCCGCGGTGCTTATTGCTGAGGACCCTTGGATGACCAAGCTTGGAAGCAGAGTCCCTTGCTGGGTTCAAGACGAGTGGAAAAACAAAAAAGTCAGTGATC
TCCTTGAGAAGAATGGAGGCTGGAAAGAGGAATGCATTAAGGATATTTTCATCCCTATGGAGGCCGAAGAAATTTTGTCCATTCCAAGGGAGGACTGGACGACGAAGGAC
TATTGGGAAGGGATTCGCAAGCTAGTGAATGGGGATCACATGGTGAAAGTGGTGCTAATCTTGTGGTCTCTATGGAATGCTAGAAACGGATGCAATCAAACCAACAAAAC
TCCAGATGCCCAACAGATTCGCAGATTGATTATCAACAGTTTCGAAGAATGCGAAGAAGCGAAAAGAATTTACCCGGTTACAGTGCAGTCGGAGAACCTGCCGAGTCATC
AACATTGGATCCCTCCGGAGCAGCTCTGCTGGAAATTGAACGTCGACGCCGCTTGGAGTGAGAAAAGGGGAGATGGAGGGCTTGTGTGGGCTATTCGTGACTCTTCAGGA
TCTCTGGTCGGAGCCGGATGTGTGCAAATTAGAAGAAAATGGTCGATCAATTGTCTAGAAGGGAAAGCGATGATTGAAGGTTTAGTGGCTTTCACTAACCAATTCAGGCA
TGAAGATAGTCGATTCAAGGCTCTCGAAGTTGAATCGGACTCGAGCGACGTCGTGTGCGTCCTCAACAGAAAGCTGGAAGATATGTCGGAGCTAGCGATCATTGCAGAAG
AGATAAAAAATTTGGGGCTCGAAGCGAGGGTGGTCTCCTTCTCCAAATGCCCGCATAGCTGTAACACTCTTGTGCACGATCTTGCTCAGGCTGCTTCCTTCCATGGCGAT
TTCCAGCATTTTTTGTTCGCTCTCTTCTCCCTGATAGGGAAGATGAAGCGTTTTGGAGGGAGGTCTGTCTCCCTTTTTGGTTTGCTTCAGTTATTACTGAAGGGTGTAAG
GGATGAAGTGACAGCCTTGAACGAGAGGGATGAATCTCTCGAGATGGTGAGATGGCGAAGCAATGAAGGAGGAGAAGAAGAACGATGGTGTGACAGTGTAGAAGAGGGAG
GACGAGAAGAAGAGGGGGTGAAACTACGAAAAGAGGGAGAATAG
Protein sequenceShow/hide protein sequence
MQGREAIVVEEGMNKFTWVKRTRGDHIIKERLDRFLALNGLTELFNSISINHLNHHNSYHSPILAVLSPKVETMKGGKWKRPTRFVGNCVNYEECREIILKHWDSNSRIS
PETYTSKIISCLSKLSSWNRERLKGSIQMAIKRKEDVIKGLINSEEFTNEEKLRVAEKELDSLLKEEEIYRKFRSREDWLRWGDGNTKWFHSKASHWKKKNKIEGLVDTA
GQWVTDEEELGKGRQRRDTVTYSSCCPSLPSVEKLRDSVRSAVLIAEDPWMTKLGSRVPCWVQDEWKNKKVSDLLEKNGGWKEECIKDIFIPMEAEEILSIPREDWTTKD
YWEGIRKLVNGDHMVKVVLILWSLWNARNGCNQTNKTPDAQQIRRLIINSFEECEEAKRIYPVTVQSENLPSHQHWIPPEQLCWKLNVDAAWSEKRGDGGLVWAIRDSSG
SLVGAGCVQIRRKWSINCLEGKAMIEGLVAFTNQFRHEDSRFKALEVESDSSDVVCVLNRKLEDMSELAIIAEEIKNLGLEARVVSFSKCPHSCNTLVHDLAQAASFHGD
FQHFLFALFSLIGKMKRFGGRSVSLFGLLQLLLKGVRDEVTALNERDESLEMVRWRSNEGGEEERWCDSVEEGGREEEGVKLRKEGE