; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002120 (gene) of Snake gourd v1 genome

Gene IDTan0002120
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG06:12019142..12022309
RNA-Seq ExpressionTan0002120
SyntenyTan0002120
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7048395.1 unnamed protein product [Microthlaspi erraticum]1.2e-2832.16Show/hide
Query:  EEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRP-----VERNRGIQLARQLLERDLLEENIYWRQRARVDW
        +E IN+ W +     + D       L  K+ +C K ++ W+R  K NS  KI   ++++ +      V     + L   L      EE +YWRQ++RV W
Subjt:  EEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRP-----VERNRGIQLARQLLERDLLEENIYWRQRARVDW

Query:  LKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMG
        LK GDRN+ + H    QRR +N I+  K  +G WA  +  +E+  + YFQ+LFSS+  Q+  E    L++V  K+   +N  +T   T EE+ K +SEM 
Subjt:  LKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMG

Query:  PSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRLNSGQLEEFLVMCWCVWNERNRVVNQRKGEALGNLN
        P KAPG DG T++FY++       DV++   +    T SL+    TRLN   +      C     ER R +++ +  +L N++
Subjt:  PSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRLNSGQLEEFLVMCWCVWNERNRVVNQRKGEALGNLN

KAA3466274.1 reverse transcriptase [Gossypium australe]5.8e-2835.95Show/hide
Query:  KLQNCAKHLNSWERALKGNS---PYKISKAREEI---HRPVERNRGIQLARQLLERDLLEENIYWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGF
        KL+N  K L  WE  +K        K+SK  E +    R  +    I   R  L  ++  + IYW QRAR +WLK GD+NS + H  AS R+R N I+  
Subjt:  KLQNCAKHLNSWERALKGNS---PYKISKAREEI---HRPVERNRGIQLARQLLERDLLEENIYWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGF

Query:  KLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVL
        +L+NG   ++E ++ K  S +F+SLF+S    D   L   L+ ++ KI +++N L+ + FT EEV   L +MGP+KAPG DGF AMF+++       DV+
Subjt:  KLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVL

Query:  RFCMEALNGTLSLEDINLTRL-------NSGQLEEFLVMCWC
         FC+  LN   S   +N T +       N   L +F  +  C
Subjt:  RFCMEALNGTLSLEDINLTRL-------NSGQLEEFLVMCWC

KAA3468559.1 reverse transcriptase [Gossypium australe]9.9e-2831.3Show/hide
Query:  DCEEVINQS--------WTLGDT--DHLQDLGYSGA-RLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRPVERNRGIQLARQLLE------RDLL
        DCE  I+++        WT+ DT  + L+++  S +  +  KL+N    L  W   +KG       +  +E+   ++ +R      +++E       ++ 
Subjt:  DCEEVINQS--------WTLGDT--DHLQDLGYSGA-RLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRPVERNRGIQLARQLLE------RDLL

Query:  EENIYWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNP
        ++  YW QRARV+WL+ GDRN+ + H CA+ RRR N I    L NG   + E  +++E   YF++LF+S+   +  E+   L+ ++  I   +N  +  P
Subjt:  EENIYWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNP

Query:  FTVEEVIKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL
        FT EEVI  L  MGP+KAPG DGF  +F++K       +VL +C+  LN    ++ +N T +
Subjt:  FTVEEVIKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]3.6e-3028.53Show/hide
Query:  DCEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRPVERNRG------IQLARQLLERDLLEENIYWRQRAR
        DC+++I   W   ++ H  +   S   +  +L+ CA++L+ W + + GN P KI + +E ++  V  +R       I + R+ +   L  E I W+QR+R
Subjt:  DCEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRPVERNRG------IQLARQLLERDLLEENIYWRQRAR

Query:  VDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLS
        V WL  GDRN+ + HT AS RRR+N I+G   +NG W  +   + K    YFQ+++SS+      E+   L  +   +   +N  +   FT EE+   L+
Subjt:  VDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLS

Query:  EMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL-------NSGQLEEF--LVMCWCVWNERNRVVNQRKGEALGNLNDCNSR
        +M P+KAPG DG +A+F++K      ND++   ++ LN  +S+ +IN T +       N  ++ +F  + +C  V+   ++V+  R    L  +   N  
Subjt:  EMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL-------NSGQLEEF--LVMCWCVWNERNRVVNQRKGEALGNLNDCNSR

Query:  AILSKGNPKKASVVCLDCSDEVHELKGRKEKDVKWKPPNFSNFKINLDAAIDNL
        A LS        +V  +    +H L+ +KE         F+  K+++  A D +
Subjt:  AILSKGNPKKASVVCLDCSDEVHELKGRKEKDVKWKPPNFSNFKINLDAAIDNL

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]4.4e-2827.86Show/hide
Query:  MQIDCEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHR-------PVERNRGIQLARQLLERDLLE-ENIYW
        +Q +C  VI ++W  GD +         A +++K++ C   L +W  ++       I + ++++ R          +   + L++++   DLL+ + IYW
Subjt:  MQIDCEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHR-------PVERNRGIQLARQLLERDLLE-ENIYW

Query:  RQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEV
         QR+R++WL+ GDRN+ + H  ASQRRR+N+I G +   G W  N  ++ +  + YF +LF +    D  E C  L  V  K+   +   ++N FT EEV
Subjt:  RQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEV

Query:  IKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL-------NSGQLEEF--LVMCWCVWNERNRVVNQRKGEALGNLN
           L +MGP+KAPG DG  A+FY+K      + V+   ++ LN    L +IN T +       N  ++ EF  + +C  ++   ++V+  R  + L  + 
Subjt:  IKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL-------NSGQLEEF--LVMCWCVWNERNRVVNQRKGEALGNLN

Query:  DCNSRAILSKGNPKKASVVCLDCSDEVHELKGRKEKDVKWK
             A +         +V  +    +H  K  K+ DV  K
Subjt:  DCNSRAILSKGNPKKASVVCLDCSDEVHELKGRKEKDVKWK

TrEMBL top hitse value%identityAlignment
A0A2N9HWM9 Reverse transcriptase domain-containing protein8.7e-3035.32Show/hide
Query:  CEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAR---EEIH----RPVERNRGIQLARQLLERDLL-EENIYWRQRA
        CEEVI Q+W       +Q +G    RL QK++ C   L SW ++     P  I++ +   +EI+      V +  G  L R L  R LL +E IYWRQR+
Subjt:  CEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAR---EEIH----RPVERNRGIQLARQLLERDLL-EENIYWRQRA

Query:  RVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGL
        RV WL+ GDRN+ + H CA+QR++ N I G +  N  W ++++ +E+ V  YF  +++S+N      +    + V++ +   +N  +  PFT EEV   L
Subjt:  RVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGL

Query:  SEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL
         +M PSKAPG DG TA+F++K       DV    ++ LN    L+ +N T +
Subjt:  SEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL

A0A2N9J6I3 Uncharacterized protein2.3e-3035.32Show/hide
Query:  CEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAR---EEIH----RPVERNRGIQLARQLLERDLL-EENIYWRQRA
        CEEVI Q+W       +Q +G    RL QK++ C   L SW ++     P  I++ +   +EI+      V +  G  L R L  R LL +E IYWRQR+
Subjt:  CEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAR---EEIH----RPVERNRGIQLARQLLERDLL-EENIYWRQRA

Query:  RVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGL
        RV WL+ GDRN+ + H CA+QR++ N I G +  N  W ++++ +E+ V  YF  +++S+N      +    + V++ +   +N  +  PFT EEV + L
Subjt:  RVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGL

Query:  SEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL
         +M PSKAPG DG TA+F++K       DV    ++ LN    L+ +N T +
Subjt:  SEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL

A0A2N9J6K4 Reverse transcriptase domain-containing protein1.5e-2934.92Show/hide
Query:  CEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAR---EEIH----RPVERNRGIQLARQLLERDLL-EENIYWRQRA
        CEEVI  +W       +Q  G    RL QK++ C   L SW ++     P  I++ +   +EI+      V +  G  L R L  R LL +E IYWRQR+
Subjt:  CEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAR---EEIH----RPVERNRGIQLARQLLERDLL-EENIYWRQRA

Query:  RVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGL
        RV WL+ GDRN+ + H CA+QR++ N I G +  N  W ++++ +E+ V  YF  +++S+N      +    + V++ +   +N  +  PFT EEV + L
Subjt:  RVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGL

Query:  SEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL
         +M PSKAPG DG TA+F++K       DV    ++ LN    L+ +N T +
Subjt:  SEMGPSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRL

A0A5B6VB34 Reverse transcriptase2.8e-2835.95Show/hide
Query:  KLQNCAKHLNSWERALKGNS---PYKISKAREEI---HRPVERNRGIQLARQLLERDLLEENIYWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGF
        KL+N  K L  WE  +K        K+SK  E +    R  +    I   R  L  ++  + IYW QRAR +WLK GD+NS + H  AS R+R N I+  
Subjt:  KLQNCAKHLNSWERALKGNS---PYKISKAREEI---HRPVERNRGIQLARQLLERDLLEENIYWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGF

Query:  KLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVL
        +L+NG   ++E ++ K  S +F+SLF+S    D   L   L+ ++ KI +++N L+ + FT EEV   L +MGP+KAPG DGF AMF+++       DV+
Subjt:  KLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMGPSKAPGEDGFTAMFYKKILVDNWNDVL

Query:  RFCMEALNGTLSLEDINLTRL-------NSGQLEEFLVMCWC
         FC+  LN   S   +N T +       N   L +F  +  C
Subjt:  RFCMEALNGTLSLEDINLTRL-------NSGQLEEFLVMCWC

A0A6D2K684 Uncharacterized protein5.7e-2932.16Show/hide
Query:  EEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRP-----VERNRGIQLARQLLERDLLEENIYWRQRARVDW
        +E IN+ W +     + D       L  K+ +C K ++ W+R  K NS  KI   ++++ +      V     + L   L      EE +YWRQ++RV W
Subjt:  EEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRP-----VERNRGIQLARQLLERDLLEENIYWRQRARVDW

Query:  LKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMG
        LK GDRN+ + H    QRR +N I+  K  +G WA  +  +E+  + YFQ+LFSS+  Q+  E    L++V  K+   +N  +T   T EE+ K +SEM 
Subjt:  LKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMG

Query:  PSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRLNSGQLEEFLVMCWCVWNERNRVVNQRKGEALGNLN
        P KAPG DG T++FY++       DV++   +    T SL+    TRLN   +      C     ER R +++ +  +L N++
Subjt:  PSKAPGEDGFTAMFYKKILVDNWNDVLRFCMEALNGTLSLEDINLTRLNSGQLEEFLVMCWCVWNERNRVVNQRKGEALGNLN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.0e-0723.81Show/hide
Query:  YWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQ-YKIRNHLNSLMTNPFTV
        ++RQ++R+ WL+ GD N+ + H      + +N I   ++ +     N   +++ +  Y+  L  S+++   P+    +K +  ++  + L S ++   + 
Subjt:  YWRQRARVDWLKRGDRNSWWIHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQ-YKIRNHLNSLMTNPFTV

Query:  EEVIKGLSEMGPSKAPGEDGFTAMFY
        +E+   +  M  +KAPG D FTA F+
Subjt:  EEVIKGLSEMGPSKAPGEDGFTAMFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATAGATTGTGAGGAGGTAATTAACCAAAGTTGGACTCTAGGGGATACGGATCATCTTCAAGATTTGGGGTACTCAGGGGCACGGTTGAAACAGAAACTCCAAAA
CTGTGCTAAACATCTGAACAGTTGGGAGCGAGCCTTGAAAGGGAATTCCCCTTATAAAATCTCAAAAGCAAGAGAGGAGATTCATAGGCCAGTTGAGAGGAACAGAGGAA
TTCAATTGGCCAGACAATTACTTGAGAGGGACTTACTTGAAGAAAATATCTATTGGCGACAAAGAGCCAGAGTTGATTGGCTTAAACGAGGGGACAGAAATTCCTGGTGG
ATCCATACTTGTGCCTCCCAGAGAAGGCGTCAAAACTATATTTCTGGTTTTAAACTGAAAAATGGAGGATGGGCGTCAAATGAAATCGATATGGAAAAAGAGGTTAGTCA
GTATTTCCAATCTCTCTTTTCTTCCAATAATAATCAGGATCACCCTGAGCTCTGTTGTTTTTTAAAACATGTTCAGTACAAAATCCGAAATCACCTCAATTCTTTAATGA
CCAATCCCTTCACGGTGGAGGAAGTGATTAAAGGATTGAGTGAAATGGGACCATCAAAGGCACCTGGGGAAGATGGATTTACGGCTATGTTCTATAAAAAAATATTGGTC
GATAATTGGAATGATGTTCTTAGATTCTGTATGGAAGCTCTAAATGGAACTCTGTCTTTGGAAGACATAAACCTCACGCGTTTAAACTCAGGGCAATTAGAAGAATTTCT
TGTTATGTGTTGGTGTGTTTGGAATGAGAGAAACCGAGTTGTGAATCAGCGCAAGGGAGAGGCGCTTGGGAACCTAAACGACTGTAACTCACGCGCTATCCTATCTAAGG
GGAATCCAAAAAAAGCATCAGTCGTGTGCCTTGATTGCTCGGATGAAGTTCATGAACTCAAAGGTCGAAAGGAAAAGGATGTAAAGTGGAAACCTCCAAATTTTTCCAAC
TTTAAAATCAACTTAGATGCTGCTATCGACAACCTAAACAACAATGGTGCGTTAGGAGCTGTAATTCGAAACGAAAAAGGGGATGTGATGGCTATCTTTGTGAAGAAGTT
GCCCCGGGTGTTGCATGCTAAAACGGCGAAGGCATGGCAATGCGGGAGGCTCTGCAATCTGGACACCGGTTTCTCCTGTGTTGAAGTGGAATCAGATTGTGCTACAGTCG
CAACATGCTACGAAACAAAATGCTACCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATAGATTGTGAGGAGGTAATTAACCAAAGTTGGACTCTAGGGGATACGGATCATCTTCAAGATTTGGGGTACTCAGGGGCACGGTTGAAACAGAAACTCCAAAA
CTGTGCTAAACATCTGAACAGTTGGGAGCGAGCCTTGAAAGGGAATTCCCCTTATAAAATCTCAAAAGCAAGAGAGGAGATTCATAGGCCAGTTGAGAGGAACAGAGGAA
TTCAATTGGCCAGACAATTACTTGAGAGGGACTTACTTGAAGAAAATATCTATTGGCGACAAAGAGCCAGAGTTGATTGGCTTAAACGAGGGGACAGAAATTCCTGGTGG
ATCCATACTTGTGCCTCCCAGAGAAGGCGTCAAAACTATATTTCTGGTTTTAAACTGAAAAATGGAGGATGGGCGTCAAATGAAATCGATATGGAAAAAGAGGTTAGTCA
GTATTTCCAATCTCTCTTTTCTTCCAATAATAATCAGGATCACCCTGAGCTCTGTTGTTTTTTAAAACATGTTCAGTACAAAATCCGAAATCACCTCAATTCTTTAATGA
CCAATCCCTTCACGGTGGAGGAAGTGATTAAAGGATTGAGTGAAATGGGACCATCAAAGGCACCTGGGGAAGATGGATTTACGGCTATGTTCTATAAAAAAATATTGGTC
GATAATTGGAATGATGTTCTTAGATTCTGTATGGAAGCTCTAAATGGAACTCTGTCTTTGGAAGACATAAACCTCACGCGTTTAAACTCAGGGCAATTAGAAGAATTTCT
TGTTATGTGTTGGTGTGTTTGGAATGAGAGAAACCGAGTTGTGAATCAGCGCAAGGGAGAGGCGCTTGGGAACCTAAACGACTGTAACTCACGCGCTATCCTATCTAAGG
GGAATCCAAAAAAAGCATCAGTCGTGTGCCTTGATTGCTCGGATGAAGTTCATGAACTCAAAGGTCGAAAGGAAAAGGATGTAAAGTGGAAACCTCCAAATTTTTCCAAC
TTTAAAATCAACTTAGATGCTGCTATCGACAACCTAAACAACAATGGTGCGTTAGGAGCTGTAATTCGAAACGAAAAAGGGGATGTGATGGCTATCTTTGTGAAGAAGTT
GCCCCGGGTGTTGCATGCTAAAACGGCGAAGGCATGGCAATGCGGGAGGCTCTGCAATCTGGACACCGGTTTCTCCTGTGTTGAAGTGGAATCAGATTGTGCTACAGTCG
CAACATGCTACGAAACAAAATGCTACCATTAA
Protein sequenceShow/hide protein sequence
MQIDCEEVINQSWTLGDTDHLQDLGYSGARLKQKLQNCAKHLNSWERALKGNSPYKISKAREEIHRPVERNRGIQLARQLLERDLLEENIYWRQRARVDWLKRGDRNSWW
IHTCASQRRRQNYISGFKLKNGGWASNEIDMEKEVSQYFQSLFSSNNNQDHPELCCFLKHVQYKIRNHLNSLMTNPFTVEEVIKGLSEMGPSKAPGEDGFTAMFYKKILV
DNWNDVLRFCMEALNGTLSLEDINLTRLNSGQLEEFLVMCWCVWNERNRVVNQRKGEALGNLNDCNSRAILSKGNPKKASVVCLDCSDEVHELKGRKEKDVKWKPPNFSN
FKINLDAAIDNLNNNGALGAVIRNEKGDVMAIFVKKLPRVLHAKTAKAWQCGRLCNLDTGFSCVEVESDCATVATCYETKCYH