; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006359 (gene) of Snake gourd v1 genome

Gene IDTan0006359
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG07:66907981..66909286
RNA-Seq ExpressionTan0006359
SyntenyTan0006359
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4362436.1 hypothetical protein F8388_012228 [Cannabis sativa]3.1e-1826.43Show/hide
Query:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKR-----------------SALDYWDFMRRALAKEELG
        W+  W  ++ P +K+ GWK+    +P+  NL  +G+ +D   + C +F ES ++ +W C+  K+                 S +D     R+ LA+EE  
Subjt:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKR-----------------SALDYWDFMRRALAKEELG

Query:  KAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWS
          I +LW+IW  RN  ++NN P      L    F   S +   +Y   R   T                   T P  +    W  P  G + ++ DA+  
Subjt:  KAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWS

Query:  PKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDI-SSIFVESDSSEVINLLNLLDSDL
        P     GLG+I+RDW G  +  G   +    PV + EA A++  L+D    RP+ + +S  + SD  ++++ ++  DS L
Subjt:  PKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDI-SSIFVESDSSEVINLLNLLDSDL

KAF4363716.1 hypothetical protein G4B88_030215 [Cannabis sativa]3.1e-1826.43Show/hide
Query:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKR-----------------SALDYWDFMRRALAKEELG
        W+  W  ++ P +K+ GWK+    +P+  NL  +G+ +D   + C +F ES ++ +W C+  K+                 S +D     R+ LA+EE  
Subjt:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKR-----------------SALDYWDFMRRALAKEELG

Query:  KAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWS
          I +LW+IW  RN  ++NN P      L    F   S +   +Y   R   T                   T P  +    W  P  G + ++ DA+  
Subjt:  KAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWS

Query:  PKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDI-SSIFVESDSSEVINLLNLLDSDL
        P     GLG+I+RDW G  +  G   +    PV + EA A++  L+D    RP+ + +S  + SD  ++++ ++  DS L
Subjt:  PKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDI-SSIFVESDSSEVINLLNLLDSDL

XP_010693635.1 PREDICTED: uncharacterized protein LOC104906563 [Beta vulgaris subsp. vulgaris]1.1e-1828.14Show/hide
Query:  LWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKF------------------------AKRSALDYWDFMRRALAK
        +W  DV P +KI  WK+ ND +P+   LEK  + +     LC    E+ T+L   C F                        A  S LD    ++ +L K
Subjt:  LWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKF------------------------AKRSALDYWDFMRRALAK

Query:  EELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINID
        ++L   + + W IW +RN + FN    S     F++                     +    R   +E  +G     SP S  S  W PP  G  KIN D
Subjt:  EELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINID

Query:  ASWSPKLNRG--GLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI
         S   KL+ G   LG++ RD  G  L  G   + C   V   EA  +L+ ++ A   R +DIS++ +E D+  VIN +N +     EI  ++ DI
Subjt:  ASWSPKLNRG--GLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI

XP_021855862.1 uncharacterized protein LOC110795183 [Spinacia oleracea]4.1e-1824.92Show/hide
Query:  SVWQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKF---------------AKRSALDYW--DFMRRALAKEE
        S W+  WG  +M  +K+  WK++   +P A +L+ +G+ +D A S C ++PE+  +L W+C                 +   +L+YW   FM    +   
Subjt:  SVWQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKF---------------AKRSALDYW--DFMRRALAKEE

Query:  ----LGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKIN
            L K I +LWS+W  RN++RF NA      LL  V       W      +  R+A        P  + AR  +     L    F W         + 
Subjt:  ----LGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKIN

Query:  IDASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDILALTC
         D +W+   N  GLGWI +D R     GG             E  A L G++ A+++     + + + SDS+ +++L+         I + ++D+  L  
Subjt:  IDASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDILALTC

Query:  GNPLF
           +F
Subjt:  GNPLF

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]9.1e-1827.71Show/hide
Query:  RKFPESTTYLMWECKFAKR--------------------SALDYWDFMRRALAKEELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHH
        RK  E+T +++WECK  K                     +  +YW+++     +EE  ++++I   IW  RN   F         +  ++  Y+ +    
Subjt:  RKFPESTTYLMWECKFAKR--------------------SALDYWDFMRRALAKEELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHH

Query:  HQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAIL
            L R++  + P  R+     AR               WKPP   +WK+N DA+W    N  G+GWI RD +G  +  GC ++  ER +  LE +AI 
Subjt:  HQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAIL

Query:  KGLKDALSKRPVDISSIFVESDSSEVINLLN
        +GL+   + R      I +ESDS E I+LL+
Subjt:  KGLKDALSKRPVDISSIFVESDSSEVINLLN

TrEMBL top hitse value%identityAlignment
A0A1R3GKQ5 Reverse transcriptase1.9e-1626.21Show/hide
Query:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFA-----------------KRSALDYWDFM-RRALAKEEL
        W+ +W A+V+P V+   W +V +I+P+  NL  +G+++     +C    EST +  + C FA                   S  D W F+  +A    +L
Subjt:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFA-----------------KRSALDYWDFM-RRALAKEEL

Query:  GKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASW
         K    +W +W  RN   F         L+ SV +++       + GL R+                            R  +W  P  G WKIN DAS+
Subjt:  GKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASW

Query:  SPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI
        S      GLG + RD+ G+ L  G  ++         E  AIL G + AL      I+   +ESDS   I+ +N  ++ L E   ++E+I
Subjt:  SPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI

A0A5B7A5C8 Uncharacterized protein1.9e-2125.52Show/hide
Query:  VWQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAK-------------RSALDY---WDFMRRALAKEELG
        +W+ LW   +   +KI  W+ + DI+P+   L ++ I VD    LC    E+  +L  +C + +             +  LD+   ++F+  +L   E+ 
Subjt:  VWQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAK-------------RSALDY---WDFMRRALAKEELG

Query:  KAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRS-PWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASW
           ++LW +W +RN +  +       G+      YL+      H+ G++            P R           P+ + S  W PP +G +K+NID SW
Subjt:  KAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRS-PWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASW

Query:  SPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI
         P  N GG+G + RDW+G  + G    +         +A+AIL G+   L  R + I  + VE D   VI+ +     DL+++  +++DI
Subjt:  SPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI

A0A5C7H0P0 Uncharacterized protein3.7e-1724.41Show/hide
Query:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKRSALDYWDFMRRALAKEELGKAIM-----ILWSIWCY
        W+ LW  ++    KI  WK  N  +P+   L ++ + V     +C    ES T+++W C     SA++ W   R+ L  + + + ++     I+ S+W  
Subjt:  WQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKRSALDYWDFMRRALAKEELGKAIM-----ILWSIWCY

Query:  RNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSF-----QWKPPCEGAWKINIDASWSPKLNRGG
                   S+D ++F++          ++  ++  +A W     + + +       + + L+ +        WK P  G +KIN DAS+  +  + G
Subjt:  RNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSF-----QWKPPCEGAWKINIDASWSPKLNRGG

Query:  LGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDILALTCGNPLFFFV
        +G I RD++G  +    + V C   V++LEA A L+G+  A+    + +S + +ESD++ VI LL+      TE+  ++   LAL     L  +V
Subjt:  LGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDILALTCGNPLFFFV

A0A6J1CP26 uncharacterized protein LOC1110134122.9e-1730.41Show/hide
Query:  KEELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINI
        +EE  ++++I W IW  RN   F    P    +  ++  Y+             RN        L  +   +    I         QWKPP   +WK+N 
Subjt:  KEELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINI

Query:  DASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI
        +A+W    N GG+GWI RD +G  +   C ++  ER +  LE +AI +GL+   + R      I +ESDS E I+LL+    D TEI++++E+I
Subjt:  DASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDI

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X14.4e-1827.71Show/hide
Query:  RKFPESTTYLMWECKFAKR--------------------SALDYWDFMRRALAKEELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHH
        RK  E+T +++WECK  K                     +  +YW+++     +EE  ++++I   IW  RN   F         +  ++  Y+ +    
Subjt:  RKFPESTTYLMWECKFAKR--------------------SALDYWDFMRRALAKEELGKAIMILWSIWCYRNHVRFNNAPPSIDGLLFSVFFYLRSPWHH

Query:  HQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAIL
            L R++  + P  R+     AR               WKPP   +WK+N DA+W    N  G+GWI RD +G  +  GC ++  ER +  LE +AI 
Subjt:  HQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWSPKLNRGGLGWIFRDWRGRPLFGGCTVVTCERPVKLLEALAIL

Query:  KGLKDALSKRPVDISSIFVESDSSEVINLLN
        +GL+   + R      I +ESDS E I+LL+
Subjt:  KGLKDALSKRPVDISSIFVESDSSEVINLLN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.5e-0629.23Show/hide
Query:  DLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKRSAL
        D+W   + P +K+  WK +N+ +P    L  + I ++   + CR F E+ T++++ C FA+R  +
Subjt:  DLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKRSAL

AT4G29090.1 Ribonuclease H-like superfamily protein4.1e-0823.05Show/hide
Query:  VWQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKRSALDYWDFMRRAL-AKEELGKAIMI--LWSIWCYR
        ++Q +W +   P ++   WK +++ +P A  L  + +  + A   C    E+  +L+++C FA+ +    W      +    E   +I +   W      
Subjt:  VWQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKRSALDYWDFMRRAL-AKEELGKAIMI--LWSIWCYR

Query:  NHVRFNNAPPSIDGLLFSVF-----FYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSF--QWKPPCEGAWKINIDASWSPKLNRG
         + ++  A   +  LL+ ++        R    + Q  ++RR       +R+     + G    T P   RS   +W+PP     K N DA+W+    R 
Subjt:  NHVRFNNAPPSIDGLLFSVF-----FYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSF--QWKPPCEGAWKINIDASWSPKLNRG

Query:  GLGWIFRDWRGRPLFGGCTVVTCERPVKLLEAL-AILKGLKDA-LSKRPVDISSIFVESDSSEVINLLN
        G+GW+ R+ +G   + G   +      KL   L A L+ ++ A LS      + +  ESDS  +I +LN
Subjt:  GLGWIFRDWRGRPLFGGCTVVTCERPVKLLEAL-AILKGLKDA-LSKRPVDISSIFVESDSSEVINLLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATCGAAAACGGCAGGCATGTTTGATATATTCTGTGTGGCAGGATCTCTGGGGGGCAGACGTTATGCCCCATGTCAAGATAGCAGGCTGGAAAATTGTTAATGA
TATCATCCCATCCGCGTTGAATTTGGAAAAGAAGGGGATTCATGTGGATATGGCTTACTCCCTCTGTCGGAAGTTTCCTGAATCAACAACCTACCTCATGTGGGAGTGTA
AGTTCGCGAAGAGGAGCGCGTTGGACTATTGGGACTTCATGAGGAGGGCGCTGGCTAAAGAAGAGCTTGGAAAGGCGATTATGATCCTCTGGAGCATTTGGTGTTACAGG
AACCATGTCCGTTTCAACAACGCTCCCCCATCAATTGATGGCCTTCTCTTCTCAGTTTTTTTCTACCTTAGATCGCCATGGCATCATCACCAATATGGACTCATCAGAAG
GAATGCAACATGGCCCCCCTGCTTTAGATTGCCATATCGGGAAGCCGCGCGTGGGTGGGCCTTTATCACCTCCCCCCTTTCTCTGCGTTCTTTCCAGTGGAAACCTCCCT
GTGAAGGGGCTTGGAAGATCAATATTGATGCTTCTTGGTCTCCCAAGCTCAATCGCGGTGGGTTAGGTTGGATTTTTCGTGATTGGAGAGGTCGTCCCTTGTTTGGAGGA
TGCACCGTGGTGACTTGCGAGAGACCAGTTAAACTTCTTGAGGCGCTAGCCATCCTTAAAGGTCTGAAGGACGCCCTTTCCAAAAGACCGGTTGACATCTCATCCATTTT
TGTTGAGTCGGACTCTAGTGAGGTTATTAATTTGCTTAACCTTCTTGATTCTGACTTGACTGAAATTCTTTTTGTTGTCGAGGATATTCTTGCTCTTACTTGCGGTAACC
CCTTATTCTTTTTTGTAACATCCCTAGAGAAGAGAACAAAACTGCCCATTCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATCGAAAACGGCAGGCATGTTTGATATATTCTGTGTGGCAGGATCTCTGGGGGGCAGACGTTATGCCCCATGTCAAGATAGCAGGCTGGAAAATTGTTAATGA
TATCATCCCATCCGCGTTGAATTTGGAAAAGAAGGGGATTCATGTGGATATGGCTTACTCCCTCTGTCGGAAGTTTCCTGAATCAACAACCTACCTCATGTGGGAGTGTA
AGTTCGCGAAGAGGAGCGCGTTGGACTATTGGGACTTCATGAGGAGGGCGCTGGCTAAAGAAGAGCTTGGAAAGGCGATTATGATCCTCTGGAGCATTTGGTGTTACAGG
AACCATGTCCGTTTCAACAACGCTCCCCCATCAATTGATGGCCTTCTCTTCTCAGTTTTTTTCTACCTTAGATCGCCATGGCATCATCACCAATATGGACTCATCAGAAG
GAATGCAACATGGCCCCCCTGCTTTAGATTGCCATATCGGGAAGCCGCGCGTGGGTGGGCCTTTATCACCTCCCCCCTTTCTCTGCGTTCTTTCCAGTGGAAACCTCCCT
GTGAAGGGGCTTGGAAGATCAATATTGATGCTTCTTGGTCTCCCAAGCTCAATCGCGGTGGGTTAGGTTGGATTTTTCGTGATTGGAGAGGTCGTCCCTTGTTTGGAGGA
TGCACCGTGGTGACTTGCGAGAGACCAGTTAAACTTCTTGAGGCGCTAGCCATCCTTAAAGGTCTGAAGGACGCCCTTTCCAAAAGACCGGTTGACATCTCATCCATTTT
TGTTGAGTCGGACTCTAGTGAGGTTATTAATTTGCTTAACCTTCTTGATTCTGACTTGACTGAAATTCTTTTTGTTGTCGAGGATATTCTTGCTCTTACTTGCGGTAACC
CCTTATTCTTTTTTGTAACATCCCTAGAGAAGAGAACAAAACTGCCCATTCTCTAG
Protein sequenceShow/hide protein sequence
MENRKRQACLIYSVWQDLWGADVMPHVKIAGWKIVNDIIPSALNLEKKGIHVDMAYSLCRKFPESTTYLMWECKFAKRSALDYWDFMRRALAKEELGKAIMILWSIWCYR
NHVRFNNAPPSIDGLLFSVFFYLRSPWHHHQYGLIRRNATWPPCFRLPYREAARGWAFITSPLSLRSFQWKPPCEGAWKINIDASWSPKLNRGGLGWIFRDWRGRPLFGG
CTVVTCERPVKLLEALAILKGLKDALSKRPVDISSIFVESDSSEVINLLNLLDSDLTEILFVVEDILALTCGNPLFFFVTSLEKRTKLPIL