CuGenDBv2

Tan0013829 (gene) of Snake gourd v1 genome

Gene ID: Tan0013829
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG01:18597328..18601726
RNA-Seq Expression: Tan0013829
Synteny: Tan0013829
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type
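
The genome location field of this record is written as a sequence name followed by start and end coordinates separated by two dots. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `Locus` and `parse_locus` are hypothetical helpers introduced for illustration, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (hypothetical helper type for this example)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'SEQID:START..END' string such as the Genome location field."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(seqid, start, end)


locus = parse_locus("LG01:18597328..18601726")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the locus spans 4,399 bp, which is longer than the 1,710-nt CDS listed in the Sequences section.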


Homology
GenBank top hits (e value, % identity, alignment)
XP_023896927.1 uncharacterized protein LOC112008817 [Quercus suber] (e value: 2.5e-29; % identity: 30.74)
Query:  ENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWIKN
        + M DFR+ LD C L D GY G  FTW  +  N    W RLD  +  + +  K+   ++ HL   SSDH+P++   + + ++     +   RFE  W+K+
Subjt:  ENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWIKN

Query:  SEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKMT-ELEDLLEEDNTYWRQRSREEWL
           + +V                      C T+LK W+ + + G++R A+ +    +   E D    +     + ++ E+  L++ +   W QRS+ EWL
Subjt:  SEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKMT-ELEDLLEEDNTYWRQRSREEWL

Query:  QWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSP
        ++GD+NTK+FH RAT R KKN I G+ ++ G+W+  EE++G + TS+++ LF + +P
Subjt:  QWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSP

XP_023911628.1 uncharacterized protein LOC112023240 [Quercus suber] (e value: 9.4e-29; % identity: 32.44)
Query:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKG-LIRFEPGWIKNS
        ++ FRE L  C L D GY+G+ +TW  K      T  RLD  + N  +  ++   +V+HL  ++SDH P+   H  +  S TK   G   +FE  W+  +
Subjt:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKG-LIRFEPGWIKNS

Query:  EAKEIV-----------------------CLTELKRWNQRRIGGSIRG----AIQKKEAEI-KILESDLTQEKEEIWKRKMTELEDLLEEDNTYWRQRSR
        E  +++                       C  +L  W     G SI      AI++ + ++ ++ E++LT++ +  +     ++++LL++   YW QRSR
Subjt:  EAKEIV-----------------------CLTELKRWNQRRIGGSIRG----AIQKKEAEI-KILESDLTQEKEEIWKRKMTELEDLLEEDNTYWRQRSR

Query:  EEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPD
          WLQ GDRNTK+FH +A+ RR+KN I GI NS G W+ N EE+G++A+ +F  LF++++ D
Subjt:  EEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPD

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata] (e value: 1.0e-30; % identity: 31.85)
Query:  INSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPG
        I   + M DFR+ LD C L D G+ G  FTW  +  N    W RLD  +    +  K+   ++ HL   SSDH+P++   + + ++     K   RFE  
Subjt:  INSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPG

Query:  WIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKMT-ELEDLLEEDNTYWRQRSR
        W+ +   + +V                      C T+LK W+ + + G++R A+ +    + I E D    K     + ++ E+  L++ +   W QRS+
Subjt:  WIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKMT-ELEDLLEEDNTYWRQRSR

Query:  EEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDSTAIEEVI
         EWL++GD+NTK+FH RAT R KKN I G+ N+ G+WI  EE++G L T +++ LF + +P  T +E V+
Subjt:  EEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDSTAIEEVI

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata] (e value: 1.0e-30; % identity: 30.94)
Query:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWIKNSE
        M  FR+ LD+C L D G+ G+ FTW  +       W RLD  +  + +  ++   ++ HL  + SDH+PI      E K   ++ +   RFE  W+K+  
Subjt:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWIKNSE

Query:  AKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESD--LTQEKEEIWKRKMTELEDLLEEDNTYWRQRSREEWLQ
         ++++                          L+ WN R+  G +R ++ KK  E+K++E       +   ++  + TE+E L  ++   W+QRSR  WL+
Subjt:  AKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESD--LTQEKEEIWKRKMTELEDLLEEDNTYWRQRSREEWLQ

Query:  WGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDSTAIEEVI
         GDRNT +FH RAT R K+NLI G+ + +G W+  EE+LGR+   +F  +F S++P  +  EE++
Subjt:  WGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDSTAIEEVI

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata] (e value: 7.2e-29; % identity: 30.54)
Query:  EGSLENREVINSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYE
        E S +N  V  S   M  FR+ LD C  +D G+ G +FTW  +   G   WERLD  + N  +  ++   +V HL  Y+SDHRP+        K   ++ 
Subjt:  EGSLENREVINSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYE

Query:  KGLIRFEPGWIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEE-IWKRKMTELEDLLEED
        +   RFE  W+ N   K  V                      C   LKRW++    G+++  I+  + ++ + E +  Q  ++        EL  LLE++
Subjt:  KGLIRFEPGWIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEE-IWKRKMTELEDLLEED

Query:  NTYWRQRSREEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDS-----TAIEEVIENTF-YDFRKSWGA
           W QRSR +WLQ GD+NT++FH  AT R+++N IKG+ + +G W + E+    L T  + KLF+S++P +       +++V+ N+   D  K + A
Subjt:  NTYWRQRSREEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDS-----TAIEEVIENTF-YDFRKSWGA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EDY7 Reverse transcriptase domain-containing protein (e value: 7.0e-30; % identity: 32.52)
Query:  NKKARASEGSLENREVINSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEP
        N+  +A E    NR     M+    FR+ LDEC+LMD G++G  FTW    +   TTW RLD  + N+ +  K+S+  V HL   +SDH+ +    T EP
Subjt:  NKKARASEGSLENREVINSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEP

Query:  KSNTKYEKGLIRFEPGWIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKM-TEL
        ++   +++   RFE  W  ++  +E +                      C  +L  W++++  GSI   +++K  E    E +  Q       RK+ +E+
Subjt:  KSNTKYEKGLIRFEPGWIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKM-TEL

Query:  EDLLEEDNTYWRQRSREEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDSTAIEEVI
          LLE++   WRQRSR  WL+ GDRNT++FH RA+ RR++N I G+ +  G W   + E   L   HF  +FR++ P++  IEE +
Subjt:  EDLLEEDNTYWRQRSREEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDSTAIEEVI

A0A2N9I921 Reverse transcriptase domain-containing protein (e value: 5.8e-32; % identity: 32.3)
Query:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWIKNSE
        M  FRE LD+C  +D GY G  FTW     +G T WERLD  + +  + +++ Q +V HL +  SDH+P++        +  +      RFE  W+ +S 
Subjt:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWIKNSE

Query:  AKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQ-EKEEIWKRKMTELEDLLEEDNTYWRQRSREEWLQW
          E +                      C  +LK W++    GS+R  +Q+K  E+K+ E    Q +  ++      E+  LL ++   WRQRSR +WL+ 
Subjt:  AKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQ-EKEEIWKRKMTELEDLLEEDNTYWRQRSREEWLQW

Query:  GDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDS
        GDRNT +FH RAT R+++N I G+ +S G+W T+ +++  +  S+F  +F S++P S
Subjt:  GDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDS

A0A2N9IIR5 Uncharacterized protein (e value: 5.4e-30; % identity: 33.07)
Query:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWI----
        M  FR+VLDEC   D G+ G +FTW     NG T WERLD  ++N  +  ++ +  V H+    SDH P++         +    K L RFE  W+    
Subjt:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWI----

Query:  ----------KNSEAKEI--------VCLTELKRWNQRRIGGSIRGAIQKKEAEI---KILESDLTQEKEEIWKRKMTELEDLLEEDNTYWRQRSREEWL
                   NS+  ++        +C   L  W+ R+  GS+R  + +K +++   ++L        + +  R   EL  LL ++ T W QRSR  WL
Subjt:  ----------KNSEAKEI--------VCLTELKRWNQRRIGGSIRGAIQKKEAEI---KILESDLTQEKEEIWKRKMTELEDLLEEDNTYWRQRSREEWL

Query:  QWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSP
        + GDRNT++FH RA+ RR++N I G+ + SGDW    +++ RLA ++F  LFR+  P
Subjt:  QWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSP

A0A2N9IXK4 RNase H domain-containing protein (e value: 1.4e-30; % identity: 32.57)
Query:  INSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPG
        I S   M DFR+ +D C   D G+ G  FTW        T WERLD  L    + + +   QV HL   SSDH PI    +  P S  +  + + RFE  
Subjt:  INSMENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPG

Query:  WIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKM-TELEDLLEEDNTYWRQRSR
        W+ +   KE +                      C   L++W+ R   G++   ++KK   ++  ES+  + K       +  E+  LL  +   WRQRSR
Subjt:  WIKNSEAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKM-TELEDLLEEDNTYWRQRSR

Query:  EEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSP
        ++WL+WGD+NT +FH  AT RR++NLI  I ++ G+   +EE++ R    HF  LF S+ P
Subjt:  EEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSP

A0A2N9J109 Uncharacterized protein (e value: 6.4e-31; % identity: 33.33)
Query:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGL-IRFEPGWIKNS
        M  FR+ LD C  +D GY G  FTW     +G T WERLD  +    +   + Q +V HL +  SDH+P++      PKSN          FE  W+ + 
Subjt:  MDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGL-IRFEPGWIKNS

Query:  EAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKM-TELEDLLEEDNTYWRQRSREEWLQ
           E +                      C   LK W++    GSIR  +Q K  E+K+ E +  Q +  +    +  E+  LL ++   WRQRSR +WL+
Subjt:  EAKEIV----------------------CLTELKRWNQRRIGGSIRGAIQKKEAEIKILESDLTQEKEEIWKRKM-TELEDLLEEDNTYWRQRSREEWLQ

Query:  WGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDS
        +GDRNT +FH RAT R+++NLI G+ ++ G W T ++++  L TS+F  +F +++P S
Subjt:  WGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.1e-06; % identity: 21.43)
Query:  MENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTK----------YEKG
        M  +++F+  L + +L+D   +G  +TW    ++ P    +LD  + N  +F+ +     +      SDH P        PK + K          +   
Subjt:  MENMDDFREVLDECNLMDPGYQGHDFTWIRKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTK----------YEKG

Query:  LIRFEPGWIKN--------SEAKEIVCLTELKRWNQRRIGGSIR-------GAIQKKEAEIKILESDLTQEKEEIWKRKMTELEDLLEEDNTYWRQRSRE
        L+     W +         S  + +    +  +   R+  G+I+        +++  ++++    SD     E + ++K       LE   +++RQ+SR 
Subjt:  LIRFEPGWIKN--------SEAKEIVCLTELKRWNQRRIGGSIR-------GAIQKKEAEIKILESDLTQEKEEIWKRKMTELEDLLEEDNTYWRQRSRE

Query:  EWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNS----PDS
        +WLQ GD NT++FH      + KNLIK +       + N  ++  +  +++  L  S+S    PDS
Subjt:  EWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNS----PDS

AT2G33160.1 glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein (e value: 1.2e-05; % identity: 28.86)
Query:  ILWKIWNNRNDTVSDVNTDTDWDRWRRTTQMTINE----LTQNPISRARVRLGSQNIPQQWSPPTQGWWSLCTDASWSDELNSGGIGWMVRNWEGHVICA
        ILW++WN+RN  +        W+   +  QM + E    +T + I+     + S +IP  W  PT GW     D S+ +    G  GW+VR+  G    A
Subjt:  ILWKIWNNRNDTVSDVNTDTDWDRWRRTTQMTINE----LTQNPISRARVRLGSQNIPQQWSPPTQGWWSLCTDASWSDELNSGGIGWMVRNWEGHVICA

Query:  GHSCINTYWPILVMELFGIIKGMRSITDKGI-PLMVESDSLEAILLIEG
        G +  N     L  E   ++  M+    KG   +  E D+ E   LI G
Subjt:  GHSCINTYWPILVMELFGIIKGMRSITDKGI-PLMVESDSLEAILLIEG

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1) (e value: 8.6e-04; % identity: 21.97)
Query:  LWKIWNNRNDTVSDVNTDTDWDRWRRTTQ-------MTINELTQNPISRARVRLGSQNIPQQWSPPTQGWWSLCTDASWSDELNSGGIGWMVRNWEGHVI
        +W++W +RN+ +        W   ++  Q        TIN+ T N  S  +         ++WSPP +G+     D+ +    +     W++R+  GHVI
Subjt:  LWKIWNNRNDTVSDVNTDTDWDRWRRTTQ-------MTINELTQNPISRARVRLGSQNIPQQWSPPTQGWWSLCTDASWSDELNSGGIGWMVRNWEGHVI

Query:  CAGHSCINTYWPILVMELFGIIKGMRSITDKG
         +G + +   +  L  E  G +  ++ +  +G
Subjt:  CAGHSCINTYWPILVMELFGIIKGMRSITDKG

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 6.8e-09; % identity: 20.89)
Query:  YDYFWWIMDRGN-----KEELHVFIIILWKIWNNRNDTVSDVNTDTDWDRWRRTTQMTINELTQNPISRARVRLGSQNI---PQ-------QWSPPTQGW
        Y   +W+ + GN     ++   +   +LW++W NRN+ V          R R      +    ++ +   R+R  +++    PQ       +W PP   W
Subjt:  YDYFWWIMDRGN-----KEELHVFIIILWKIWNNRNDTVSDVNTDTDWDRWRRTTQMTINELTQNPISRARVRLGSQNI---PQ-------QWSPPTQGW

Query:  WSLCTDASWSDELNSGGIGWMVRNWEGHVICAGHSCINTYWPILVMELFGIIKGMRSITDKGIPLMVESDSLEAILLIEGKIEDCTEARDFTDIIHNMRE
            TDA+W+ +    GIGW++RN +G V   G   +     +L  EL  +   + S++      ++     + ++ I    E     +     +  +  
Subjt:  WSLCTDASWSDELNSGGIGWMVRNWEGHVICAGHSCINTYWPILVMELFGIIKGMRSITDKGIPLMVESDSLEAILLIEGKIEDCTEARDFTDIIHNMRE

Query:  DWNDIMFRHIPRSSNQEAHKLAQRA
         + ++ F  IPR  N  A ++A+ +
Subjt:  DWNDIMFRHIPRSSNQEAHKLAQRA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 6.6e-04; % identity: 24.77)
Query:  ILWKIWNNRNDTVSDVNTDTDWDRWRRTTQMTIN---ELTQNPISRARVRLGSQNIP----QQWSPPTQGWWSLCTDASWSDELNSGGIGWMVRNWEGHV
        ++W+IW + ND V + +T T   +++ T +M +N   E   N ++  + + G++N       +WSPP +       DAS  +     G+GW++RN +G V
Subjt:  ILWKIWNNRNDTVSDVNTDTDWDRWRRTTQMTIN---ELTQNPISRARVRLGSQNIP----QQWSPPTQGWWSLCTDASWSDELNSGGIGWMVRNWEGHV

Query:  ICAG------------HSCINTYWPILVMELFGIIKGMRSITDKGIPLMVESDSLEAILLIEGKIEDCTEARDFTDIIHNMREDWNDIMFRHIPRSSNQE
        I  G              C    W I     FG  K           ++ E D+     +I  K  +    + F D I +    +  I F    R  N  
Subjt:  ICAG------------HSCINTYWPILVMELFGIIKGMRSITDKGIPLMVESDSLEAILLIEGKIEDCTEARDFTDIIHNMREDWNDIMFRHIPRSSNQE

Query:  AHKLAQRASHLQQDEIWS
        A  LA++A  ++++  WS
Subjt:  AHKLAQRASHLQQDEIWS


Sequences
CDS sequence
ATGGGCCTAGATTTCAACATGGGTTGTGAAACAAATGAGAAGCAGCAAGCTCCTAGCTCCGGATCCAAAACTAAATTAAGCCTCCAAAAGAAGAATAAACCAATTGACGG
AGAGATGGGCCACAAAAATCACAAAAAGATTTCAGGAAAGAAGCACTTGCTGGAAACCAACACTGAAGCCCAATCAAATAAAAAAGCACGAGCTTCTGAAGGCTCGCTCG
AAAACAGGGAAGTTATTAATTCGATGGAAAATATGGATGATTTTAGAGAGGTGTTGGATGAATGCAACTTGATGGACCCGGGGTATCAAGGTCATGATTTCACTTGGATC
AGAAAGGTAAACAATGGTCCAACAACGTGGGAACGTTTGGATTGTTTCCTTCTGAACATAGGTTATTTTACTAAATGGAGCCAGTTTCAGGTTATTCACTTAGGTTTTTA
TTCATCAGACCATAGACCAATTTTTGGATATCATACAGGGGAACCTAAGTCTAACACCAAATATGAAAAAGGCCTCATAAGATTCGAACCTGGTTGGATAAAAAACTCAG
AAGCCAAAGAAATTGTGTGCTTAACAGAATTAAAACGATGGAATCAGCGTCGTATAGGTGGATCAATCCGAGGGGCCATCCAAAAGAAGGAGGCAGAAATAAAAATTTTA
GAAAGTGATTTAACACAGGAGAAAGAAGAGATCTGGAAACGTAAAATGACAGAATTGGAGGACTTACTTGAAGAAGATAATACTTATTGGCGTCAAAGATCAAGGGAAGA
ATGGTTACAATGGGGAGATCGCAACACAAAATGGTTTCACATTAGAGCGACGACCCGACGGAAGAAAAATCTCATCAAAGGTATTTTCAATAGTTCTGGTGACTGGATCA
CAAATGAAGAGGAACTGGGTAGGTTGGCAACCTCACATTTTGCCAAACTGTTCAGGTCAAACTCGCCCGATTCCACGGCAATTGAAGAGGTTATCGAGAATACTTTTTAT
GATTTCAGGAAATCCTGGGGCGCGTACGACTATTTTTGGTGGATAATGGACCGTGGCAATAAGGAGGAACTCCATGTTTTCATAATAATTTTGTGGAAAATTTGGAATAA
CAGAAATGACACTGTATCTGATGTGAATACCGATACTGATTGGGATAGATGGAGAAGAACCACACAAATGACCATTAATGAGTTAACTCAGAACCCCATCAGTCGAGCCC
GTGTTCGACTAGGAAGTCAAAATATCCCTCAACAATGGAGCCCTCCCACTCAGGGTTGGTGGAGCCTTTGTACGGATGCGTCCTGGAGTGACGAATTGAATAGTGGAGGT
ATTGGTTGGATGGTACGAAACTGGGAAGGTCATGTGATCTGTGCAGGACATTCTTGCATTAACACTTACTGGCCTATTTTGGTTATGGAATTATTTGGGATAATTAAAGG
AATGAGATCGATTACGGATAAAGGTATTCCCTTGATGGTGGAATCGGATTCGCTCGAAGCCATACTTTTGATAGAAGGAAAAATTGAAGATTGCACAGAAGCACGAGATT
TCACAGATATAATTCACAACATGCGAGAGGACTGGAATGACATCATGTTCCGGCACATCCCTCGATCATCGAACCAAGAAGCTCACAAGCTCGCACAGAGAGCATCTCAT
CTTCAACAAGACGAAATTTGGTCGGGGGAGTCTTTTGGACTCCATTACTTTATTAGTTAA
mRNA sequence
ATGGGCCTAGATTTCAACATGGGTTGTGAAACAAATGAGAAGCAGCAAGCTCCTAGCTCCGGATCCAAAACTAAATTAAGCCTCCAAAAGAAGAATAAACCAATTGACGG
AGAGATGGGCCACAAAAATCACAAAAAGATTTCAGGAAAGAAGCACTTGCTGGAAACCAACACTGAAGCCCAATCAAATAAAAAAGCACGAGCTTCTGAAGGCTCGCTCG
AAAACAGGGAAGTTATTAATTCGATGGAAAATATGGATGATTTTAGAGAGGTGTTGGATGAATGCAACTTGATGGACCCGGGGTATCAAGGTCATGATTTCACTTGGATC
AGAAAGGTAAACAATGGTCCAACAACGTGGGAACGTTTGGATTGTTTCCTTCTGAACATAGGTTATTTTACTAAATGGAGCCAGTTTCAGGTTATTCACTTAGGTTTTTA
TTCATCAGACCATAGACCAATTTTTGGATATCATACAGGGGAACCTAAGTCTAACACCAAATATGAAAAAGGCCTCATAAGATTCGAACCTGGTTGGATAAAAAACTCAG
AAGCCAAAGAAATTGTGTGCTTAACAGAATTAAAACGATGGAATCAGCGTCGTATAGGTGGATCAATCCGAGGGGCCATCCAAAAGAAGGAGGCAGAAATAAAAATTTTA
GAAAGTGATTTAACACAGGAGAAAGAAGAGATCTGGAAACGTAAAATGACAGAATTGGAGGACTTACTTGAAGAAGATAATACTTATTGGCGTCAAAGATCAAGGGAAGA
ATGGTTACAATGGGGAGATCGCAACACAAAATGGTTTCACATTAGAGCGACGACCCGACGGAAGAAAAATCTCATCAAAGGTATTTTCAATAGTTCTGGTGACTGGATCA
CAAATGAAGAGGAACTGGGTAGGTTGGCAACCTCACATTTTGCCAAACTGTTCAGGTCAAACTCGCCCGATTCCACGGCAATTGAAGAGGTTATCGAGAATACTTTTTAT
GATTTCAGGAAATCCTGGGGCGCGTACGACTATTTTTGGTGGATAATGGACCGTGGCAATAAGGAGGAACTCCATGTTTTCATAATAATTTTGTGGAAAATTTGGAATAA
CAGAAATGACACTGTATCTGATGTGAATACCGATACTGATTGGGATAGATGGAGAAGAACCACACAAATGACCATTAATGAGTTAACTCAGAACCCCATCAGTCGAGCCC
GTGTTCGACTAGGAAGTCAAAATATCCCTCAACAATGGAGCCCTCCCACTCAGGGTTGGTGGAGCCTTTGTACGGATGCGTCCTGGAGTGACGAATTGAATAGTGGAGGT
ATTGGTTGGATGGTACGAAACTGGGAAGGTCATGTGATCTGTGCAGGACATTCTTGCATTAACACTTACTGGCCTATTTTGGTTATGGAATTATTTGGGATAATTAAAGG
AATGAGATCGATTACGGATAAAGGTATTCCCTTGATGGTGGAATCGGATTCGCTCGAAGCCATACTTTTGATAGAAGGAAAAATTGAAGATTGCACAGAAGCACGAGATT
TCACAGATATAATTCACAACATGCGAGAGGACTGGAATGACATCATGTTCCGGCACATCCCTCGATCATCGAACCAAGAAGCTCACAAGCTCGCACAGAGAGCATCTCAT
CTTCAACAAGACGAAATTTGGTCGGGGGAGTCTTTTGGACTCCATTACTTTATTAGTTAA
Protein sequence
MGLDFNMGCETNEKQQAPSSGSKTKLSLQKKNKPIDGEMGHKNHKKISGKKHLLETNTEAQSNKKARASEGSLENREVINSMENMDDFREVLDECNLMDPGYQGHDFTWI
RKVNNGPTTWERLDCFLLNIGYFTKWSQFQVIHLGFYSSDHRPIFGYHTGEPKSNTKYEKGLIRFEPGWIKNSEAKEIVCLTELKRWNQRRIGGSIRGAIQKKEAEIKIL
ESDLTQEKEEIWKRKMTELEDLLEEDNTYWRQRSREEWLQWGDRNTKWFHIRATTRRKKNLIKGIFNSSGDWITNEEELGRLATSHFAKLFRSNSPDSTAIEEVIENTFY
DFRKSWGAYDYFWWIMDRGNKEELHVFIIILWKIWNNRNDTVSDVNTDTDWDRWRRTTQMTINELTQNPISRARVRLGSQNIPQQWSPPTQGWWSLCTDASWSDELNSGG
IGWMVRNWEGHVICAGHSCINTYWPILVMELFGIIKGMRSITDKGIPLMVESDSLEAILLIEGKIEDCTEARDFTDIIHNMREDWNDIMFRHIPRSSNQEAHKLAQRASH
LQQDEIWSGESFGLHYFIS